TY - JOUR
T1 - An accelerated randomized proximal coordinate gradient method and its application to regularized empirical risk minimization
AU - Lin, Qihang
AU - Lu, Zhaosong
AU - Xiao, Lin
N1 - Publisher Copyright:
© 2015 Society for Industrial and Applied Mathematics.
PY - 2015
Y1 - 2015
AB - We consider the problem of minimizing the sum of two convex functions: one is smooth and given by a gradient oracle, and the other is separable over blocks of coordinates and has a simple known structure over each block. We develop an accelerated randomized proximal coordinate gradient (APCG) method for minimizing such convex composite functions. For strongly convex functions, our method achieves faster linear convergence rates than existing randomized proximal coordinate gradient methods. Without strong convexity, our method enjoys accelerated sublinear convergence rates. We show how to apply the APCG method to solve the regularized empirical risk minimization (ERM) problem and devise efficient implementations that avoid full-dimensional vector operations. For ill-conditioned ERM problems, our method obtains improved convergence rates compared with the state-of-the-art stochastic dual coordinate ascent method.
KW - Accelerated proximal gradient method
KW - Convex optimization
KW - Coordinate descent method
KW - Empirical risk minimization
KW - Randomized algorithm
UR - http://www.scopus.com/inward/record.url?scp=84953335004&partnerID=8YFLogxK
U2 - 10.1137/141000270
DO - 10.1137/141000270
M3 - Article
AN - SCOPUS:84953335004
VL - 25
SP - 2244
EP - 2273
JO - SIAM Journal on Optimization
JF - SIAM Journal on Optimization
SN - 1052-6234
IS - 4
ER -