4.5 Article

Block-Coordinate Gradient Descent Method for Linearly Constrained Nonsmooth Separable Optimization

Journal

JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS
Volume 140, Issue 3, Pages 513-535

Publisher

SPRINGER/PLENUM PUBLISHERS
DOI: 10.1007/s10957-008-9458-3

Keywords

Nonsmooth optimization; Linear constraints; Support vector machines; Bilevel optimization; l(1)-regularization; Coordinate gradient descent; Global convergence; Linear convergence rate; Complexity bound

Funding

  1. National Science Foundation [DMS-0511283]

Ask authors/readers for more resources

We consider the problem of minimizing the weighted sum of a smooth function f and a convex function P of n real variables subject to m linear equality constraints. We propose a block-coordinate gradient descent method for solving this problem, with the coordinate block chosen by a Gauss-Southwell-q rule based on sufficient predicted descent. We establish global convergence to first-order stationarity for this method and, under a local error bound assumption, linear rate of convergence. If f is convex with Lipschitz continuous gradient, then the method terminates in O(n (2)/epsilon) iterations with an epsilon-optimal solution. If P is separable, then the Gauss-Southwell-q rule is implementable in O(n) operations when m=1 and in O(n (2)) operations when m > 1. In the special case of support vector machines training, for which f is convex quadratic, P is separable, and m=1, this complexity bound is comparable to the best known bound for decomposition methods. If f is convex, then, by gradually reducing the weight on P to zero, the method can be adapted to solve the bilevel problem of minimizing P over the set of minima of f+delta (X) , where X denotes the closure of the feasible set. This has application in the least 1-norm solution of maximum-likelihood estimation.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available