The analysis of decomposition methods for support vector machines
1999
https://doi.org/10.1109/72.857780
15 pages
Abstract
The support vector machine (SVM) is a new and promising technique for pattern recognition. It requires the solution of a large dense quadratic programming problem. Traditional optimization methods cannot be directly applied due to memory restrictions. Up to now, very few methods can handle the memory problem, and an important one is the "decomposition method." However, there is no convergence proof so far.
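For orientation (standard notation, not necessarily the paper's), the quadratic program in question is the SVM dual

    \min_{\alpha} \; \tfrac{1}{2}\alpha^{T} Q \alpha - e^{T}\alpha
    \quad \text{s.t.} \quad y^{T}\alpha = 0, \; 0 \le \alpha_i \le C, \; i = 1, \dots, l,

where Q_{ij} = y_i y_j K(x_i, x_j) is dense and l-by-l, so it cannot be stored for large l. A decomposition method keeps the variables outside a small working set B fixed and, writing N for the complement of B, repeatedly solves the subproblem

    \min_{\alpha_B} \; \tfrac{1}{2}\alpha_B^{T} Q_{BB} \alpha_B + (Q_{BN}\alpha_N - e_B)^{T}\alpha_B
    \quad \text{s.t.} \quad y_B^{T}\alpha_B = -y_N^{T}\alpha_N, \; 0 \le \alpha_B \le C,

which involves only |B| variables and |B| rows of Q.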
Related papers
Optimization Methods and Software, 2005
This work deals with special decomposition techniques for the large quadratic program arising in training Support Vector Machines. These approaches split the problem into a sequence of quadratic programming subproblems which can be solved by efficient gradient projection methods recently proposed. By decomposing into much larger subproblems than standard decomposition packages, these techniques show promising performance and are well suited for parallelization. Here, we discuss a crucial aspect for their effectiveness: the selection of the working set, that is, the index set of the variables to be optimized at each step through the quadratic programming subproblem. We analyse the most popular working set selections and develop a new selection strategy that improves the convergence rate of the decomposition schemes based on large sized working sets. The effectiveness of the proposed strategy within the gradient projection-based decomposition techniques is shown by numerical experiments on large benchmark problems, both in serial and parallel environments.
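A minimal sketch of one popular selection rule of the kind analysed here, choosing the q variables whose optimality conditions are most violated (the classical SVMlight-style rule, shown only for illustration; the paper's improved strategy differs):

    import numpy as np

    def select_working_set(alpha, grad, y, C, q):
        """Pick q indices with the largest violation of the optimality
        conditions (SVMlight-style rule, illustrative only).
        alpha : current dual variables     grad : Q @ alpha - e
        y     : labels in {-1, +1}         C    : upper bound; q even
        """
        up   = ((alpha < C) & (y > 0)) | ((alpha > 0) & (y < 0))  # can move "up"
        down = ((alpha < C) & (y < 0)) | ((alpha > 0) & (y > 0))  # can move "down"
        v = -y * grad                      # violation measure -y_i grad_i
        B = []
        for i in np.argsort(-v):           # largest -y_i grad_i first
            if up[i]:
                B.append(i)
            if len(B) == q // 2:
                break
        for j in np.argsort(v):            # smallest -y_i grad_i first
            if down[j] and j not in B:
                B.append(j)
            if len(B) == q:
                break
        return np.array(B)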
Advances in Parallel Computing, 2004
We consider parallel decomposition techniques for solving the large quadratic programming (QP) problems arising in training support vector machines. A recent technique is improved by introducing an efficient solver for the inner QP subproblems and a preprocessing step useful to hot start the decomposition strategy. The effectiveness of the proposed improvements is evaluated by solving large-scale benchmark problems on different parallel architectures.
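The paper's preprocessing step is not reproduced here; purely as an illustration of the hot-start idea, one common heuristic is to solve a small subsample first and seed the initial working set with its support vectors (train_small_svm is a hypothetical user-supplied routine):

    import numpy as np

    def hot_start(X, y, train_small_svm, subsample=2000, seed=0):
        """Illustrative hot start: fit an SVM on a random subsample and use
        its support vectors to seed the first working set. The alphas of
        the full problem start at zero so y @ alpha = 0 holds trivially."""
        rng = np.random.default_rng(seed)
        idx = rng.choice(len(y), size=min(subsample, len(y)), replace=False)
        alpha_small = train_small_svm(X[idx], y[idx])   # hypothetical solver
        first_B = idx[alpha_small > 1e-8]               # support-vector seeds
        return np.zeros(len(y)), first_B                # (alpha0, working set)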
Lecture Notes in Computer Science, 2005
We consider a parallel decomposition technique for solving the large quadratic programs arising in training the learning methodology Support Vector Machine. At each iteration of the technique a subset of the variables is optimized through the solution of a quadratic programming subproblem. This inner subproblem is solved in parallel by a special gradient projection method. In this paper we consider some improvements to the inner solver: a new algorithm for the projection onto the feasible region of the optimization subproblem and new linesearch and steplength selection strategies for the gradient projection scheme. The effectiveness of the proposed improvements is evaluated, both in terms of execution time and relative speedup, by solving large-scale benchmark problems on a parallel architecture.
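The special projection mentioned here is onto the intersection of a box with a single linear equality constraint. A minimal bisection version is sketched below to show the structure (the cited work uses faster secant-based root finders):

    import numpy as np

    def project(z, a, b, u, tol=1e-10, max_iter=100):
        """Project z onto {x : 0 <= x <= u, a @ x = b}, the feasible region
        of the SVM subproblem (a is the label vector there). The projection
        is x(lam) = clip(z + lam * a, 0, u) for the multiplier lam solving
        a @ x(lam) = b; since a @ x(lam) is nondecreasing in lam, plain
        bisection finds it (assumes the feasible set is nonempty)."""
        def r(lam):
            return a @ np.clip(z + lam * a, 0.0, u) - b
        lo, hi = -1.0, 1.0
        while r(lo) > 0:                    # widen the bracket if needed
            lo *= 2.0
        while r(hi) < 0:
            hi *= 2.0
        mid = 0.5 * (lo + hi)
        for _ in range(max_iter):
            mid = 0.5 * (lo + hi)
            if abs(r(mid)) < tol:
                break
            if r(mid) < 0:
                lo = mid
            else:
                hi = mid
        return np.clip(z + mid * a, 0.0, u)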
2001
The dual formulation of support vector regression involves two closely related sets of variables. When the decomposition method is used, many existing approaches use pairs of indices from these two sets as the working set. Basically they select a base set first and then expand it so that all indices are pairs. This makes the implementation different from that for support vector classification. In addition, a larger optimization sub-problem has to be solved in each iteration.
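Concretely, the two sets of variables are the \alpha_i and \alpha_i^* of the standard SVR dual (standard notation, for orientation):

    \min_{\alpha, \alpha^*} \; \tfrac{1}{2}(\alpha - \alpha^*)^{T} Q (\alpha - \alpha^*)
      + \varepsilon \sum_{i=1}^{l} (\alpha_i + \alpha_i^*)
      - \sum_{i=1}^{l} z_i (\alpha_i - \alpha_i^*)

    \text{s.t.} \quad \sum_{i=1}^{l} (\alpha_i - \alpha_i^*) = 0,
      \quad 0 \le \alpha_i, \alpha_i^* \le C, \; i = 1, \dots, l,

with Q_{ij} = K(x_i, x_j) and targets z_i; a working set of pairs must touch both the \alpha and the \alpha^* side.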
2006
Support Vector Machines (SVM) are a widely adopted technique for both classification and regression problems. Training an SVM requires solving a linearly constrained convex quadratic problem. In real applications the number of training data may be huge and the Hessian matrix cannot be stored. To take this issue into account, a common strategy consists in …
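(The sentence is cut off in the source; the strategy referred to is decomposition.) The practical consequence of the storage limit is that Q is never formed explicitly; only the rows needed by the current working set are computed from the kernel on the fly. A minimal sketch, assuming an RBF kernel:

    import numpy as np

    def q_rows(X, y, B, gamma):
        """Rows Q[B, :] of the dense Hessian Q_ij = y_i y_j K(x_i, x_j) for
        an RBF kernel, computed on demand so the full l-by-l matrix is
        never stored; a decomposition step only needs the working-set rows."""
        d2 = ((X[B, None, :] - X[None, :, :]) ** 2).sum(-1)  # squared distances
        return (y[B, None] * y[None, :]) * np.exp(-gamma * d2)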
2002
In a previous paper, the author (2001) proved the convergence of a commonly used decomposition method for support vector machines (SVMs). However, there is no theoretical justification about its stopping criterion, which is based on the gap of the violation of the optimality condition. It is essential to have the gap asymptotically approach zero, so we are sure that existing implementations stop in a finite number of iterations after reaching a specified tolerance.
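In the usual notation, the gap in question compares the largest violation among variables that can still move "up" against the smallest among those that can move "down"; a sketch of how implementations such as LIBSVM evaluate it (illustrative, not this paper's exact notation):

    import numpy as np

    def kkt_gap(alpha, grad, y, C):
        """Gap of the violation of the optimality condition for the SVM
        dual: at an optimum,
        max_{i in I_up} -y_i grad_i <= min_{i in I_low} -y_i grad_i.
        Decomposition codes stop once this gap drops below a tolerance."""
        up   = ((alpha < C) & (y > 0)) | ((alpha > 0) & (y < 0))
        down = ((alpha < C) & (y < 0)) | ((alpha > 0) & (y > 0))
        v = -y * grad
        return v[up].max() - v[down].min()

    # typical use: stop the outer loop when kkt_gap(alpha, grad, y, C) <= 1e-3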
International Journal of Computer Applications, 2010
Training a support vector machine (SVM) leads to a quadratic optimization problem with bound constraints and one linear equality constraint. Despite the fact that this type of problem is well understood, there are many issues to be considered in designing an SVM learner. In particular, for large learning tasks with many training examples, off-the-shelf optimization techniques for general quadratic programs quickly become intractable in their memory and time requirements. Here we propose an algorithm which aims at reducing the learning time. It is based on the decomposition method proposed by Osuna for optimizing SVMs: it divides the original optimization problem into subproblems that are tractable for the machine in terms of CPU time and memory storage. In practice, the obtained solution is more parsimonious than that found by Osuna's approach in terms of learning time, while offering similar performance.
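A skeleton of this kind of decomposition loop (an illustrative sketch, not the exact variant benchmarked here; select and solve_sub stand for the working-set rule and the small dense QP solver, and Q_rows for an on-the-fly kernel routine like the one sketched earlier):

    import numpy as np

    def decompose(Q_rows, y, C, l, select, solve_sub, max_iter=1000):
        """Outer loop of an Osuna-style decomposition method (sketch only).
        Q_rows(B)          -> the |B| rows of the Hessian for working set B
        select(alpha, ...) -> violating indices; empty array when optimal
        solve_sub(...)     -> solution of the small dense QP on B"""
        alpha = np.zeros(l)
        grad = -np.ones(l)                  # gradient of 1/2 a'Qa - e'a at 0
        for _ in range(max_iter):
            B = select(alpha, grad, y, C)
            if len(B) == 0:                 # no optimality violation is left
                break
            QB = Q_rows(B)                  # only |B| rows of Q are formed
            new_aB = solve_sub(QB, alpha, grad, y, B, C)
            grad += QB.T @ (new_aB - alpha[B])  # rank-|B| gradient update
            alpha[B] = new_aB
        return alpha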
Australasian Conference on Knowledge Discovery and Data Mining, 2006
The support vector machine (SVM) is a well-established and accurate supervised learning method for the classification of data in various application fields. The statistical learning task, the so-called training, can be formulated as a quadratic optimization problem. In recent years the decomposition algorithm for solving this optimization problem became the most frequently used …
Computational Optimization and Applications, 2018
We consider the convex quadratic linearly constrained problem with bounded variables and with huge and dense Hessian matrix that arises in many applications such as the training problem of bias support vector machines. We propose a decomposition algorithmic scheme suitable to parallel implementations and we prove global convergence under suitable conditions. Focusing on support vector machines training, we outline how these assumptions can be satisfied in practice and we suggest various specific implementations. Extensions of the theoretical results to general linearly constrained problems are provided. We include numerical results on support vector machines with the aim of showing the viability and the effectiveness of the proposed scheme.
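The problem class treated there can be written as (standard form, for orientation)

    \min_{x} \; \tfrac{1}{2} x^{T} Q x + c^{T} x
    \quad \text{s.t.} \quad Ax = d, \; l \le x \le u,

with Q huge and dense; SVM training is the special case with a single equality constraint, A = y^{T}, d = 0, c = -e, l = 0, and u = Ce.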
