Academia.eduAcademia.edu

Quasi-Newton Methods

description404 papers
group20 followers
lightbulbAbout this topic
Quasi-Newton methods are iterative optimization algorithms used to find local minima or maxima of functions. They approximate the Hessian matrix of second derivatives using gradient information, improving convergence speed compared to gradient descent methods, while avoiding the computational cost of calculating the full Hessian.
lightbulbAbout this topic
Quasi-Newton methods are iterative optimization algorithms used to find local minima or maxima of functions. They approximate the Hessian matrix of second derivatives using gradient information, improving convergence speed compared to gradient descent methods, while avoiding the computational cost of calculating the full Hessian.

Key research themes

1. How can variants of Newton's method achieve third-order convergence without requiring higher-order derivatives?

This theme investigates modifications to the classical Newton's method to achieve cubic (third-order) convergence rates while avoiding the computational cost of evaluating second or higher-order derivatives. The interest lies in developing iterative schemes that retain the rapid convergence advantages of higher-order methods but remain practical by only utilizing first-order derivative information. This balance is crucial for solving nonlinear equations efficiently when derivative calculations are expensive or impractical.

Key finding: Proposed a variant of Newton's method that approximates the indefinite integral involved in Newton’s iteration via the trapezoidal rule instead of a rectangle, leading to a method with at least cubic convergence without... Read more
Key finding: Analyzed a family of third-order Newton variants that avoid explicit second derivative evaluations by approximating them using first derivatives at multiple points via integral approximations such as the trapezoid, harmonic,... Read more
Key finding: Developed a two-point iterative method based solely on Newton’s method that extends convergence to cases where traditional Newton's iterations fail (oscillations, divergence, or stationary points of the derivative). The... Read more

2. What explicit convergence rates and Hessian approximation properties can greedy quasi-Newton methods guarantee, and how do they differ from classical approaches?

This theme focuses on recent advances in quasi-Newton methods that improve convergence guarantees through directional selection strategies. Classical quasi-Newton updates typically use differences of successive iterates and provide mainly asymptotic convergence results. Greedy quasi-Newton methods select update directions to maximize progress measures, allowing explicit, non-asymptotic superlinear convergence bounds for both the iterates and the convergence of Hessian approximations. Understanding these improvements informs algorithm design for fast and reliable second-order optimization.

Key finding: Introduced a class of greedy quasi-Newton methods based on Broyden family updates (including DFP, BFGS, and SR1) that select basis vectors greedily to maximize local progress. Proved novel explicit non-asymptotic superlinear... Read more

3. How can quasi-Newton strategies be integrated with preconditioning in nonlinear conjugate gradient methods to enhance convergence and stability?

This theme explores the design of matrix-free preconditioners for nonlinear conjugate gradient (NCG) methods using quasi-Newton updates. By approximating inverse Hessians or average Hessian information through secant or secant-like conditions, these quasi-Newton preconditioners improve convergence speed and robustness of NCG schemes. Incorporation of damping techniques addresses practical challenges such as Wolfe line search conditions, especially in highly nonlinear or large-scale optimization problems.

Key finding: Reviewed and developed quasi-Newton inspired, matrix-free preconditioners for nonlinear conjugate gradient methods that satisfy secant or modified secant equations, effectively approximating inverse Hessian structures in an... Read more
Key finding: Proposed a new variable-metric SR1 quasi-Newton update incorporating Barzilai-Borwein step size information to improve convergence and positive definiteness of the Hessian approximations. Illustrated that the updated inverse... Read more
Key finding: Developed two new preconditioned conjugate gradient schemes combining the Hestenes-Stiefel CG parameter with novel self-scaling symmetric rank-one and self-scaling Davidon-Fletcher-Powell (DFP) quasi-Newton updates... Read more

All papers in Quasi-Newton Methods

New approximate secant equations are shown to result from the knowledge of (problem dependent) invariant subspace information, which in turn suggests improvements in quasi-Newton methods for unconstrained minimization. It is also shown... more
New approximate secant equations are shown to result from the knowledge of (problem dependent) invariant subspace information, which in turn suggests improvements in quasi-Newton methods for unconstrained minimization. A new limitedmemory... more
The properties of multilevel optimization problems defined on a hierarchy of discretization grids can be used to define approximate secant equations, which describe the second-order behaviour of the objective function. Following earlier... more
In this paper we analyze the use of structured quasi-Newton formulae as preconditioners of iterative linear methods when the inexact-Newton approach is employed for solving nonlinear systems of equations. We prove that superlinear... more
A random search algorithm for unconstrained local nonsmooth optimization is described. The algorithm forms a partition on R n using classification and regression trees (CART) from statistical pattern recognition. The CART partition... more
The aim of this work is to test the Levemberg Marquardt and BFGS (Broyden Fletcher Goldfarb Shanno) algorithms, implemented by the matlab functions lsqnonlin and fminunc of the Optimization Toolbox, for modeling the kinetic terms... more
At each iteration of the augmented Lagrangian algorithm, a nonlinear subproblem is being solved. The number of inner iterations (of some/any method) needed to obtain a solution of the subproblem, or even a suitable approximate stationary... more
The stabilized version of the sequential quadratic programming algorithm (sSQP) had been developed in order to achieve fast convergence despite possible degeneracy of constraints of optimization problems, when the Lagrange multipliers... more
A set of four-dimensional variational data assimilation (4D-Var) experiments were conducted using both a standard method and an incremental method in an identical twin framework. The full physics adjoint model of the Florida State... more
In this paper, the performances of the quasi-Newton BFGS algorithm, the NEWUOA derivative free optimizer, the Covariance Matrix Adaptation Evolution Strategy (CMA-ES), the Differential Evolution (DE) algorithm and Particle Swarm... more
A neural network (NN)-based nonlinear predictive control (NPC) is described for control of turbine power with variation in gate position. The studied plant includes the tunnel, surge tank and penstock effect dynamics. Multilayer... more
In the Fourier-domain flattening methods presented previously (Lomask and Claerbout, 2002; Lomask, 2003; Lomask et al., 2005) the data has to be mirrored in order to eliminate Fourier artifacts. This means that the data is replicated and... more
This article presents an optimization methodology for finding the heater settings that provide spatially uniform transient heating in manufacturing processes involving radiant heating. Equations governing the transient temperature and... more
A design procedure for frequency-response masking (FRM) prototype filters of cosine-modulated filter banks (CMFBs) is proposed. In the given method, we perform minimization of the maximum attenuation level in the filter's stopband,... more
A new algorithm for the location of a transition-state structure on an energy hypersurface is proposed. The method is compared to three other quasi-Newton step calculations available in literature. Numerical results derived from several... more
PREQN is a package of Fortran 77 subroutines for automatically generating preconditioners for the conjugate gradient method. It is designed for solving a sequence of linear systems A i x = b i i = 1 : : : t , where the coe cient matrices... more
The quasi-Newton strategy presented in this paper preserves one of the most important features of the stabilized Sequential Quadratic Programming method, the local convergence without constraint qualifications assumptions. It is known... more
The stabilized version of the sequential quadratic programming algorithm (sSQP) had been developed in order to achieve fast convergence despite possible degeneracy of constraints of optimization problems, when the Lagrange multipliers... more
In this article, we consider the correction of metric matrices in quasi-Newton methods (QNM) from the perspective of machine learning theory. Based on training information for estimating the matrix of the second derivatives of a function,... more
The notion of least-change secant updates is extended to apply to nonsquare matrices in a way appropriate for quasi-Newton methods used to solve systems of nonlinear equations that depend on parameters. Extensions of the widely used... more
The notion of least-change secant updates is extended to apply to nonsquare matrices in a way appropriate for quasi-Newton methods used to solve systems of nonlinear equations that depend on parameters. Extensions of the widely used... more
Using Taylor's series we propose a modified secant relation to get a more accurate approximation of the second curvature of the objective function. Then, based on this modified secant relation we present a new BFGS method for solving... more
Using Taylor's series we propose a modified secant relation to get a more accurate approximation of the second curvature of the objective function. Then, based on this modified secant relation we present a new BFGS method for solving... more
The Han-Powell method has proved to be extremely fast and robust for small optimum power flow problems (of the order of 100 buses) However, it balks at full size problems (of the order of 1000 buses) This paperdevelops a class of... more
The Han-Powell method has proved to be extremely fast and robust for small optimum power flow problems (of the order of 100 buses) However, it balks at full size problems (of the order of 1000 buses) This paperdevelops a class of... more
This paper presents a modified quasi-Newton method for structured unconstrained optimization. The usual SQN equation employs only the gradients, but ignores the available function value information. Several researchers paid attention to... more
We apply and compare various artificial neural network (ANN) and other algorithms for the automated morphological classification of galaxies. The ANNs are presented here mathematically, as non-linear extensions of conventional statistical... more
SQOPT is a software package for minimizing a convex quadratic function subject to both equality and inequality constraints. SQOPT may also be used for linear programming and for finding a feasible point for a set of linear equalities and... more
SNOPT is a general-purpose system for solving optimization problems involving many variables and constraints. It minimizes a linear or nonlinear function subject to bounds on the variables and sparse linear or nonlinear constraints. It is... more
A unifying approach is presented for the nonlinear static analysis of cable structures and for the form-finding of tensegrity structures. The novelty lies in the possibility of static analyses of structures where the stiffness matrix is... more
We proposed a matrix-free direction with an inexact line search technique to solve system of nonlinear equations by using double direction approach. In this article, we approximated the Jacobian matrix by appropriately constructed... more
A comparison of the Gauss-Newton and quasi-Newton methods in resistivity imaging inversion Loke, MH; Dahlin, Torleif
A fast inversion technique for the interpretation of data from resistivity tomography surveys has been developed for operation on a microcomputer. This technique is based on the smoothness‐constrained least‐squares method and it produces... more
This research proposes a novel approach to bridging the gap between theoretical concepts and practical applications of inventory management functions within intelligent order management systems. The study introduces a robust framework... more
This paper presents the application of Newton-based methods in the time-domain for the computation of the periodic steady state solutions of microgrids with multiple distributed generation units, harmonic stability and power quality... more
This paper presents an introduction to the use of neural network computational algorithms for the identification of dynamic systems. Simulated linear and non-linear systems and real plant data are used to demonstrate the effectiveness of... more
KEY WORDS: topology optimization; dual methods; convex approximations; CONLIN; generalised MMA; quasi-Newton method; 1 INTRODUCTION As a result of several researches (e.g.[6, 9, 8, 12]), structural optimization problems with sizing or... more
This paper presents an improved diagonal Secant-like method using two-step approach for solving large scale systems of nonlinear equations. In this scheme, instead of using direct updating matrix in every iteration to construct the... more
A new iterative method based on the quasi-Newton approach for solving systems of nonlinear equations, especially large scale is proposed. We used the weighted combination of the Trapezoidal and Simpson quadrature rules. Our goal is to... more
We present a new matrix-free method for the computation of negative curvature directions based on the eigenstructure of minimal-memory BFGS matrices. We determine via simple formulas the eigenvalues of these matrices and we compute the... more
We present a new matrix-free method for the computation of the negative curvature direction in large scale unconstrained problems. We describe a curvilinear method which uses a combination of a quasi-Newton direction and a negative... more
In this work, we introduce a family of Least Change Secant Update Methods for solving Nonlinear Complementarity Problems based on its reformulation as a nonsmooth system using the one-parametric class of nonlinear complementarity... more
In this work new quasi-Newton methods for solving large-scale nonlinear systems of equations are presented. In these methods q (¿ 1) columns of the approximation of the inverse Jacobian matrix are updated in such a way that the q last... more
This paper introduces new contrast functions for blind separation of sources with different time-frequency signatures. Two contrast functions based on the Kullback-Leibler and Jensen-Rényi divergences in the time-frequency (T-F) plane are... more
Uncertainty quantification of production forecasts is crucially important for business planning of hydrocarbon-field developments. This is still a very challenging task, especially when subsurface uncertainties must be conditioned to... more
Iterative methods have gained a solid reputation for efficient image restoration, for both spatially invariant and spatially variant blurs. This paper shows how a "strap-on" quasi-Newton Broyden method can further accelerate the... more
Iterative methods have gained a solid reputation for efficient image restoration, for both spatially invariant and spatially variant blurs. This paper shows how a “strap-on” quasi-Newton Broyden method can further accelerate the... more
Zontul, Metin (Arel Author)It is getting more difficult to retrieve relevant information regarding the user input query due to the large amount of information in the web. Unlike the conventional information retrieval (IR) algorithms, this... more
In this paper, we present a hybrid optimization approach to solving the economic dispatch (ED) problem. The objective is to minimize the total fuel cost and keep the power flows within the security limits. The idea consists in combining... more
Download research papers for free!