Quasi-Newton Methods Research Papers

Multi-secant equations, approximate invariant subspaces and multigrid optimization

2025

New approximate secant equations are shown to result from the knowledge of (problem dependent) invariant subspace information, which in turn suggests improvements in quasi-Newton methods for unconstrained minimization. It is also shown... more

descriptionView Paper arrow_downwardDownload

Approximate invariant subspaces and quasi-newton optimization methods

by Serge Gratton

2025, Optimization Methods and Software

New approximate secant equations are shown to result from the knowledge of (problem dependent) invariant subspace information, which in turn suggests improvements in quasi-Newton methods for unconstrained minimization. A new limitedmemory... more

descriptionView Paper arrow_downwardDownload

Using approximate secant equations in limited memory methods for multilevel unconstrained optimization

by Serge Gratton

2025, Computational Optimization and Applications

The properties of multilevel optimization problems defined on a hierarchy of discretization grids can be used to define approximate secant equations, which describe the second-order behaviour of the objective function. Following earlier... more

descriptionView Paper arrow_downwardDownload

A Theory of Secant Preconditioners

by Jose Mario Martinez

2025, Mathematics of Computation

In this paper we analyze the use of structured quasi-Newton formulae as preconditioners of iterative linear methods when the inexact-Newton approach is employed for solving nonlinear systems of equations. We prove that superlinear... more

descriptionView Paper arrow_downwardDownload

CARTopt: a random search method for nonsmooth unconstrained optimization

by Chris Price

2025, Computational Optimization and Applications

A random search algorithm for unconstrained local nonsmooth optimization is described. The algorithm forms a partition on R n using classification and regression trees (CART) from statistical pattern recognition. The CART partition... more

descriptionView Paper arrow_downwardDownload

Parameter Estimation Algorithms for Kinetic Modeling from Noisy Data

by Davide Pinelli

2025, IFIP Advances in Information and Communication Technology

The aim of this work is to test the Levemberg Marquardt and BFGS (Broyden Fletcher Goldfarb Shanno) algorithms, implemented by the matlab functions lsqnonlin and fminunc of the Optimization Toolbox, for modeling the kinetic terms... more

descriptionView Paper arrow_downwardDownload

On the cost of solving augmented Lagrangian subproblems

by Damián Fernández

2025, Mathematical Programming

At each iteration of the augmented Lagrangian algorithm, a nonlinear subproblem is being solved. The number of inner iterations (of some/any method) needed to obtain a solution of the subproblem, or even a suitable approximate stationary... more

descriptionView Paper arrow_downwardDownload

Stabilized sequential quadratic programming for optimization and a stabilized Newton-type method for variational problems

by Damián Fernández

2025, Mathematical Programming

The stabilized version of the sequential quadratic programming algorithm (sSQP) had been developed in order to achieve fast convergence despite possible degeneracy of constraints of optimization problems, when the Lagrange multipliers... more

descriptionView Paper arrow_downwardDownload

Performance of 4D-Var with Different Strategies for the Use of Adjoint Physics with the FSU Global Spectral Model

by Ionel Navon

2025, Monthly Weather Review

A set of four-dimensional variational data assimilation (4D-Var) experiments were conducted using both a standard method and an incremental method in an identical twin framework. The full physics adjoint model of the Florida State... more

A set of four-dimensional variational data assimilation (4D-Var) experiments were conducted using both a standard method and an incremental method in an identical twin framework. The full physics adjoint model of the Florida State University global spectral model (FSUGSM) was used in the standard 4D-Var, while the adjoint of only a few selected physical parameterizations was used in the incremental method. The impact of physical processes on 4D-Var was examined in detail by comparing the results of these experiments. The inclusion of full physics turned out to be significantly beneficial in terms of assimilation error to the lower troposphere during the entire minimization process. The beneficial impact was found to be primarily related to boundary layer physics. The precipitation physics in the adjoint model also tended to have a beneficial impact after an intermediate number (50) of minimization iterations. Experiment results confirmed that the forecast from assimilation analyses with the full physics adjoint model displays a shorter precipitation spinup period. The beneficial impact on precipitation spinup did not result solely from the inclusion of the precipitation physics in the adjoint model, but rather from the combined impact of several physical processes. The inclusion of full physics in the adjoint model exhibited a detrimental impact on the rate of convergence at an early stage of the minimization process, but did not affect the final convergence. A truncated Newton-like incremental approach was introduced for examining the possibility of circumventing the detrimental aspects using the full physics in the adjoint model in 4D-Var but taking into account its positive aspects. This algorithm was based on the idea of the truncated Newton minimization method and the sequential cost function incremental method introduced by Courtier et al., consisting of an inner loop and an outer loop. The inner loop comprised the incremental method, while the outer loop consisted of the standard 4D-Var method using the full physics adjoint. The limited-memory quasi-Newton minimization method (L-BFGS) was used for both inner and outer loops, while information on the Hessian of the cost function was jointly updated at every iteration in both loops. In an experiment with a two-cycle truncated Newton-like incremental approach, the assimilation analyses turned out to be better than those obtained from either the standard 4D-Var or the incremental 4D-Var in all aspects examined. The CPU time required by this two-cycle approach was larger by 35% compared with that required by the incremental 4D-Var without almost any physics in the adjoint model, while the CPU time required by the standard 4D-Var with the full physics adjoint model was more than twice that required by the incremental 4D-Var. Finally, several hypotheses concerning the impact of using standard 4D-Var full physics on minimization convergence were advanced and discussed.

descriptionView Paper arrow_downwardDownload

Experimental Comparisons of Derivative Free Optimization Algorithms

by Marc Schoenauer

2025, arXiv (Cornell University)

In this paper, the performances of the quasi-Newton BFGS algorithm, the NEWUOA derivative free optimizer, the Covariance Matrix Adaptation Evolution Strategy (CMA-ES), the Differential Evolution (DE) algorithm and Particle Swarm... more

descriptionView Paper arrow_downwardDownload

Nonlinear predictive control for a NNARX hydro plant model

by Nand Kishor

2025, Neural Computing and Applications

A neural network (NN)-based nonlinear predictive control (NPC) is described for control of turbine power with variation in gate position. The studied plant includes the tunnel, surge tank and penstock effect dynamics. Multilayer... more

descriptionView Paper arrow_downwardDownload

Short Note Flattening with cosine transforms

by Jesse Lomask

2025

In the Fourier-domain flattening methods presented previously (Lomask and Claerbout, 2002; Lomask, 2003; Lomask et al., 2005) the data has to be mirrored in order to eliminate Fourier artifacts. This means that the data is replicated and... more

descriptionView Paper arrow_downwardDownload

Optimization of Transient Heater Settings to Provide Spatially Uniform Heating in Manufacturing Processes Involving Radiant Heating

by John R. Howell

2025, Numerical Heat Transfer, Part A: Applications

This article presents an optimization methodology for finding the heater settings that provide spatially uniform transient heating in manufacturing processes involving radiant heating. Equations governing the transient temperature and... more

descriptionView Paper arrow_downwardDownload

Optimized Prototype Filter Based on the FRM Approach for Cosine-Modulated Filter Banks

by Paulo Augusto Diniz

2025, Circuits, Systems, and Signal Processing

A design procedure for frequency-response masking (FRM) prototype filters of cosine-modulated filter banks (CMFBs) is proposed. In the given method, we perform minimization of the maximum attenuation level in the filter's stopband,... more

descriptionView Paper arrow_downwardDownload

A quasi-Newton algorithm for first-order saddle-point location

by Văn Hiến Nguyễn

2025, Theoretica chimica acta

A new algorithm for the location of a transition-state structure on an energy hypersurface is proposed. The method is compared to three other quasi-Newton step calculations available in literature. Numerical results derived from several... more

descriptionView Paper arrow_downwardDownload

Algorithm 809: PREQN

by Jose Luis Morales

2025, ACM Transactions on Mathematical Software

PREQN is a package of Fortran 77 subroutines for automatically generating preconditioners for the conjugate gradient method. It is designed for solving a sequence of linear systems A i x = b i i = 1 : : : t , where the coe cient matrices... more

descriptionView Paper arrow_downwardDownload

A quasi-Newton strategy for the sSQP method for variational inequality and optimization problems

by Damián Fernández

2025, Mathematical Programming

The quasi-Newton strategy presented in this paper preserves one of the most important features of the stabilized Sequential Quadratic Programming method, the local convergence without constraint qualifications assumptions. It is known... more

descriptionView Paper arrow_downwardDownload

Stabilized sequential quadratic programming for optimization and a stabilized Newton-type method for variational problems

by Damián Fernández

2025, Mathematical Programming

The stabilized version of the sequential quadratic programming algorithm (sSQP) had been developed in order to achieve fast convergence despite possible degeneracy of constraints of optimization problems, when the Lagrange multipliers... more

descriptionView Paper arrow_downwardDownload

Machine Learning in Quasi-Newton Methods

by Elena Tovbis

2025, Axioms

In this article, we consider the correction of metric matrices in quasi-Newton methods (QNM) from the perspective of machine learning theory. Based on training information for estimating the matrix of the second derivatives of a function,... more

k k Figure 1. Step of process (13) on hyperplane <z", c> = y, along the direction z*. Proof. Property 2 is justified by the direct implementation of the function in (15) which is the minimizing step along the direction zk which is presented in Figure 1. Property 1 follows from the fact that movement to the point cK+] is carried out along the normal to the hyperplane <z* ,C> = Yx, that is, along the shortest path (Figure 1). Movement to other points on the hyperplane, for example to point A, satisfy only the condition in (14). Let us denote the residual as * = ck — c*. By subtracting c* from both sides of (13) and naking transformations, we obtain the following learning algorithm in the form of residuals:

Figure 2. Qualitative behavior of the spectrum of matrix Hk eigenvalues for cases of scaling (124) fot various values of K.

Figure 3. Level curves and paths of the optimization algorithms for function f5. The path of three considered algorithms is shown in Figure 3. Here, theoretical results of the influence of the orthogonality degree of matrix learning vectors on the convergence rate of the method are confirmed. The BFGS_V method performs forced orthogonalization, which improves the result of the BFGS method. The trajectories of the methods are listed in Tables Al—A3 of Appendix A (the trajectory of the DFP method is shown partially).

Table 1. Results of minimization with normalization of matrix (124) at K = 1 and n = 1000. Table 2. Results of minimization with normalization of matrix (124) at K = 10,000 and n = 1000. For results marked with an asterisk, K = 100.

The initial normalization of the metric matrices, as follows from the results of Tables 1 and 2, significantly improves the convergence of QNMs. The situation corresponds to the case in Figure 2 for K > 1. Large eigenvalues in the unexplored part of the subspace make it easy to find new conjugate directions and efficiently train metric matrices with almost orthogonal training vectors.

Table 3. Results of minimization with normalization of matrix (124) at K = 0.000001 and n = 2.

Table A1. Trajectory of the BFGS_V method moving.

Table A2. Trajectory of the BFGS method moving. Table A3. Trajectory of the DFP method moving.

descriptionView Paper arrow_downwardDownload

Least-Change Secant Updates of Nonsquare Matrices

by samih bourji

2024, SIAM Journal on Numerical Analysis

The notion of least-change secant updates is extended to apply to nonsquare matrices in a way appropriate for quasi-Newton methods used to solve systems of nonlinear equations that depend on parameters. Extensions of the widely used... more

The notion of least-change secant updates is extended to apply to nonsquare matrices in a way appropriate for quasi-Newton methods used to solve systems of nonlinear equations that depend on parameters. Extensions of the widely used least-change secant updates for square matrices are given. A local convergence analysis for certain paradigm iterations is outlined as motivation for the use of these updates, and numerical experiments involving these iterations are discussed. Key words, least-change secant updates, quasi-Newton updates, parameter-dependent systems AMS(MOS) subject classification. 65H10 1. Introduction. Quasi-Newton methods are very widely used iterative methods for solving systems of nonlinear algebraic equations. The basic form of a quasi-Newton method for solving F(x) 0, F R n-. Rn, is (1.1) Xk+: =xk-B:F(xk), in which Bk ,. F(xk) E Rnn, the Jacobian (matrix) of F at xk. For practical success, it is usually essential to augment this basic form with procedures for modifying the step-B:F(xk) to ensure progress from bad starting points, but we need not consider such procedures here. For a general reference on all aspects of quasi-Newton methods, see Dennis and Schnabel [11]. The most effective quasi-Newton methods are those in which each successive Bk+: is determined as a least-change secant update of its predecessor Bk. As the name suggests, Bk+l is determined as a least-change secant update of Bk by making the least possible change in Bk (as measured by a suitable matrix norm) which incorporates current secant information (usually expressed in terms of successive x-and F-values) and other available information about the structure of F. There are also notable updates which, strictly speaking, are least-change inverse secant updates obtained in an analogous way by making the least possible change in B-:. When speaking generically of least-change secant updates, we intend to include these. In [10], Dennis and Schnabel precisely formalize the notion of a least-change secant update and show how the most widely used updates can be derived as least-change secant updates. In [12], Dennis and Walker show that least-change secant update methods, i.e., quasi-Newton methods which use least-change secant updates, have desirable convergence properties in general.

descriptionView Paper arrow_downwardDownload

Least-Change Secant Updates of Nonsquare Matrices

by samih bourji

2024, SIAM Journal on Numerical Analysis

The notion of least-change secant updates is extended to apply to nonsquare matrices in a way appropriate for quasi-Newton methods used to solve systems of nonlinear equations that depend on parameters. Extensions of the widely used... more

The notion of least-change secant updates is extended to apply to nonsquare matrices in a way appropriate for quasi-Newton methods used to solve systems of nonlinear equations that depend on parameters. Extensions of the widely used least-change secant updates for square matrices are given. A local convergence analysis for certain paradigm iterations is outlined as motivation for the use of these updates, and numerical experiments involving these iterations are discussed. Key words, least-change secant updates, quasi-Newton updates, parameter-dependent systems AMS(MOS) subject classification. 65H10 1. Introduction. Quasi-Newton methods are very widely used iterative methods for solving systems of nonlinear algebraic equations. The basic form of a quasi-Newton method for solving F(x) 0, F R n-. Rn, is (1.1) Xk+: =xk-B:F(xk), in which Bk ,. F(xk) E Rnn, the Jacobian (matrix) of F at xk. For practical success, it is usually essential to augment this basic form with procedures for modifying the step-B:F(xk) to ensure progress from bad starting points, but we need not consider such procedures here. For a general reference on all aspects of quasi-Newton methods, see Dennis and Schnabel [11]. The most effective quasi-Newton methods are those in which each successive Bk+: is determined as a least-change secant update of its predecessor Bk. As the name suggests, Bk+l is determined as a least-change secant update of Bk by making the least possible change in Bk (as measured by a suitable matrix norm) which incorporates current secant information (usually expressed in terms of successive x-and F-values) and other available information about the structure of F. There are also notable updates which, strictly speaking, are least-change inverse secant updates obtained in an analogous way by making the least possible change in B-:. When speaking generically of least-change secant updates, we intend to include these. In [10], Dennis and Schnabel precisely formalize the notion of a least-change secant update and show how the most widely used updates can be derived as least-change secant updates. In [12], Dennis and Walker show that least-change secant update methods, i.e., quasi-Newton methods which use least-change secant updates, have desirable convergence properties in general.

descriptionView Paper arrow_downwardDownload

The modified BFGS method with new secant relation for unconstrained optimization problems

by narges bidabadi

2024, Computational Methods for Differential Equations

Using Taylor's series we propose a modified secant relation to get a more accurate approximation of the second curvature of the objective function. Then, based on this modified secant relation we present a new BFGS method for solving... more

descriptionView Paper arrow_downwardDownload

The modified BFGS method with new secant relation for unconstrained optimization problems

by narges bidabadi

2024, Computational Methods for Differential Equations

Using Taylor's series we propose a modified secant relation to get a more accurate approximation of the second curvature of the objective function. Then, based on this modified secant relation we present a new BFGS method for solving... more

descriptionView Paper arrow_downwardDownload

Decompositions For Optimal Power Flows

by Vibhu Kalyan

2024, IEEE Transactions on Power Apparatus and Systems

The Han-Powell method has proved to be extremely fast and robust for small optimum power flow problems (of the order of 100 buses) However, it balks at full size problems (of the order of 1000 buses) This paperdevelops a class of... more

descriptionView Paper arrow_downwardDownload

Decompositions for Optimal Power Flows

by Vibhu Kalyan

2024, IEEE Power Engineering Review

The Han-Powell method has proved to be extremely fast and robust for small optimum power flow problems (of the order of 100 buses) However, it balks at full size problems (of the order of 1000 buses) This paperdevelops a class of... more

descriptionView Paper arrow_downwardDownload

A new structured quasi-Newton algorithm using partial information on Hessian

by Dr.Keyvan Amini

2024, Journal of Computational and Applied Mathematics

This paper presents a modified quasi-Newton method for structured unconstrained optimization. The usual SQN equation employs only the gradients, but ignores the available function value information. Several researchers paid attention to... more

descriptionView Paper arrow_downwardDownload

Neural computation as a tool for galaxy classification: methods and examples

by Michael Storrie-Lombardi

2024, Monthly Notices of the Royal Astronomical Society

We apply and compare various artificial neural network (ANN) and other algorithms for the automated morphological classification of galaxies. The ANNs are presented here mathematically, as non-linear extensions of conventional statistical... more

descriptionView Paper arrow_downwardDownload

User's Guide for SQOPT Version 7.5: Software for Large-Scale Linear and Quadratic Programming

by Michael Saunders

2024

SQOPT is a software package for minimizing a convex quadratic function subject to both equality and inequality constraints. SQOPT may also be used for linear programming and for finding a feasible point for a set of linear equalities and... more

descriptionView Paper arrow_downwardDownload

User’s Guide For Snopt Version 6, A Fortran Package for Large-Scale Nonlinear Programming∗

by Michael Saunders

2024

SNOPT is a general-purpose system for solving optimization problems involving many variables and constraints. It minimizes a linear or nonlinear function subject to bounds on the variables and sparse linear or nonlinear constraints. It is... more

descriptionView Paper arrow_downwardDownload

A unified approach for analysis of cable and tensegrity structures using memoryless quasi-newton minimization of total strain energy

by Nathan Branam

2024, Engineering Structures

A unifying approach is presented for the nonlinear static analysis of cable structures and for the form-finding of tensegrity structures. The novelty lies in the possibility of static analyses of structures where the stiffness matrix is... more

descriptionView Paper arrow_downwardDownload

Efficient matrix-free direction method with line search for solving large-scale system of nonlinear equations

by Ibrahim Yusuf

2024, Yugoslav Journal of Operations Research

We proposed a matrix-free direction with an inexact line search technique to solve system of nonlinear equations by using double direction approach. In this article, we approximated the Jacobian matrix by appropriately constructed... more

descriptionView Paper arrow_downwardDownload

A comparison of the Gauss–Newton and quasi-Newton methods in resistivity imaging inversion

by M.H. Loke

2024, Journal of Applied Geophysics

A comparison of the Gauss-Newton and quasi-Newton methods in resistivity imaging inversion Loke, MH; Dahlin, Torleif

descriptionView Paper arrow_downwardDownload

Resolution of 2D Wenner resistivity imaging as assessed by numerical modelling

by M.H. Loke

2024, Journal of Applied Geophysics

descriptionView Paper arrow_downwardDownload

Rapid least‐squares inversion of apparent resistivity pseudosections by a quasi‐Newton method1

by M.H. Loke

2024, Geophysical Prospecting

A fast inversion technique for the interpretation of data from resistivity tomography surveys has been developed for operation on a microcomputer. This technique is based on the smoothness‐constrained least‐squares method and it produces... more

Figure 2. Arrangement of rectangular blocks used in the 2D model.

Figure 3. Part of finite-difference mesh with 139 by 18 nodes together with the node numbers for the electrodes.

Figure 4. (a) Wenner array pseudosection due to a wide rectangular block. Inverse models obtained with (b) the Gauss—Newton method and (c) the quasi-Newton method. Note that a different contour interval is used for the apparent resistivity pseudosection. The depths to the centre of each row of the model blocks are shown in (b) and (c). The outline of the block is also shown for comparison.

Rectangular block model © 1996 European Association of Geoscientists & Engineers, Geophysical Prospecting, 44, 131-15:

Figure 6. (a) Wenner array pseudosection due to a wide rectangular block with 5% random noise. Models obtained with (b) the Gauss-Newton method and (c) the quasi-Newton method. The outline of the block is also shown for comparison.

Figure 7. (a) Stud Farm survey apparent resistivity pseudosection. Models obtained with (b) the Gauss—Newton and (c) the quasi-Newton methods. Locations of boreholes and observed depths to weathered microdiorite are indicated.

Figure 8. Error curves for the inversion of the Stud Farm survey data using the Gauss— Newton and quasi-Newton methods.

© 1996 European Association of Geoscientists & Engineers, Geophysical Prospecting, 44, 131-15:

Figure 10. Borehole resistivity log from a survey at Blue Farm. The model resistivity obtained by the quasi-Newton method along the borehole is also shown for comparison. c). The microdiorite is known to be deeply weathered in places, so a gradational increase in the resistivity from the overburden to the fresh bedrock is expected. The two regions with higher resistivity material near the surface at the 450 and 650m marks are probably due to boulders and partially weathered material. The depths to the weathered bedrock at four boreholes near the survey line are also shown. There is good agreement between the microdiorite topography in the models and the borehole depths at locations BH1 and BH2. The model bedrock interface is significantly deeper than the borehole depths at BH3 and BH4. One possible reason is that the boreholes record the top of the very weathered microdiorite and not the higher resistivity bedrock shown in the sections. Another possible reason is that the boreholes were sunk into core boulders (Turnbull 1986). Furthermore, it is known from other boreholes in this area that the bedrock topography is not 2D which could also account for differences between the models and the boreholes.

Figure 11. (a) Dipole-dipole apparent resistivity pseudosection for the Magusi River orebody survey. (b) Model section obtained at the 6th iteration by the quasi-Newton method. Note that the apparent resistivity pseudosection uses a different contour interval. The boundaries of the orebody and overburden (Edwards 1977) are also shown. ‘The quasi-Newton method requires only about one-ninth of the computing time needed by the Gauss~Newton method to converge (Table 1). Perhaps more importantly, the finite-difference subroutine in the quasi-Newton program requires only about 240 kilobytes of memory compared to 2700 kilobytes for the Gauss— Newton program. It is possible to reduce the memory space required by saving some of the arrays temporarily in a disc file but this would substantially increase the computing time. This pseudosection (with 35 electrodes) is about the largest that can be efficiently processed by the Gauss—Newton program in a computer with only 4 megabytes of memory. In comparison, we have successfully processed pseudosections with 64 electrodes using the quasi-Newton program.

Table 1. A comparison of the number of iterations and the time taken by the Gauss— Newton (GN) and quasi-Newton (QN) methods to converge for different data sets. The exact rms error of the final model and the memory required by the finite-difference subroutine are also shown. inversion of this data set. Rather small damping factors can be used since this data set is noise free. The rms error convergence limit was set at 1%. The models obtained by the Gauss—Newton and quasi-Newton methods are shown in Figs 4b and c, respectively. There are no significant differences between the two models. The shapes of both models agree reasonably well with the actual shape of the block. (he highest model resistivity value near the centre of the rectangular block is about 250m, which is less than the true value of 500Qm. This is partly a result of equivalence (Keller and Frischknecht 1966), where a thicker body with a lower resistivity contrast can give rise to the same anomaly as a thinner block with a higher resistivity contrast. TMwW.) lw et ty re, ro Vet 7

descriptionView Paper arrow_downwardDownload

Framework for Optimized Sales and Inventory Control: A Comprehensive Approach for Intelligent Order Management Application

by Sumit Mittal

2024, International Journal of Computer Trends and Technology

This research proposes a novel approach to bridging the gap between theoretical concepts and practical applications of inventory management functions within intelligent order management systems. The study introduces a robust framework... more

descriptionView Paper arrow_downwardDownload

Harmonic Issues Assessment on PWM VSC-Based Controlled Microgrids Using Newton Methods

by Nancy Visairo

2024, IEEE Transactions on Smart Grid

This paper presents the application of Newton-based methods in the time-domain for the computation of the periodic steady state solutions of microgrids with multiple distributed generation units, harmonic stability and power quality... more

descriptionView Paper arrow_downwardDownload

Feedforward Neural Networks for the Identification of Dynamic Processes

by Jules Thibault

2024, Chemical Engineering Communications

This paper presents an introduction to the use of neural network computational algorithms for the identification of dynamic systems. Simulated linear and non-linear systems and real plant data are used to demonstrate the effectiveness of... more

where U is the scaled input vector and H is the output vector of the neurons contained in the hidden layer. The last elements of these two vectors, U, and H,, are the bias and they are set equal to 1. The bias provides a way of adding a constant term to the weighted sum. Output of a neuron of the final laver:

This system has a static gain of 4, a zero at 2 and complex poles at 0.80 + 0.458 j. Figure 2 presents the first 300 values of the 500 data points sequence used in this investigation to teach the neural network, the dynamics of the system. This simulated system was also used with various levels of superimposed measurement noise. The upper graph plotted in Figure 2 shows the zero mean Gaussian noise, with a standard deviation of unity (o = 1), directly added to the output signal.

FIGURE 3 Discrete-time sequences of the non-linear system. The gas furnace example of Box and Jenkins (1976) is also treated since it has become a classical example used by many authors (Young, 1984; Graham and

FIGURE 4 Influence of parameter a, on the gain and poles of the non-linear system.

FIGURE 5 _Discrete-time sequences of the gas furnace data.

FIGURE 6 Modelling errors: (A) Noise free linear system, (B) Linear system with noise (o = 1) and (C) Non-linear system.

FIGURE 7 Comparison of backpropagation and quasi-Newton Jearning algorithms.

calculated system output of the linear system at each sampling time. Seven different noise levels were used, corresponding to noise having a standard deviation in the range of 0 to 1.4. The models, identified with measurement noise, were validated with the noise free discrete-time sequence. The sum of squares of the residuals evaluated on a sequence of 500 data points are presented in Figure 9. The middle graph of Figure 6 presents the prediction errors obtained with the linear system to which was added measurement noise having a standard deviation of unity. The corresponding sum of squares of the measurement noise superimposed to the output data sequence is also plotted in Figure 9. As the level of noise increases, the identification of the correct transfer function is more difficult as evidenced by the two converging curves of Figure 9. System identification with neural networks is more sensitive to noise than standard linear identification techniques. Indeed, the simple least square method was able to identify correctly the original linear transfer function even for high levels of noise and short data sequences. These results clearly show that neural networks can identify a good model in the presence of measurement noise provided it is not excessive. The results for the identification of the linear system with measurement noise

FIGURE 9 Influence of the level of noise on the identification of the linear system. structures (2, 1, 1 and 2, 2, 1) give nearly equivalent sums of squares of the residuals of approximately 1200 which is more than twice the sum of squares of the original noise. When the same model was used with the noise free discrete-time sequence, the sum of squares of the residuals was reduced to approximately 200 (Figures 6 and 9). To be valuable in practice, a good noise filter of the output signal would be required.

Sum of squares of residuals as a function of the structure of the model

Influence of the size of the learning data set

descriptionView Paper arrow_downwardDownload

Convex Approximation Methods for Large Scale Structural Problems

by Pierre Duysinx

2024

KEY WORDS: topology optimization; dual methods; convex approximations; CONLIN; generalised MMA; quasi-Newton method; 1 INTRODUCTION As a result of several researches (e.g.[6, 9, 8, 12]), structural optimization problems with sizing or... more

descriptionView Paper arrow_downwardDownload

An Implicit Multi-Step Diagonal Secant-Type Method for Solving Large-Scale Systems of Nonlinear Equations

by Zanariah Abdul Majid

2024

This paper presents an improved diagonal Secant-like method using two-step approach for solving large scale systems of nonlinear equations. In this scheme, instead of using direct updating matrix in every iteration to construct the... more

descriptionView Paper arrow_downwardDownload

Quadrature based Broyden-like method for systems of nonlinear equations

by Saidat Yusuff

2024, STATISTICS, OPTIMIZATION AND INFORMATION COMPUTING

A new iterative method based on the quasi-Newton approach for solving systems of nonlinear equations, especially large scale is proposed. We used the weighted combination of the Trapezoidal and Simpson quadrature rules. Our goal is to... more

Therefore, x, converges q-superlinearly to x*. Conversely, suppose that x, converges q-superlinearly to «* and F(a*) = 0. Then, by Lemma 2.1, there exists p > 0 such that we have We used six test functions with eight instances of dimension n = 5 to n = 1065, which makes a total of 48 problems, in order to check the effectiveness of the proposed methods. The xo stands for the initial approximatior to the solution in all the tested problems.

Table 2. Summary of Robustness, Efficiency and Combined Robustness and Efficiency Measures superior in terms of robustness over other methods. The number of iterations obtained is promising and competitive, the CPU time is also encouraging. The results of the proposed method are better than one of CB and TB. However, MSB is more efficient than the proposed method. In conclusion, the numerical tests have shown that our new method is comparable with the well known existing Broyden-like methods.

descriptionView Paper arrow_downwardDownload

A curvilinear method based on minimal-memory BFGS updates

by Marianna S. Apostolopoulou

2024, Applied Mathematics and Computation

We present a new matrix-free method for the computation of negative curvature directions based on the eigenstructure of minimal-memory BFGS matrices. We determine via simple formulas the eigenvalues of these matrices and we compute the... more

descriptionView Paper arrow_downwardDownload

A Curvilinear Method for Large Scale Optimization Problems

by Marianna S. Apostolopoulou

2024

We present a new matrix-free method for the computation of the negative curvature direction in large scale unconstrained problems. We describe a curvilinear method which uses a combination of a quasi-Newton direction and a negative... more

descriptionView Paper arrow_downwardDownload

Least Change Secant Update Methods for Nonlinear Complementarity Problem

by Rosana Perez

2024, Ingeniería y Ciencia

In this work, we introduce a family of Least Change Secant Update Methods for solving Nonlinear Complementarity Problems based on its reformulation as a nonsmooth system using the one-parametric class of nonlinear complementarity... more

descriptionView Paper arrow_downwardDownload

Inverse q-Columns Updating Methods for solving nonlinear systems of equations

by Rosana Perez

2024, Journal of Computational and Applied Mathematics

In this work new quasi-Newton methods for solving large-scale nonlinear systems of equations are presented. In these methods q (¿ 1) columns of the approximation of the inverse Jacobian matrix are updated in such a way that the q last... more

descriptionView Paper arrow_downwardDownload

Contrast functions for blind source separation based on time-frequency information-theory

by Adel BELOUCHRANI

2024, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

This paper introduces new contrast functions for blind separation of sources with different time-frequency signatures. Two contrast functions based on the Kullback-Leibler and Jensen-Rényi divergences in the time-frequency (T-F) plane are... more

descriptionView Paper arrow_downwardDownload

Robust Uncertainty Quantification Through Integration of Distributed-Gauss-Newton Optimization With a Gaussian Mixture Model and Parallelized Sampling Algorithms

by Mariela Araujo

2024, SPE reservoir evaluation & engineering

Uncertainty quantification of production forecasts is crucially important for business planning of hydrocarbon-field developments. This is still a very challenging task, especially when subsurface uncertainties must be conditioned to... more

Uncertainty quantification of production forecasts is crucially important for business planning of hydrocarbon-field developments. This is still a very challenging task, especially when subsurface uncertainties must be conditioned to production data. Many different approaches have been proposed, each with their strengths and weaknesses. In this work, we develop a robust uncertainty-quantification work flow by seamless integration of a distributed-Gauss-Newton (GN) (DGN) optimization method with a Gaussian mixture model (GMM) and parallelized sampling algorithms. Results are compared with those obtained from other approaches. Multiple local maximum-a-posteriori (MAP) estimates are determined with the local-search DGN optimization method. A GMM is constructed to approximate the posterior probability-density function (PDF) by reusing simulation results generated during the DGN minimization process. The traditional acceptance/rejection (AR) algorithm is parallelized and applied to improve the quality of GMM samples by rejecting unqualified samples. AR-GMM samples are independent, identically distributed samples that can be directly used for uncertainty quantification of model parameters and production forecasts. The proposed method is first validated with 1D nonlinear synthetic problems with multiple MAP points. The AR-GMM samples are better than the original GMM samples. The method is then tested with a synthetic history-matching problem using the SPE01 reservoir model (Odeh 1981; Islam and Sepehrnoori 2013) with eight uncertain parameters. The proposed method generates conditional samples that are better than or equivalent to those generated by other methods, such as Markov-chain Monte Carlo (MCMC) and global-search DGN combined with the randomized-maximum-likelihood (RML) approach, but have a much lower computational cost (by a factor of five to 100). Finally, it is applied to a real-field reservoir model with synthetic data, with 235 uncertain parameters. A GMM with 27 Gaussian components is constructed to approximate the actual posterior PDF. There are 105 AR-GMM samples accepted from the 1,000 original GMM samples, and they are used to quantify the uncertainty of production forecasts. The proposed method is further validated by the fact that production forecasts for all AR-GMM samples are quite consistent with the production data observed after the history-matching period. The newly proposed approach for history matching and uncertainty quantification is quite efficient and robust. The DGN optimization method can efficiently identify multiple local MAP points in parallel. The GMM yields proposal candidates with sufficiently high acceptance ratios for the AR algorithm. Parallelization makes the AR algorithm much more efficient, which further enhances the efficiency of the integrated work flow.

descriptionView Paper arrow_downwardDownload

Limited memory switched Broyden method for faster image deblurring

by ichraf lahouli

2024

Iterative methods have gained a solid reputation for efficient image restoration, for both spatially invariant and spatially variant blurs. This paper shows how a "strap-on" quasi-Newton Broyden method can further accelerate the... more

descriptionView Paper arrow_downwardDownload

Limited memory switched Broyden method for faster image deblurring

by ichraf lahouli

2024, 2017 Fifteenth IAPR International Conference on Machine Vision Applications (MVA)

Iterative methods have gained a solid reputation for efficient image restoration, for both spatially invariant and spatially variant blurs. This paper shows how a “strap-on” quasi-Newton Broyden method can further accelerate the... more

descriptionView Paper arrow_downwardDownload

Information retrieval using the reduced row echelon form of a term-document matrix

by Duygu Celik Ertugrul

2024

Zontul, Metin (Arel Author)It is getting more difficult to retrieve relevant information regarding the user input query due to the large amount of information in the web. Unlike the conventional information retrieval (IR) algorithms, this... more

descriptionView Paper arrow_downwardDownload

Hybrid Approach to the Economic Dispatch Problem Using a Genetic and a Quasi-Newton Algorithms

by Bakhta Naama

2024, Acta Electrotechnica et …

In this paper, we present a hybrid optimization approach to solving the economic dispatch (ED) problem. The objective is to minimize the total fuel cost and keep the power flows within the security limits. The idea consists in combining... more

descriptionView Paper arrow_downwardDownload

Quasi-Newton Methods

Key research themes

1. How can variants of Newton's method achieve third-order convergence without requiring higher-order derivatives?

2. What explicit convergence rates and Hessian approximation properties can greedy quasi-Newton methods guarantee, and how do they differ from classical approaches?

3. How can quasi-Newton strategies be integrated with preconditioning in nonlinear conjugate gradient methods to enhance convergence and stability?

All papers in Quasi-Newton Methods

Quasi-Newton Methods

Key research themes

1. How can variants of Newton's method achieve third-order convergence without requiring higher-order derivatives?

2. What explicit convergence rates and Hessian approximation properties can greedy quasi-Newton methods guarantee, and how do they differ from classical approaches?

3. How can quasi-Newton strategies be integrated with preconditioning in nonlinear conjugate gradient methods to enhance convergence and stability?

Related Topics

All papers in Quasi-Newton Methods