Academia.edu

Support Vector Machines for regression

7 papers
1 follower

About this topic
Support Vector Machines for regression (SVR) is a supervised learning algorithm that extends Support Vector Machines to predict continuous outcomes. It identifies a function that approximates the relationship between input features and target values by minimizing prediction error while maintaining a margin of tolerance, thus ensuring robustness against overfitting.
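The "margin of tolerance" mentioned above is the ε-insensitive tube: training points whose residuals fall within ε of the fitted function incur no loss. A minimal sketch of this, assuming scikit-learn's `SVR` API (the data and parameter values are illustrative, not from any cited paper):

```python
import numpy as np
from sklearn.svm import SVR

rng = np.random.default_rng(0)
X = np.sort(rng.uniform(0, 5, size=(80, 1)), axis=0)
y = np.sin(X).ravel() + rng.normal(scale=0.1, size=80)

# epsilon sets the width of the no-penalty tube around the fitted function;
# C trades flatness of the function against tolerance of larger deviations.
model = SVR(kernel="rbf", C=10.0, epsilon=0.1)
model.fit(X, y)
pred = model.predict(X)
```

Only the points on or outside the ε-tube become support vectors, which is why a well-chosen ε yields a sparse model.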

Key research themes

1. How can Support Vector Machines be adapted and optimized for effective regression tasks in time series forecasting and high-dimensional data?

This area focuses on extending SVM methodologies, particularly Support Vector Regression (SVR), to handle complex regression problems such as time series prediction and high-dimensional feature spaces. It examines adaptations including different loss functions, online and incremental learning algorithms, kernel selections, and integration with optimization and dimensionality reduction techniques. These adaptations aim to improve generalization, computational efficiency, and applicability in real-world scenarios involving nonlinear and large-volume data.

Key finding: Introduced an exact incremental algorithm for ε-insensitive SVR that allows addition, removal, and updating of training points, enabling online regression learning, particularly useful for streaming data or...
Key finding: Proposed a novel convex ϵ-penalty loss function generalizing and encompassing ϵ-insensitive and Laplace losses, leading to two new SVR variants with either L2 or L1 norm regularization. These models utilize penalization...
Key finding: Reviewed and integrated principal component methods with SVR to handle high-dimensional data where p (features) > n (samples), overcoming matrix inversion issues in least squares regression. Demonstrated that combining...
Key finding: Developed a hybrid prediction framework combining SVR with recurrent neural networks (RNN) optimized via metaheuristic parameter tuning, addressing limitations of traditional RNN training and improving prediction robustness...
Key finding: Applied SVR with comprehensive kernel parameter tuning and innovative windowing data preprocessing for forecasting rainfall time series, demonstrating that careful parameter selection and input preprocessing significantly...
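The windowing preprocessing that several of these time-series studies rely on unrolls a series into (lag window → next value) pairs so a standard SVR can be trained on them. A sketch under those assumptions, using scikit-learn's `SVR` and a toy sine series in place of real data:

```python
import numpy as np
from sklearn.svm import SVR

def make_windows(series, width):
    """Turn a 1-D series into lag-window inputs and next-step targets."""
    X = np.array([series[i:i + width] for i in range(len(series) - width)])
    y = series[width:]
    return X, y

t = np.arange(300, dtype=float)
series = np.sin(0.1 * t)  # toy stand-in for a real rainfall/stock series

X, y = make_windows(series, width=10)
# Train on the first 250 windows, forecast one step ahead on the rest.
model = SVR(kernel="rbf", C=10.0, epsilon=0.01).fit(X[:250], y[:250])
preds = model.predict(X[250:])
```

The window width is itself a tunable choice, which is why the studies above pair windowing with careful kernel-parameter search.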

2. What methods effectively enhance computational efficiency and scalability of Support Vector Regression models, particularly with nonlinear kernels?

Given the computational and memory costs associated with large-scale SVR models, especially those using nonlinear kernels like RBF, this theme explores algorithmic and approximation techniques to speed up prediction and training times without significant accuracy loss. It examines methods such as kernel approximation, online/incremental learning, and parallel/distributed computation, focusing on making SVR applicable to real-time and big data contexts.

Key finding: Presented a second-order Maclaurin series approximation for RBF kernel evaluation in SVR that reduces prediction complexity from dependence on the number of support vectors to quadratic in input dimensionality, enabling...
Key finding: Introduced an incremental online learning algorithm for ε-insensitive SVR allowing dynamic modification of training points and immediate model updates without retraining from scratch, thereby increasing computational...
Key finding: Developed PSOGS, a hybrid hyperparameter optimization algorithm combining Particle Swarm Optimization and Grid Search to efficiently tune SVR hyperparameters (C, ε, γ), demonstrating improved prediction accuracy and...
Key finding: Conducted an empirical comparison of explicit parallelization (e.g., SMO with hand-parallelized components) versus implicit parallelization approaches (expressing SVM algorithms with large dense linear algebra operations to...
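One common way to realize the kernel-approximation idea above is to map inputs into an explicit low-rank approximation of the RBF feature space and fit a linear SVR there, so prediction cost no longer scales with the number of support vectors. A sketch assuming scikit-learn's `Nystroem` transformer and `LinearSVR` (these are scikit-learn components, not the specific approximations proposed in the papers above):

```python
import numpy as np
from sklearn.kernel_approximation import Nystroem
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVR

rng = np.random.default_rng(1)
X = rng.uniform(-3, 3, size=(1000, 2))
y = np.sin(X[:, 0]) * np.cos(X[:, 1])  # illustrative nonlinear target

# Approximate the RBF feature map with 100 landmark points, then fit a
# linear epsilon-SVR in that feature space.
approx = make_pipeline(
    Nystroem(kernel="rbf", gamma=1.0, n_components=100, random_state=0),
    LinearSVR(C=10.0, epsilon=0.01, max_iter=10000),
)
approx.fit(X, y)
```

The number of landmark components is the accuracy/speed dial: fewer components mean faster prediction but a coarser approximation of the exact kernel model.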

3. How do Support Vector Regression models perform in practical predictive applications such as stock market, rainfall prediction, student performance, and hydrological drought forecasting?

This theme investigates the application of SVR models tailored through domain-specific preprocessing, feature selection, and hybrid modeling strategies to improve prediction accuracy and reliability in real-world regression problems involving complex nonlinear and time-dependent data. It emphasizes practical insights from parameter tuning, integration with windowing or dimensionality reduction, and comparison with alternative regression methods in various domains.

Key finding: Demonstrated that incorporating different windowing operators as data preprocessing inputs significantly enhances SVR's prediction accuracy for rainfall time series on a four-year Dhaka Stock Exchange dataset, with evaluation...
Key finding: Applied SVR alongside Gene Expression Programming and M5 model trees to model drought indices including Standardized Streamflow Index (SSI), finding SVR competitive for predicting monthly precipitation and recommending M5...
Key finding: Built machine learning models using SVR and logistic regression to predict diabetes risk and disease state with high accuracy (90-92%), emphasizing model scalability, usability, and reliability in handling patient datasets,...
Key finding: Developed an improved nonlinear regression prediction model using a hybrid of SVR and recurrent neural networks (RNN), applying metaheuristic optimization for parameter tuning, leading to superior performance compared to...
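The applied studies above all hinge on comparing SVR against alternative regression methods under a common evaluation protocol. A hedged sketch of that kind of comparison, using cross-validated R² on synthetic data (data, baseline, and parameters are illustrative, not taken from any cited study):

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVR

rng = np.random.default_rng(2)
X = rng.uniform(-2, 2, size=(200, 1))
y = X.ravel() ** 2 + rng.normal(scale=0.1, size=200)  # nonlinear target

# 5-fold cross-validated R^2 for an RBF SVR versus a linear baseline.
svr_score = cross_val_score(SVR(kernel="rbf", C=10.0), X, y,
                            cv=5, scoring="r2").mean()
lin_score = cross_val_score(LinearRegression(), X, y,
                            cv=5, scoring="r2").mean()
```

On a genuinely nonlinear target the RBF SVR should dominate the linear baseline; on near-linear domain data the gap often vanishes, which is why these papers report per-domain comparisons rather than a blanket ranking.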

All papers in Support Vector Machines for regression

The ability of Minkowski Functionals to characterize local structure in different biological tissue types has been demonstrated in a variety of medical image processing tasks. We introduce anisotropic Minkowski Functionals (AMFs) as a...
Hydrological droughts are characterized based on their duration, severity, and magnitude. Among the most critical factors, precipitation, evapotranspiration, and runoff are essential in modeling the droughts. In this study, three indices...