Long short term memory cells

description11 papers

group0 followers

lightbulbAbout this topic

Long Short-Term Memory (LSTM) cells are a type of recurrent neural network architecture designed to model temporal sequences and dependencies. They utilize memory gates to regulate the flow of information, enabling the network to retain or forget information over long periods, thus addressing the vanishing gradient problem in traditional RNNs.

lightbulbAbout this topic

Key research themes

1. How can LSTM architectures be optimized and extended to improve long-term dependency learning and memory retention in sequential data?

This research area focuses on architectural innovations and training methodologies that enhance the ability of LSTM networks to capture, retain, and utilize long-term dependencies. Improving memory retention and mitigating vanishing gradients are critical challenges in recurrent neural networks, and various structural modifications aim to address these through gating mechanisms, dimensionality expansion, and training algorithms.

Understanding and Controlling Memory in Recurrent Neural Networks

by Alexander Rivkind

2021

Key finding: Demonstrates that the stability and speed of slow points in RNN hidden state dynamics directly predict memory retention and extrapolation capabilities. The paper shows how different training protocols yield networks with... Read more

articleView Paper downloadDownload

Depth-Gated LSTM

by Ekaterina Vylomova

2022, ArXiv

Key finding: Introduces a depth gate connecting adjacent LSTM layers to create gated, linear dependence between memory cells across layers. This architectural modification facilitates better gradient flow in deep recurrent networks,... Read more

articleView Paper downloadDownload

Grid Long Short-Term Memory

by Jayakumar Munuswamy

2015

Key finding: Proposes Grid LSTM networks with LSTM cells arranged in multidimensional grids, enabling recurrent connections not only across temporal sequences but also across network depth and spatial dimensions. This structure enhances... Read more

articleView Paper downloadDownload

Improving the Gating Mechanism of Recurrent Neural Networks

by Çağlar Gülçehre

2023, ArXiv

Key finding: Identifies limitations in standard gating mechanisms that require gates operating near saturation for long-term memory retention, which impedes gradient-based learning. Proposes uniform gate initialization and a refined... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

2. What novel LSTM cell architectures best leverage multiple sequence dependencies for improved recognition tasks in multimodal and multi-view data?

This area investigates new LSTM cell designs that can jointly process multiple dependent input sequences, enabling richer representation learning for complex, correlated data such as multi-view images or multimodal inputs. These architectures go beyond conventional sequential LSTM cells by fusing information at gate or cell state levels, which enhances performance on recognition and classification tasks.

Novel Long Short-Term Memory Cell Architectures: Application to Light Field Face Recognition

by Paulo Correia

2025, arXiv (Cornell University)

Key finding: Develops two novel LSTM cell architectures, Gate-Level Fusion (GLF-LSTM) and State-Level Fusion (SLF-LSTM), that jointly learn from simultaneously acquired dependent input sequences (e.g., horizontal and vertical parallax... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

3. How can recurrent neural network architectures incorporating biological principles and novel training methods advance sequence modeling and neuronal activity estimation?

This theme encompasses models that draw inspiration from biological neural systems or integrate neuroscientific insights, including biologically plausible learning rules, neural dynamics interpretation, and application to neuronal activity data. It also covers new training approaches aiming to overcome limitations of backpropagation and extend RNN models to better capture biological temporal processes.

Towards New Generation, Biologically Plausible Deep Neural Network Learning

by Ognjen Arandjelovic

2023, Sci

Key finding: Proposes a biologically plausible learning framework that leverages local Hebbian synaptic plasticity combined with supervised and unsupervised signals, countering limitations of orthodox backpropagation such as non-locality... Read more

articleView Paper downloadDownload

The Synaptic Properties of Cells Define the Hallmarks of Interval Timing in a Recurrent Neural Network

by Hugo Merchant

2024, The Journal of Neuroscience

Key finding: Implements a recurrent network mimicking cortical ensembles with paired-pulse facilitation and slow inhibitory synaptic currents, producing interval-selective responses exhibiting behavioral timing hallmarks: bias... Read more

articleView Paper downloadDownload

Unsupervised learning of an efficient short-term memory network

by Wieland Brendel and

2015, Advances in neural information processing systems

Key finding: Derives local, biologically plausible learning rules for recurrent networks to efficiently represent current and past inputs by enforcing balance between feedforward and recurrent inputs. The approach leads to efficient... Read more

articleView Paper downloadDownload

Understanding and Controlling Memory in Recurrent Neural Networks

by Alexander Rivkind

2021

Key finding: Analyzes RNN hidden state dynamics as discrete-time dynamical systems, linking slow points to memory representations. Finds that training protocols impact memory stability and extrapolation ability, and modifies loss... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

All papers in Long short term memory cells

Comparison of N-BEATS with Standalone and Hybrid Deep Learning Models in Monthly Inflow Forecasting to the Aras Dam Reservoir: A Feature Selection Analysis

by Tarim Bilimleri

2025, Tarım Bilimleri Dergisi

Reservoir dams play a pivotal role in water resource management. Accurate prediction of inflow to reservoirs significantly enhances operational performance. While standalone artificial intelligence methods have recently been frequently used to predict inflow, hybrid models have shown quite more satisfactory success. In this study, various deep learning models, including MLP, GRU, LSTM, CNN, CNN-MLP, CNN-GRU, CNN-LSTM, CNN-GRU-MLP, and CNN-LSTM-MLP, were utilized to predict the monthly inflow to the Aras reservoir in the Azerbaijan-Iran region. The results were compared with the Neural Basis Expansion Analysis for Time Series Forecasting (N-BEATS) model for univariate forecasting and the NBEATSx model for multivariate forecasting using a monthly inflow time series dataset. To enhance prediction accuracy, the hyperparameters of the models were optimized. Additionally, to evaluate the impact of feature selection on model performance, five different scenarios were developed as combinations of input variables for forecasting one future time step. The evaluation metrics revealed that among the scenarios, Scenario 5 (comprising lagged inflows at months 1, 11, and 12; lagged average monthly precipitation in the upstream basin at months 1 and 12; the solar month counter; and a three-month moving average of monthly inflow) yielded the best results. Among the models, the hybrid CNN-LSTM-MLP demonstrated the highest prediction accuracy. Specifically, the performance metrics for this model and the best scenario included MAE, RMSE, PBIAS, R², KGE, and NSE, which were 8.78 m³/s, 12.95 m³/s, 1.5%, 0.89, 0.91, and 0.89, respectively. Conversely, the NBEATSx model exhibited suboptimal performance, with reduced accuracy as the number of input features increased, although the N-BEATS model performed well in univariate forecasting. This study highlights the high potential of hybrid deep learning models in accurately forecasting reservoir inflows and underscores their utility in enhancing water resource and reservoir operation management.

descriptionView Paper arrow_downwardDownload

Water level prediction using long short-term memory neural network model for a lowland river: a case study on the Tisza River, Central Europe

by István .Fehérváry

2024, Environmental Sciences Europe

Background Precisely predicting the water levels of rivers is critical for planning and supporting flood hazard and risk assessments and maintaining navigation, irrigation, and water withdrawal for urban areas and industry. In Hungary,... more

descriptionView Paper arrow_downwardDownload

An Effective Model-Based Trust Collaborative Filtering for Explainable Recommendations

by Hafed Zarzour

2023

Nowadays, many companies through the world wide web like YouTube, Netflix, Aliexpress and Amazon, provide personalized services as recommendations. Recommender systems use the related information about products or services to suggest the... more

descriptionView Paper arrow_downwardDownload

Comparative Analysis of Recurrent Neural Network Architectures for Reservoir Inflow Forecasting

by halit apaydin

2022, Water

Due to the stochastic nature and complexity of flow, as well as the existence of hydrological uncertainties, predicting streamflow in dam reservoirs, especially in semi-arid and arid areas, is essential for the optimal and timely use of... more

For modeling, ANN methods and four different RNN deep-network methods were employed. In total, 70% and 30% of the data were used in training and testing stages, respectively, as indicated in Figure 2. Besides this, several delayed daily mean streamflow are used as input data. For this purpose, the correlation between the time series of streamflows with its delays is obtained, and seven delays (SF t-1, SFt.2, SF 1-3, SFt-4, SFt-5, SF}-6, and SF;-7) are selected as inputs due to high correlation (Figure 3). rhe correlation coefficients are found to vary between 0.63 and 0.85 in seven-day lag conditions. Similar studies [7,8] have also used seven delays in flow time series as inputs.

Figure 3. Correlation plot between streamflow and its delays.

Figure 4. General structure of a neural network [29]. An ANN isa distributed knowledge treatment system in which performance essentials are alike to the human brain, and is based on a simulated biological neural network [28]. Each neural network has three layers, namely, input, hidden, and output. The input layer is a layer for providing data provided as inputs to the network. The output layer contains values predicted by the network. The hidden layer is the data analysis location. Usually, the number of selected neurons of the layers is obtained by trial and error. The general architecture of the ANN is displayed in Figure 4, where X (x1, X2, .., Xn) = inputs vector, W = connecting weights to the next layer, bj; = bias, and y; is the ANN final output. The activation function converts input signals into output signals. In Figure 4, N inputs are given from x, to xn to the counterpart weights Wy, to Wey. Initially, the weights are multiplied by their inputs, and then they are summed with the amount of bias to obtain u (Fauation (1)):

Figure 5. The flow chart of the modelling processes. *SF: Streamflow, t: delayed day, ANN: Artificial neural network, RNN: Recurrent neural network, Bi-LSTM: Bidirectional long short-term memory, GRU: Gated recurrent unit, LSTM: Long short-term memory, simple RNN: simple recurrent neural networks. Figure 5. The flow chart of the modelling processes. *SF: Streamflow, t: delayed day, ANN: Artificial

Figure 6. The general structure of recurrent neural network models. (a) Simple RNN, (b) LSTM, (c) GRU, and (d) Bi-LSTM models [19].

Figure 7. Structure of networks (a) before and (b) after applying dropout [38].

The learning rate is one of the hyperparameters necessary to find the optimal value and usually takes values between 1 and 1 x 107’. In fact, the learning rate expresses the size of the move steps by the network. Figure 8 compares the changes in the loss function versus the epoch based on the learning rate. Working with a low learning rate, it takes a long time to find the best solution. Besides, if the earning rate is too high, it rejects the optimal mode. Since a high learning rate is advantageous in early iterations, and a low one is advantageous in later iterations, a learning rate that slows down as the algorithm progresses is preferred [39]. Figure 8. Changes in the loss function vs. the epoch by the learning rate [40].

where X; is the observation parameter with a mean denoted by X; Y; is the prediction parameter with a mean denoted by Y; N is a number of instances. The more the two first criteria are closer to 1 and the next three values are closer to 0 show the better performance of the model. According to Chiew et al. [41], if NS > 0.90, the simulation is very acceptable; if 0.60 < NS < 0.90, the simulation is acceptable; and if NS < 0.60 as in this case, simulation is unacceptable.

The loss function, observational streamflow values and predicted values for training and testing ges computed by various methods are displayed in Figure 9. When the loss function and time-series iphs are examined, the most successful methods are LSTM and GRU. The loss charts of testing and ining stages of both methods overlap at the highest epoch value. According to the loss function graphs sure 9), with the reduction of the modeling error for the training stage, the error of the testing stage reases, and the distance between the two graph lines reduces. Therefore, it can be said that the dropou ction helps to prevent network overfitting. Besides, considering the changes in the loss function value sus epochs (and according to Figure 8) can ensure that the learning rate is selected correctly.

Figure 9. Loss function plots (a—e), observed and computed time series (f-j) for different methods in training and testing stages.

Table 1. Statistical characteristics of daily streamflow and rainfall.

Table 2. Results of different models *. Different models are run with different iterations. As seen in Table 2, the number of epochs as one of the influential parameters plays a basic role; so in fewer iterations, the accuracy of the model is low, and the error is greater. By increasing the number of iterations, the model gradually converges so that there is not a significant difference between the results of 300 and 500 epochs. For this reason, it is decided that a maximum of 500 epochs is sufficient. Another parameter that influences the accuracy of the models is the number of neurons in the hidden layers. If it is low, the model will not be able to simulate correctly, and if it is too high, there is a risk of overfitting. This problem is solved by using the dropout method, which in fact deactivates several neurons. The values of the learning rate (LR) and decay are also presented in Table 2. Decay occurs due to how much the learning rate is reduced in each step. As seen in Table 2, five different evaluation criteria are used to compare five different prediction methods. Each method has the best result with 300 or 500 epochs. The performance of the testing stage is generally 1-5% lower than that of the training stage. According to Table 2, among the methods used, the LSTM method performs best in 500 epochs with an accuracy of CC = 87% in the testing stage. This network has long-term memory, and its forget gate specifies how much previous memory is kept. The first step is the multiplication of input data in weights and then its summation with the bias, followed by output. In this step, the output is likely to be very different from the actual output. Therefore, errors are returned backward to update the weights and also biases. Comparison of some metrics by all methods used in the study are given in Table 3. In this table,

Table 3. Comparison of estimated values with observed values for all methods. Figure 10 shows the scatter of observed values versus the predicted values for both training and testing stages. Regarding the streamflow graphs (Figures 9 and 10), the networks yield poor results in peak periods and cannot simulate these periods well, except LSTM. One major reason of this is that streamflow abundance has very few high values, and the network cannot properly learn them. This happens even though the streamflow abundance has several low values, so the model can correctly and accurately be learned in the training stage. According to the data, the difference between the minimum and the maximum flow values is high, and the validity of these values is investigated during the study period based on precipitation in the region. In periods where peak streamflow is observed, the maximum rainfall is observed with a significant amount that indicates the correctness of peak flow values. However, among the methods applied, the LSTM network is better than other methods, and is able to simulate peak flow periods fairly. As noted above, the greatest feature of this network is its skill to learn long-term dependencies, and the forget gate makes the network keep or forget the desired amount of previous memory and thus helps to improve the modeling. The GRU method is similar to the LSTM method, and its results are close to those of LSTM. In contrast, the other methods failed to simulate peak flow values. The scatter plots in Figure 10 indicate a relatively small dispersion of observed and predicted data in training and testing stages compared to other methods. Kratzert et al. [19] used deep LSTM networks in similar research and demonstrated that these networks perform better than other networks and can be used in hydrological simulations as an application method.

descriptionView Paper arrow_downwardDownload

An Effective Model-Based Trust Collaborative Filtering for Explainable Recommendations

by Hafed Zarzour

2022, 2020 11th International Conference on Information and Communication Systems (ICICS)

descriptionView Paper arrow_downwardDownload

A long short-term memory deep learning framework for explainable recommendation

by Hafed Zarzour

2022, 2020 11th International Conference on Information and Communication Systems (ICICS)

Due to the growing quantity of information available on the Web, recommender systems have become crucial component for the success of online shopping stores. However, most of the existing recommender systems were only designed to improve... more

descriptionView Paper arrow_downwardDownload

Stock Market Indices Prediction with Various Neural Network Models

by Arun Babulo

2022

Stock market Indices prediction is one of the most important issues in the financial field. Although many prediction models have been developed during the last decade, they suffer a poor performance because indices movement is highly non... more

descriptionView Paper arrow_downwardDownload

Definition of artificial neural networks with comparison to other networks

by Gulgun Kayakutlu

2022, Procedia Computer Science

Definition of Artificial Neural Networks (ANNs) is made by computer scientists, artificial intelligence experts and mathematicians in various dimensions. Many of the definitions explain ANN by referring to graphics instead of giving well... more

descriptionView Paper arrow_downwardDownload

Single and Multilayer LSTM Models for Positive COVID-19 Cases Prediction

by Asmae EL KASSIRI

2022, Proceedings of the 2nd International Conference on Advanced Technologies for Humanity

COVID-19 is a global pandemic that has been reported first in Wuhan, China in December 2019. According to the World Health Organization (WHO), around 1 out of every 5 people who get COVID-19 get seriously ill and develop difficulty... more

descriptionView Paper arrow_downwardDownload

Deep Learning for Subtyping and Prediction of Diseases: Long-Short Term Memory

by Hayrettin Okut

2022, Deep Learning Applications

The long short-term memory neural network (LSTM) is a type of recurrent neural network (RNN). During the training of RNN architecture, sequential information is used and travels through the neural network from input vector to the output... more

descriptionView Paper arrow_downwardDownload

Investigating Explainability Methods in Recurrent Neural Network Architectures for Financial Time Series Data

by Warren Freeborough

2022, Applied Sciences

Statistical methods were traditionally primarily used for time series forecasting. However, new hybrid methods demonstrate competitive accuracy, leading to increased machine-learning-based methodologies in the financial sector. However,... more

descriptionView Paper arrow_downwardDownload

Joint Spatial and Temporal Modeling for Hydrological Prediction

by Yufeng YU

2022, IEEE Access

The accurate and timely estimation of river discharge plays an important role in hydrological modeling, especially for avoiding the consequences of flood events. The majority of existing work on hydrologic prediction focuses on modeling... more

descriptionView Paper arrow_downwardDownload

Stacked Ensemble of Recurrent Neural Networks for Predicting Turbocharger Remaining Useful Life

by Sepideh Pashami

2021, Applied Sciences

Predictive Maintenance (PM) is a proactive maintenance strategy that tries to minimize a system’s downtime by predicting failures before they happen. It uses data from sensors to measure the component’s state of health and make forecasts... more

Figure 1. Time to Event (TTE) and censored data.

Figure 2. Tensor representation of data. Figure 2 depicts structures of the designed tensor. There are two advantages in representing the tensors in this format. One is that it is compatible with the way RNNs in Tensorflow [23] consume the input: (samples, Ntimesteps, 1 features )- The other advantage is that since in this study the number of time steps and number of features are going to be varied (will be discussed in Section 5), this structure makes changing the size of each plane convenient. 3. Covariate Shift

Figure 3. Distributions governing train and test.

Figure 4. Distributions governing train and test. In this work, we detect the features with a significant shift, and remove them from training data. It is also possible to apply further transformations to compensate for a shift in those features. Several methods of handling covariate shift exist. For example, Kullback Leibler shift detection method measures the Kullback—Leibler (KL) divergence between splits of a dataset as a measure of distribution change [25]. Statistical test such as Student t-test and Fisher f-test can be used to calculate the shift in mean and variance [26]. There are more recent methods such as Intersection of Confidence Interval rule [27], in which at each point in time a confidence interval is calculated by estimating the standard deviation of a polynomial fitted at the neighborhood of that point. Then, if the confidence interval of a point in time does not have an intersection with the confidence intervals of the previous points in time, a shift will be detected. n [28], control charts which are graphical representations of sample statistics are used for detecting shifts in the mean of time series data. Exponentially weighted average of the previous observations is used to predict one time ahead. Using the predicted value and the control chart, the upper bound and lower bound of upcoming values are calculated. Then, if the observation value of one time ahead falls outside the interval (lower bound to upper bound) a shift will be detected.

Figure 5. Two-layer stacking architecture.

Figure 7. LSTM error results for choosing threshold for feature selection in Algorithm 1. For training the LSTMs, Mtimesteps in the tensor structure (Msampless Ntimesteps 1 features ) is 20, and 11 features comes from the selected threshold for Algorithm 1. The result of this experiment is shown in Figure 7, which indicates threshold of 0.8 as the best choice—which leads to 355 features.

Figure 8. Comparing stacked ensemble results to the target values. Figure 9. Comparing best model results to the target values. In order to better visualise the training process of base models in comparison to the meta model Figures 10 and 11 depict the best base model and meta model learning curve, respectively.

Please note that for both learning curves early stopping is applied (early stopping epoch for best base model is 63 and for meta model is 43). The error on both validation and training sets is lower for meta model, except at the beginning of the curves for the training set. The reason for the error of the base model being lower at the beginning for the training set is that mini-batch training is used for base models, but incremental training is used for meta model. It can also be seen that the evolution of the meta model error is much more stable in comparison to the base model. >a a a P , a es ie é . % ¢ ee | 1 +. + -

Figure 10. Learning curve of the best base model.

Figure 11. Learning curve of the meta model.

Figure 12. Comparing stacked ensemble results to the target values. ane > Se” ieee” | aaa: © aaa aii In Figure 12 we present how the correlation between base models and the meta model is related to the performance of the former. It can be seen that for base models with between 0.5 and 0.6 correlation to the meta model, the MAE values are almost steady (with only minor fluctuations). After that, as the correlation increases the MAE reduces until the correlation reaches to almost 0.7. From this point on, the MAE values seem to remain steady again. Decrease in MAE as the correlation increases is expected since the meta model is predicting target values; hence, higher correlation between the base models and the meta model means higher predictability and consequently lower MAE. 8. Conclusions

Algorithm 1 Covariate shift detection algorithm 4. Model Stacking

Table 1. LSTM architectures and parameters.

Table 2. CONV-LSTM architectures and parameters. Table 3. Meta model architecture and parameters.

Table 4. LightGBM setting and parameters.

Table 5. Model comparison based on MAE of the TTE predictions.

Table 6. Correlation statistics of base models.

descriptionView Paper arrow_downwardDownload

Flood Prediction and Uncertainty Estimation Using Deep Learning

by Steven Corns

2021, Water

Floods are a complex phenomenon that are difficult to predict because of their non-linear and dynamic nature. Therefore, flood prediction has been a key research topic in the field of hydrology. Various researchers have approached this... more

descriptionView Paper arrow_downwardDownload

PREDICTING STOCK MARKET INDICES USING NEURAL NETWORKS

by IAEME Publication

2021, IAEME PUBLICATION

Investing in stock markets is a decisive role for every investor. Speculation in the market makes an investor distressed about his investment. Hence predicting the exact stock market price at high accuracy helps investors to invest wisely... more

descriptionView Paper arrow_downwardDownload

A long short-term memory deep learning framework for explainable recommendation

by Hafed Zarzour

2021

Fig. 3. Accuracy for our model with three diversity values.

Figure 2 depicts the gradual degradation in the values of loss for our model using different diversity values achieved at 30" epoch of training. It can be observed that the loss values are degraded for all diversity values. ‘ig. 2. Training loss for our model with three diversity values.

descriptionView Paper arrow_downwardDownload

An Analytical Approach for Stock Market Forecasting Based on Machine Learning

by International Journal of Scientific Research in Science, Engineering and Technology IJSRSET

2021, International Journal of Scientific Research in Science, Engineering and Technology

Stock Market act as a mechanism for organizations to mean their capitals by introducing their organization shares to market and furthermore ends up being a helpful stage for investors to procure past the edge of interest rates of offered... more

descriptionView Paper arrow_downwardDownload

Stock Prediction using Neural Networks and Time Series Analysis Methods

by International Journal of Scientific Research in Computer Science, Engineering and Information Technology IJSRCSEIT

2020, International Journal of Scientific Research in Computer Science, Engineering and Information Technology

The stock market is considered to be one of the most highly complex financial systems which consist of various components or stocks, the price of which fluctuates greatly with respect to time. Stock market forecasting involves uncovering... more

descriptionView Paper arrow_downwardDownload

Music generation using Bidirectional Recurrent Neural Nets

by IRJET Journal

2020

The advancement in neural network and deep learning is enabling the use of these technologies in several art and other fields, and produces outcome which are similar to humans. This paper proposes a method to generate music, different... more

descriptionView Paper arrow_downwardDownload

Comparative Analysis of Recurrent Neural Network Architectures for Reservoir Inflow Forecasting

by K.W. Chau

2020, Water

descriptionView Paper arrow_downwardDownload

U-CNNpred: A Universal CNN-based Predictor for Stock Markets

by Ehsan Hoseinzade

2020, arXive

The performance of financial market prediction systems depends heavily on the quality of features it is using. While researchers have used various techniques for enhancing the stock specific features, less attention has been paid to... more

layers. This happens through Eq 10. The last step is updating weights of the CNN using calculated gradient

a graphical visualization of what we described just now. Figure 1: Graphical Visualization of 2D-CNNpred (Hoseinzade & Haratizadeh, 2019)

Figure 2: Performance of algorithms in having the best F-measure for predicting each of the 458 stocks

called the base predictor. 4.3. Transfer learning

Table 4: F-measure of some of the stocks in S&P 500 index

Table 5: F-measure of base predictor with different layers

Table 6: F-measure of algorithms in prediction of U.S. indices

Table 7: F-measure of algorithms in prediction of world's famous indices

The list of features that were used in this research is represented in Table 8:

References \hmadi, E., Jasemi, M., Monplaisir, L., Nabavi, M. A.. Mahmoodi, A., & Jam, P. A.

descriptionView Paper arrow_downwardDownload

Share Price Prediction using Machine Learning Technique

by Naresh Edupuganti

2019

Stock Market has started to attract more people from academics and business point of view which has increased. So this paper is mostly based on the approach of predicting the share price using Long Short Term Memory (LSTM) and Recurrent... more

Fig 1 Sliding window example where it helps to predict the next price [8] We know that stock price can be represented based on time series of given length NN, Defined as PO, Pl, P2.....P N- 1P0,P1,....PN-1 in which PiPi in eqn. 3 and eqn. 4 can be closed price for the day ii, 0 <i < NO <i<N. Then next is that we have declare the imagined window of sliding window of fixed size Ww which then be declared as the Input size for the program and everytime we move the sliding window there is no overlap between the sets of data [8](Fig 1). The model which we are using has the LSTM cells and the model is RNN now let’s form the input structure to provide the machine learning part to train the model taking the first sliding window and time as Wt for time t and WO for window. The model which we are using has the LSTM cells and the

The following 1s the configuration used to conduct the experiment the model. The configuration for the experiment is given in Table 1. We train all the stock price of NSE data of last 24 months and check the result here we are going to discuss result obtain by SP500 where the data display the result for the past 200 days with the predicted value where we can compare with the actual data as shown in the Fig. 4 with Istm_size 32 and input_size 1, Fig. 5 with Istm_size 128 and input_size | Fig. 6 input_size 5 lstm_size 128 and max_epoch 75 (instead 50). to predict the future and compare the result to learn [10]. adjust the error and also use backpropagation to avoid scale out factor then we finally get the output and train that values te eR AAS tea Pte. oe Apne. the seaenle HR lace TI) given in Table 1. We train all the stock price of NSE data of adjust the error and also use backpropagation to avoid scale last 24 months and check the result here we are going to

We can observe that the model gets efficient as we improve the configuration and set them according to the environment we observe in Fig 4 that with less lstm_size the error is more and the prediction is not efficient when compared to the increased Istm_size in Fig. 5 we can still improve the prediction efficiency by increasing the max_epoch so from this we can conclude that the model gets efficient by understanding the nature of the model and setting the model with the proper configuration to predict the model [8].

Table | Default Configuration input_size | Fig. 6 input_size 5 Istm_size 128 and max_epoch can compare with the actual data as shown in the Fig. 4 with

descriptionView Paper arrow_downwardDownload

Over view on stock predictor using machine learning and deep learning

by Ijariit Journal

2019, International Journal of Advance Research, Ideas and Innovations in Technology

The trust which has grown in the power of deep learning models has made the finance industry very eager and with dynamical attitude to apply them in practice as it has been for the past few years. There are a variety of methods and... more

Shreelekshmy S et al [11] has proposed a technique to forecast Coskun H et al [16] has proposed a model that compares direct and iterative ANN forecasting approaches in multi-periodic time series forecasting. When performed using the direct or iterative method, the result of the method is compared using grey relational analysis to obtain a technique that gives good results.

descriptionView Paper arrow_downwardDownload

Definition of artificial neural networks with comparison to other networks

by Erkam Guresen

2011, Procedia Computer Science

descriptionView Paper arrow_downwardDownload