Academia.eduAcademia.edu

Long short term memory cells

description11 papers
group0 followers
lightbulbAbout this topic
Long Short-Term Memory (LSTM) cells are a type of recurrent neural network architecture designed to model temporal sequences and dependencies. They utilize memory gates to regulate the flow of information, enabling the network to retain or forget information over long periods, thus addressing the vanishing gradient problem in traditional RNNs.
lightbulbAbout this topic
Long Short-Term Memory (LSTM) cells are a type of recurrent neural network architecture designed to model temporal sequences and dependencies. They utilize memory gates to regulate the flow of information, enabling the network to retain or forget information over long periods, thus addressing the vanishing gradient problem in traditional RNNs.

Key research themes

1. How can LSTM architectures be optimized and extended to improve long-term dependency learning and memory retention in sequential data?

This research area focuses on architectural innovations and training methodologies that enhance the ability of LSTM networks to capture, retain, and utilize long-term dependencies. Improving memory retention and mitigating vanishing gradients are critical challenges in recurrent neural networks, and various structural modifications aim to address these through gating mechanisms, dimensionality expansion, and training algorithms.

Key finding: Demonstrates that the stability and speed of slow points in RNN hidden state dynamics directly predict memory retention and extrapolation capabilities. The paper shows how different training protocols yield networks with... Read more
Key finding: Introduces a depth gate connecting adjacent LSTM layers to create gated, linear dependence between memory cells across layers. This architectural modification facilitates better gradient flow in deep recurrent networks,... Read more
Key finding: Proposes Grid LSTM networks with LSTM cells arranged in multidimensional grids, enabling recurrent connections not only across temporal sequences but also across network depth and spatial dimensions. This structure enhances... Read more
Key finding: Identifies limitations in standard gating mechanisms that require gates operating near saturation for long-term memory retention, which impedes gradient-based learning. Proposes uniform gate initialization and a refined... Read more

2. What novel LSTM cell architectures best leverage multiple sequence dependencies for improved recognition tasks in multimodal and multi-view data?

This area investigates new LSTM cell designs that can jointly process multiple dependent input sequences, enabling richer representation learning for complex, correlated data such as multi-view images or multimodal inputs. These architectures go beyond conventional sequential LSTM cells by fusing information at gate or cell state levels, which enhances performance on recognition and classification tasks.

Key finding: Develops two novel LSTM cell architectures, Gate-Level Fusion (GLF-LSTM) and State-Level Fusion (SLF-LSTM), that jointly learn from simultaneously acquired dependent input sequences (e.g., horizontal and vertical parallax... Read more

3. How can recurrent neural network architectures incorporating biological principles and novel training methods advance sequence modeling and neuronal activity estimation?

This theme encompasses models that draw inspiration from biological neural systems or integrate neuroscientific insights, including biologically plausible learning rules, neural dynamics interpretation, and application to neuronal activity data. It also covers new training approaches aiming to overcome limitations of backpropagation and extend RNN models to better capture biological temporal processes.

Key finding: Proposes a biologically plausible learning framework that leverages local Hebbian synaptic plasticity combined with supervised and unsupervised signals, countering limitations of orthodox backpropagation such as non-locality... Read more
Key finding: Implements a recurrent network mimicking cortical ensembles with paired-pulse facilitation and slow inhibitory synaptic currents, producing interval-selective responses exhibiting behavioral timing hallmarks: bias... Read more
Key finding: Derives local, biologically plausible learning rules for recurrent networks to efficiently represent current and past inputs by enforcing balance between feedforward and recurrent inputs. The approach leads to efficient... Read more
Key finding: Analyzes RNN hidden state dynamics as discrete-time dynamical systems, linking slow points to memory representations. Finds that training protocols impact memory stability and extrapolation ability, and modifies loss... Read more

All papers in Long short term memory cells

Reservoir dams play a pivotal role in water resource management. Accurate prediction of inflow to reservoirs significantly enhances operational performance. While standalone artificial intelligence methods have recently been frequently... more
Background Precisely predicting the water levels of rivers is critical for planning and supporting flood hazard and risk assessments and maintaining navigation, irrigation, and water withdrawal for urban areas and industry. In Hungary,... more
Nowadays, many companies through the world wide web like YouTube, Netflix, Aliexpress and Amazon, provide personalized services as recommendations. Recommender systems use the related information about products or services to suggest the... more
Due to the stochastic nature and complexity of flow, as well as the existence of hydrological uncertainties, predicting streamflow in dam reservoirs, especially in semi-arid and arid areas, is essential for the optimal and timely use of... more
Nowadays, many companies through the world wide web like YouTube, Netflix, Aliexpress and Amazon, provide personalized services as recommendations. Recommender systems use the related information about products or services to suggest the... more
Due to the growing quantity of information available on the Web, recommender systems have become crucial component for the success of online shopping stores. However, most of the existing recommender systems were only designed to improve... more
Stock market Indices prediction is one of the most important issues in the financial field. Although many prediction models have been developed during the last decade, they suffer a poor performance because indices movement is highly non... more
Definition of Artificial Neural Networks (ANNs) is made by computer scientists, artificial intelligence experts and mathematicians in various dimensions. Many of the definitions explain ANN by referring to graphics instead of giving well... more
COVID-19 is a global pandemic that has been reported first in Wuhan, China in December 2019. According to the World Health Organization (WHO), around 1 out of every 5 people who get COVID-19 get seriously ill and develop difficulty... more
The long short-term memory neural network (LSTM) is a type of recurrent neural network (RNN). During the training of RNN architecture, sequential information is used and travels through the neural network from input vector to the output... more
Statistical methods were traditionally primarily used for time series forecasting. However, new hybrid methods demonstrate competitive accuracy, leading to increased machine-learning-based methodologies in the financial sector. However,... more
The accurate and timely estimation of river discharge plays an important role in hydrological modeling, especially for avoiding the consequences of flood events. The majority of existing work on hydrologic prediction focuses on modeling... more
Predictive Maintenance (PM) is a proactive maintenance strategy that tries to minimize a system’s downtime by predicting failures before they happen. It uses data from sensors to measure the component’s state of health and make forecasts... more
Floods are a complex phenomenon that are difficult to predict because of their non-linear and dynamic nature. Therefore, flood prediction has been a key research topic in the field of hydrology. Various researchers have approached this... more
Investing in stock markets is a decisive role for every investor. Speculation in the market makes an investor distressed about his investment. Hence predicting the exact stock market price at high accuracy helps investors to invest wisely... more
Due to the growing quantity of information available on the Web, recommender systems have become crucial component for the success of online shopping stores. However, most of the existing recommender systems were only designed to improve... more
Stock Market act as a mechanism for organizations to mean their capitals by introducing their organization shares to market and furthermore ends up being a helpful stage for investors to procure past the edge of interest rates of offered... more
The stock market is considered to be one of the most highly complex financial systems which consist of various components or stocks, the price of which fluctuates greatly with respect to time. Stock market forecasting involves uncovering... more
The advancement in neural network and deep learning is enabling the use of these technologies in several art and other fields, and produces outcome which are similar to humans. This paper proposes a method to generate music, different... more
Due to the stochastic nature and complexity of flow, as well as the existence of hydrological uncertainties, predicting streamflow in dam reservoirs, especially in semi-arid and arid areas, is essential for the optimal and timely use of... more
The performance of financial market prediction systems depends heavily on the quality of features it is using. While researchers have used various techniques for enhancing the stock specific features, less attention has been paid to... more
Stock Market has started to attract more people from academics and business point of view which has increased. So this paper is mostly based on the approach of predicting the share price using Long Short Term Memory (LSTM) and Recurrent... more
The trust which has grown in the power of deep learning models has made the finance industry very eager and with dynamical attitude to apply them in practice as it has been for the past few years. There are a variety of methods and... more
Definition of Artificial Neural Networks (ANNs) is made by computer scientists, artificial intelligence experts and mathematicians in various dimensions. Many of the definitions explain ANN by referring to graphics instead of giving well... more
Download research papers for free!