All terms
Architectures
Long Short-Term Memory
A recurrent network with gates that carry information across many steps, easing vanishing gradients.
Definition
Long Short-Term Memory is a recurrent neural network cell with three gates (input, forget, and output) that control what enters, is retained in, and leaves a long-term cell state. This gating eases the vanishing gradient problem (where plain recurrent networks, or RNNs, lose track of early inputs during training) and lets the cell carry information across many steps. LSTMs were the workhorse for machine translation, speech recognition, and time-series tasks before Transformers became dominant.