LSTM
Long Short Term Memory networks – usually just called “LSTMs” – are a special kind of RNN, capable of learning long-term dependencies. They were introduced by Hochreiter & Schmidhuber (1997) ,and were refined and popularized by many people in following work. They work tremendously well on a large variety of problems, and are now widely used.
It's a multi-part series in which I am planning to cover the following:
- What is LSTM?
- The core idea behind LSTM
- Step by Step LSTM walk through
- Variants on Long Short Term Memory
- GRU VS LSTM