Last updated 5 months ago
Why Transformers over LSTM
Because LSTM take tokens sequentially, making them slow. Whereas transformers takes in all the tokens together, allowing for faster computation.