Transformers

Why Transformers over LSTMs

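  • LSTMs process tokens sequentially: each step depends on the hidden state from the previous step, so the work across a sequence cannot be parallelized, which makes them slow. Transformers take in all the tokens at once, so the computation for every position can run in parallel, allowing for faster computation.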
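
The contrast above can be illustrated with a rough sketch in plain Python. It is a toy under stated assumptions, not a real model: tokens are single floats, the recurrent cell is a plain tanh update standing in for a full LSTM cell, the attention uses identity projections instead of learned ones, and the names rnn_forward, attend, and self_attention and the weights w_x and w_h are hypothetical.

```python
import math


def rnn_forward(tokens, w_x=0.5, w_h=0.5):
    """Recurrent pass: step t cannot start until step t-1 has produced h."""
    h = 0.0
    states = []
    for x in tokens:
        # The new state depends on the previous state, forcing sequential order.
        h = math.tanh(w_x * x + w_h * h)
        states.append(h)
    return states


def attend(i, tokens):
    """Attention output for position i, computed from the raw inputs only."""
    scores = [tokens[i] * k for k in tokens]
    top = max(scores)  # subtract the max for a numerically stable softmax
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    return sum(w * v for w, v in zip(weights, tokens)) / total


def self_attention(tokens):
    """No position depends on another position's output, so on parallel
    hardware all of these calls could run at the same time."""
    return [attend(i, tokens) for i in range(len(tokens))]


if __name__ == "__main__":
    sequence = [0.1, 0.4, -0.2, 0.7]
    print(rnn_forward(sequence))
    print(self_attention(sequence))
```

In rnn_forward the loop order is fixed by the data dependency on h. In self_attention the list comprehension is only a convenience: attend(i, tokens) can be evaluated for any position in any order, which is the property that lets real implementations batch the whole sequence into matrix multiplications.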
