Transformers from Scratch in PyTorch
Highlights
Many people in the deep learning community (myself included) believe that an attention revolution is imminent โ that is, that attention-based models will soon replace most of the existing state-of-the-art methods. โคด๏ธ
I hope to bring a new perspective and encourage others to join the revolution. โคด๏ธ