Transformers from Scratch in PyTorch

#omnivore

Highlights

Many people in the deep learning community (myself included) believe that an attention revolution is imminent; that is, that attention-based models will soon replace most of the existing state-of-the-art methods.

I hope to bring a new perspective and encourage others to join the revolution.