An abstract diagram representing the architecture of a transformer model.

Attention Is All You Need

Google researchers publish "Attention Is All You Need," introducing the Transformer architecture. This model becomes the foundation for most state-of-the-art NLP models, including GPT and BERT.