Transformer
Definition
A deep learning architecture introduced in the 2017 paper "Attention Is All You Need" that replaces recurrence with self-attention, so every position in a sequence can attend to every other position in parallel. Transformers are the foundation of modern language models such as GPT, BERT, and T5, and were originally designed for sequence-to-sequence tasks such as machine translation.
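
Example
To make the self-attention idea concrete, here is a minimal sketch of scaled dot-product attention, softmax(QK^T / sqrt(d_k)) V, in plain Python. The function names, the toy token vectors, and the choice to reuse the tokens directly as queries, keys, and values are assumptions made for illustration. A real Transformer first applies learned projections and uses multiple attention heads.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(queries, keys, values):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        weights = softmax(scores)
        # Output is the attention-weighted average of the value vectors.
        out = [
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights

# Toy input: three token vectors of dimension 2 (made-up values).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# Assumption: tokens serve as Q, K, and V with no learned projections.
outputs, weights = self_attention(tokens, tokens, tokens)
```

Each row of `weights` sums to 1 and describes how strongly one token attends to every token in the sequence. No step depends on the result for a previous position, which is why this computation can run in parallel where a recurrent model cannot.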