Glossary · Technical concept
Transformer
The neural-network architecture underlying most modern foundation models. Introduced by Vaswani et al. (2017). Key innovation: the attention mechanism that lets every token in a sequence interact with every other token. Transformers underpin LLMs, vision-transformer image models, and most multimodal systems.