Abstract
Transformers play a central role in the inner workings of large language models. We develop a mathematical framework for analyzing transformers based on their interpretation as interacting particle systems, with a particular emphasis on long-time clustering behavior. Our study explores the underlying theory and offers new perspectives for mathematicians as well as computer scientists.
Citation
Geshkovski, B., Letrouit, C., Polyanskiy, Y., & Rigollet, P. (2025). A mathematical perspective on transformers. Bulletin of the American Mathematical Society, 62(3), 427–479. https://doi.org/10.1090/bull/1863