Calculon: a Methodology and Tool for High-Level Codesign of Systems and Large Language Models

Mikhail Isaev; Nic McDonald; Larry Dennison; Richard Vuduc

Conference ProceedingsOPEN ACCESS

Calculon: a Methodology and Tool for High-Level Codesign of Systems and Large Language Models

International Conference for High Performance Computing, Networking, Storage and Analysis, SC (2023)

DOI: 10.1145/3581784.3607102

N/ACitations

37Readers

Get full text

Abstract

This paper presents a parameterized analytical performance model of transformer-based Large Language Models (LLMs) for guiding high-level algorithm-architecture codesign studies. This model de-rives from an extensive survey of performance optimizations that have been proposed for the training and inference of LLMs; the model's parameters capture application characteristics, the hardware system, and the space of implementation strategies. With such a model, we can systematically explore a joint space of hardware and software configurations to identify optimal system designs under given constraints, like the total amount of system memory. We implemented this model and methodology in a Python-based open-source tool called Calculon. Using it, we identified novel system designs that look significantly different from current inference and training systems, showing quantitatively the estimated potential to achieve higher efficiency, lower cost, and better scalability.

Cite

CITATION STYLE

APA

Isaev, M., McDonald, N., Dennison, L., & Vuduc, R. (2023). Calculon: a Methodology and Tool for High-Level Codesign of Systems and Large Language Models. In International Conference for High Performance Computing, Networking, Storage and Analysis, SC. IEEE Computer Society. https://doi.org/10.1145/3581784.3607102

Calculon: a Methodology and Tool for High-Level Codesign of Systems and Large Language Models

Abstract

Cite

Register to see more suggestions