Few-shot training LLMs for project-specific code-summarization

Abstract

Very large language models (LLMs), such as GPT-3 and Codex, have achieved state-of-the-art performance on several natural-language tasks, and also show great promise for code. A particularly exciting aspect of LLMs is their knack for few-shot and zero-shot learning: they can learn to perform a task with very few examples. Few-shotting has particular synergies in software engineering, where many phenomena (identifier names, APIs, terminology, coding patterns) are known to be highly project-specific. However, project-specific data can be quite limited, especially early in the history of a project; thus the few-shot learning capacity of LLMs might be very relevant. In this paper, we investigate the use of few-shot training with the very large GPT (Generative Pre-trained Transformer) Codex model, and find evidence suggesting that, by leveraging project-specific training, one can significantly surpass state-of-the-art models for code summarization.
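The core idea described in the abstract, prompting an LLM with a handful of (code, summary) pairs drawn from the same project before asking it to summarize a new function, can be illustrated with a minimal prompt-construction sketch. The function name, the delimiter layout, and the example pairs below are illustrative assumptions, not the authors' exact prompt design; the sketch only shows the general shape of a project-specific few-shot prompt.

```python
from typing import List, Tuple


def build_few_shot_prompt(examples: List[Tuple[str, str]], target_code: str) -> str:
    """Assemble a few-shot prompt from same-project (code, summary) pairs,
    followed by the function we want summarized.

    The "# Code" / "# Summary" delimiters are an illustrative assumption,
    not the exact prompt layout used in the paper.
    """
    parts = []
    for code, summary in examples:
        parts.append(f"# Code\n{code}\n# Summary\n{summary}\n")
    # Leave the final summary slot empty so the model's continuation fills it in.
    parts.append(f"# Code\n{target_code}\n# Summary\n")
    return "\n".join(parts)


if __name__ == "__main__":
    # Hypothetical project-specific examples; identifiers and summaries are made up.
    project_examples = [
        ("def load_config(path):\n    return json.load(open(path))",
         "Loads the project configuration from a JSON file."),
        ("def save_config(cfg, path):\n    json.dump(cfg, open(path, 'w'))",
         "Writes the project configuration back to disk."),
    ]
    target = "def reset_config(path):\n    save_config(default_config(), path)"

    prompt = build_few_shot_prompt(project_examples, target)
    print(prompt)
    # The assembled prompt would then be sent to a Codex-style completion
    # endpoint; the text the model generates after the final "# Summary"
    # line is taken as the candidate summary.
```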

Citation (APA)
Ahmed, T., & Devanbu, P. (2022). Few-shot training LLMs for project-specific code-summarization. In ACM International Conference Proceeding Series. Association for Computing Machinery. https://doi.org/10.1145/3551349.3559555
