Abstract
Retrieval-augmented large language models (R-LLMs) combine pre-trained large language models (LLMs) with information retrieval systems to improve the accuracy of factual question-answering. However, current libraries for building R-LLMs provide high-level abstractions without sufficient transparency for evaluating and optimizing prompts within specific inference processes such as retrieval and generation. To address this gap, we present RaLLe, an open-source framework designed to facilitate the development, evaluation, and optimization of R-LLMs for knowledge-intensive tasks. With RaLLe, developers can easily build and evaluate R-LLMs: they can refine hand-crafted prompts, assess individual inference processes, and quantitatively measure overall system performance. By leveraging these features, developers can enhance the performance and accuracy of their R-LLMs on knowledge-intensive generation tasks. We open-source our code at https://github.com/yhoshi3/RaLLe.
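For context, the inference chain that such a framework makes transparent is a retrieve-then-generate pipeline: a retriever selects passages, a prompt template binds them to the question, and an LLM generates the answer. The Python sketch below is a minimal illustration only; the function names (retrieve, build_prompt, answer), the toy lexical retriever, and the prompt template are assumptions for exposition, not RaLLe's actual API.

    from typing import Callable, List

    def retrieve(query: str, corpus: List[str], top_k: int = 3) -> List[str]:
        """Toy lexical retriever: rank passages by word overlap with the query."""
        q_terms = set(query.lower().split())
        ranked = sorted(
            corpus,
            key=lambda p: len(q_terms & set(p.lower().split())),
            reverse=True,
        )
        return ranked[:top_k]

    def build_prompt(query: str, passages: List[str]) -> str:
        """Hand-craftable prompt template; the kind of artifact a developer would iterate on."""
        context = "\n".join(f"- {p}" for p in passages)
        return (
            "Answer using only the context below.\n"
            f"Context:\n{context}\n"
            f"Question: {query}\nAnswer:"
        )

    def answer(query: str, corpus: List[str], llm: Callable[[str], str]) -> str:
        # Each intermediate product (retrieved passages, filled prompt) is a
        # concrete object that can be inspected, edited, and scored separately.
        passages = retrieve(query, corpus)
        return llm(build_prompt(query, passages))

In practice, llm would wrap any text-generation model or endpoint, and each stage (retrieval quality, prompt, final answer) could be evaluated separately against a knowledge-intensive QA benchmark, which is the kind of per-step transparency the abstract describes.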
Citation
Hoshi, Y., Miyashita, D., Ng, Y., Tatsuno, K., Morioka, Y., Torii, O., & Deguchi, J. (2023). RaLLe: A Framework for Developing and Evaluating Retrieval-Augmented Large Language Models. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: System Demonstrations (pp. 52–69). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.emnlp-demo.4