Development and analysis of medical instruction-tuning for Japanese large language models

Issey Sukeda; Masahiro Suzuki; Hiroki Sakaji; Satoshi Kodera

Journal ArticleOPEN ACCESS

Development and analysis of medical instruction-tuning for Japanese large language models

Artificial Intelligence in Health (2024) 1(2) 107-116

DOI: 10.36922/aih.2695

4Citations

9Readers

Get full text

Abstract

In the ongoing wave of impact driven by large language models (LLMs) like ChatGPT, the adaptation of LLMs to the medical domain has emerged as a crucial research frontier. Since mainstream LLMs tend to be designed for general-purpose applications, constructing a medical LLM through domain adaptation is a huge challenge. While instruction-tuning, particularly based on low-rank adaptation (LoRA), has become a frequently employed strategy to fine-tune LLMs recently, its precise roles in domain adaptation remain unknown. Here, we investigated how LoRA-based instruction-tuning improves the performance of Japanese medical question-answering tasks by employing a multifaceted evaluation of multiple-choice questions, including scoring based on “Exact match” and “Gestalt distance” in addition to the conventional accuracy. Our findings suggest that LoRA-based instruction-tuning can partially incorporate domain-specific knowledge into LLMs, with larger models demonstrating more pronounced effects. Furthermore, our results underscore the potential of adapting English-centric models for Japanese applications in domain adaptation, while also highlighting the persisting limitations of Japanese-centric models. This initiative represents a pioneering effort in enabling medical institutions to fine-tune and operate models without relying on external services.

Author supplied keywords

Cite

CITATION STYLE

APA

Sukeda, I., Suzuki, M., Sakaji, H., & Kodera, S. (2024). Development and analysis of medical instruction-tuning for Japanese large language models. Artificial Intelligence in Health, 1(2), 107–116. https://doi.org/10.36922/aih.2695

Development and analysis of medical instruction-tuning for Japanese large language models

Abstract

Author supplied keywords

Cite

Register to see more suggestions