Enhancement of the Performance of Large Language Models in Diabetes Education through Retrieval-Augmented Generation: Comparative Study
0301 basic medicine
Original Paper
Diabetes education
Computer applications to medicine. Medical informatics
R858-859.7
Claude-2
RAG
GPT-4.0
Retrieval-augmented generation
Google Bard
0302 clinical medicine
LLMs
Large language models
Public aspects of medicine
RA1-1270
DOI:
10.2196/58041
Publication Date:
2024-07-24T11:38:23Z
AUTHORS (15)
ABSTRACT
Background Large language models (LLMs) demonstrated advanced performance in processing clinical information. However, commercially available LLMs lack specialized medical knowledge and remain susceptible to generating inaccurate Given the need for self-management diabetes, patients commonly seek information online. We introduce Retrieval-augmented Information System Enhancement (RISE) framework evaluate its enhancing provide accurate responses diabetes-related inquiries. Objective This study aimed potential of RISE framework, an retrieval augmentation tool, improve LLM’s accurately safely respond Methods The RISE, innovative comprises 4 steps: rewriting query, retrieval, summarization, execution. Using a set 43 common questions, we evaluated 3 base (GPT-4, Anthropic Claude 2, Google Bard) their RISE-enhanced versions respectively. Assessments were conducted by clinicians accuracy comprehensiveness understandability. Results integration significantly improved from all LLMs. On average, percentage increased 12% (15/129) with RISE. Specifically, rates 7% (3/43) GPT-4, 19% (8/43) 9% (4/43) Bard. also enhanced response comprehensiveness, mean scores improving 0.44 (SD 0.10). Understandability was 0.19 0.13) on average. Data collection September 30, 2023 February 5, 2024. Conclusions improves LLMs’ responding inquiries, accuracy, These improvements have crucial implications RISE’s future role patient education chronic illness self-management, which contributes relieving resource pressures raising public awareness knowledge.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (53)
CITATIONS (15)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....