Distilling large language models for matching patients to clinical trials.
FOS: Computer and information sciences
Clinical Trials as Topic
03 medical and health sciences
Artificial Intelligence (cs.AI)
Computer Science - Artificial Intelligence
Patient Selection
Humans
Programming Languages
0305 other medical science
Information Retrieval (cs.IR)
Computer Science - Information Retrieval
Natural Language Processing
DOI: 10.48550/arxiv.2312.09958
Publication Date: 2024-04-19
AUTHORS (4)
ABSTRACT
Objective: To systematically examine the efficacy of both proprietary (GPT-3.5, GPT-4) and open-source (LLAMA 7B, 13B, 70B) large language models (LLMs) in matching patients to clinical trials in healthcare.
Materials and Methods: The study employs a multifaceted evaluation framework, incorporating extensive automated and human-centric assessments along with a detailed error analysis for each model, and assesses LLMs' capabilities in analyzing patient eligibility against clinical trials' inclusion and exclusion criteria. To improve the adaptability of open-source LLMs, a specialized synthetic dataset was created using GPT-4, facilitating effective fine-tuning under constrained data conditions.
Results: The findings indicate that open-source LLMs, when fine-tuned on this limited, synthetic dataset, achieve performance parity with proprietary counterparts such as GPT-3.5.
Discussion: This study highlights the recent success of LLMs in the high-stakes domain of healthcare, specifically in patient-trial matching. It demonstrates the potential of open-source models to match the performance of proprietary models when fine-tuned appropriately, addressing challenges such as cost, privacy, and reproducibility associated with closed-source proprietary LLMs.
Conclusion: The study underscores the opportunity for open-source LLMs in patient-trial matching. To encourage further research and applications in this field, the annotated evaluation dataset and the fine-tuned LLM, Trial-LLAMA, are released for public use.
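The abstract describes asking LLMs to analyze a patient's eligibility against a trial's inclusion and exclusion criteria. The sketch below shows one hypothetical way such a check could be framed as a single prompt; the prompt wording, the model name, the eligibility labels, and the function name are illustrative assumptions and are not the paper's actual pipeline or prompts.

```python
# A minimal sketch (not the paper's method) of prompting an LLM to judge patient
# eligibility against a trial's inclusion/exclusion criteria.
# Assumes the `openai` Python package (>=1.0) and an OPENAI_API_KEY in the environment;
# prompt wording, model choice, and label set are assumptions for illustration only.
from openai import OpenAI

client = OpenAI()

def assess_eligibility(patient_note: str, inclusion: list[str], exclusion: list[str],
                       model: str = "gpt-4") -> str:
    """Ask the model to judge each criterion, then give an overall label."""
    criteria = "\n".join(
        [f"INCLUSION: {c}" for c in inclusion] + [f"EXCLUSION: {c}" for c in exclusion]
    )
    prompt = (
        "You are matching a patient to a clinical trial.\n"
        f"Patient note:\n{patient_note}\n\n"
        f"Trial criteria:\n{criteria}\n\n"
        "For each criterion, state whether it is met, not met, or not determinable "
        "from the note. Then give an overall label: ELIGIBLE, NOT ELIGIBLE, or UNCERTAIN."
    )
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        temperature=0,
    )
    return response.choices[0].message.content

# Example usage with toy data:
# print(assess_eligibility(
#     "72-year-old male with type 2 diabetes, eGFR 55, no prior insulin use.",
#     inclusion=["Age >= 18", "Diagnosis of type 2 diabetes"],
#     exclusion=["eGFR < 30", "Current insulin therapy"],
# ))
```

The same prompt-and-label framing could, in principle, be used to generate GPT-4 outputs for fine-tuning an open-source model, which is the general strategy the abstract describes, but the paper's actual dataset construction and fine-tuning setup are not given here.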