NFDI4DS | UHH-SEMS - Publication Details

MAIRA-1: A specialised large multimodal model for radiology report generation

FOS: Computer and information sciences Computer Science - Computation and Language Artificial Intelligence (cs.AI) Computer Science - Artificial Intelligence Computer Vision and Pattern Recognition (cs.CV) Computer Science - Computer Vision and Pattern Recognition Computation and Language (cs.CL)

DOI: 10.48550/arxiv.2311.13668 Publication Date: 2023-01-01

Abstract Supplemental Material References Cited by

AUTHORS (15)

Hyland, Stephanie L.

Bannur, Shruthi

Bouzid, Kenza

Castro, Daniel C.

Ranjit, Mercy

Schwaighofer, Anton

Pérez-García, Fer...

Salvatelli, Valen...

Srivastav, Shaury

Thieme, Anja

Codella, Noel

Lungren, Matthew P.

Wetscherek, Maria...

Oktay, Ozan

Alvarez-Valle, Ja...

ABSTRACT

We present a radiology-specific multimodal model for the task for generating radiological reports from chest X-rays (CXRs). Our work builds on the idea that large language model(s) can be equipped with multimodal capabilities through alignment with pre-trained vision encoders. On natural images, this has been shown to allow multimodal models to gain image understanding and description capabilities. Our proposed model (MAIRA-1) leverages a CXR-specific image encoder in conjunction with a fine-tuned large language model based on Vicuna-7B, and text-based data augmentation, to produce reports with state-of-the-art quality. In particular, MAIRA-1 significantly improves on the radiologist-aligned RadCliQ metric and across all lexical metrics considered. Manual review of model outputs demonstrates promising fluency and accuracy of generated reports while uncovering failure modes not captured by existing evaluation practices. More information and resources can be found on the project website: https://aka.ms/maira.<br/>18 pages, 9 tables, 5 figures. v2 adds test IDs and image encoder citation. v3 fixes error in NPV/specificity<br/>

SUPPLEMENTAL MATERIAL

Coming soon ....

REFERENCES ()

CITATIONS ()

EXTERNAL LINKS

OPENAIRE - Products

PlumX Metrics

MAIRA-1: A specialised large multimodal model for radiology report generation

RECOMMENDATIONS

FAIR ASSESSMENT

Coming soon ....

JUPYTER LAB

Coming soon ....