Accurate Identification of Fatty Liver Disease in Data Warehouse Utilizing Natural Language Processing

0301 basic medicine Databases, Factual Veterans Health Prognosis Magnetic Resonance Imaging United States 3. Good health Fatty Liver United States Department of Veterans Affairs 03 medical and health sciences Predictive Value of Tests Data Mining Electronic Health Records Humans Tomography, X-Ray Computed Algorithms Natural Language Processing Ultrasonography
DOI: 10.1007/s10620-017-4721-9 Publication Date: 2017-08-31T08:44:05Z
ABSTRACT
Natural language processing is a powerful technique of machine learning capable of maximizing data extraction from complex electronic medical records.We utilized this technique to develop algorithms capable of "reading" full-text radiology reports to accurately identify the presence of fatty liver disease. Abdominal ultrasound, computerized tomography, and magnetic resonance imaging reports were retrieved from the Veterans Affairs Corporate Data Warehouse from a random national sample of 652 patients. Radiographic fatty liver disease was determined by manual review by two physicians and verified with an expert radiologist. A split validation method was utilized for algorithm development.For all three imaging modalities, the algorithms could identify fatty liver disease with >90% recall and precision, with F-measures >90%.These algorithms could be used to rapidly screen patient records to establish a large cohort to facilitate epidemiological and clinical studies and examine the clinic course and outcomes of patients with radiographic hepatic steatosis.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (16)
CITATIONS (31)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....