NFDI4DS | UHH-SEMS - Publication Details

BERN2: an advanced neural biomedical named entity recognition and normalization tool

OPENALEX - Publications

Mujeen Sung, Minbyul Jeong, Yonghwa Choi, Donghyeon Kim, Jinhyuk Lee, and 1 more

In biomedical natural language processing, named entity recognition (NER) and named entity normalization (NEN) are key tasks that enable the automatic extraction of biomedical entities (e.g. diseases and drugs) from the ever-growing biomedical literature. In this article, we present BERN2 (Advanced Biomedical Entity Recognition and Normalization), a tool that improves the previous neural network-based NER tool by employing a multi-task NER model and neural network-based NEN models to achieve much faster and more accurate inference. We hope that our tool can help annotate large-scale biomedical texts for...

10.1093/bioinformatics/btac598 article EN Bioinformatics 2022-08-31

The SLOMO F-Box Protein is Required for ABA-Induced Degradation of VAMP721/722 in Arabidopsis

Yonghwa Choi, Hani Kim, Hyeokjin Kwon, Hyera Jung, A. V. Kim, and 6 more

10.1007/s12374-024-09452-6 article EN Journal of Plant Biology 2025-01-07

ReSimNet: drug response similarity prediction using Siamese neural networks

Minji Jeon, Donghyeon Park, Jinhyuk Lee, Hwisang Jeon, Miyoung Ko, and 4 more

Motivation: Traditional drug discovery approaches identify a target for a disease and find a compound that binds to the target. In this approach, the structures of compounds are considered the most important features because it is assumed that similar structures will bind to the same target. Therefore, structural analogs of the drugs that bind to the target are selected as drug candidates. However, even though compounds are not structural analogs, they may achieve the desired response. A new drug discovery method based on drug response, which can complement the structure-based methods, is needed. Results: We implemented...

10.1093/bioinformatics/btz411 article EN Bioinformatics 2019-05-16

Answering Questions on COVID-19 in Real-Time

Jinhyuk Lee, Sean S. Yi, Minbyul Jeong, Mujeen Sung, Wonjin Yoon, and 3 more

The recent outbreak of the novel coronavirus is wreaking havoc on the world, and researchers are struggling to effectively combat it. One reason why the fight is difficult is the lack of information and knowledge. In this work, we outline our effort to contribute to shrinking this knowledge vacuum by creating covidAsk, a question answering (QA) system that combines biomedical text mining and QA techniques to provide answers to questions in real-time. Our system also leverages information retrieval (IR) approaches to provide entity-level answers that are complementary to QA models....

10.18653/v1/2020.nlpcovid19-2.1 article EN cc-by 2020-01-01

Deep learning of mutation-gene-drug relations from the literature

Kyubum Lee, Byounggun Kim, Yonghwa Choi, Sunkyu Kim, Won‐Ho Shin, and 5 more

Molecular biomarkers that can predict drug efficacy in cancer patients are crucial components for the advancement of precision medicine. However, identifying these molecular biomarkers remains a laborious and challenging task. Next-generation sequencing of patients and preclinical models have increasingly led to the identification of novel gene-mutation-drug relations, and these results have been reported and published in the scientific literature. Here, we present two new computational methods that utilize all the PubMed articles as domain specific...

10.1186/s12859-018-2029-1 article EN cc-by BMC Bioinformatics 2018-01-25

ARPNet: Antidepressant Response Prediction Network for Major Depressive Disorder

Buru Chang, Yonghwa Choi, Minji Jeon, Junhyun Lee, Kyu‐Man Han, and 3 more

Treating patients with major depressive disorder is challenging because it takes several months for the antidepressants prescribed for the patients to take effect. This limitation may result in increased risks and treatment costs. To address this limitation, an accurate antidepressant response prediction model is needed. Recently, several studies have proposed models that extract useful features such as neuroimaging biomarkers and genetic variants from patient data, and use them as predictors for predicting the antidepressant responses of patients....

10.3390/genes10110907 article EN Genes 2019-11-07

Integrated clinical and genomic models using machine-learning methods to predict the efficacy of paclitaxel-based chemotherapy in patients with advanced gastric cancer

Yonghwa Choi, Jangwoo Lee, Keewon Shin, Ji Won Lee, Ju Won Kim, and 4 more

Background: Paclitaxel is commonly used as a second-line therapy for advanced gastric cancer (AGC). The decision to proceed with second-line chemotherapy and select an appropriate regimen is critical for vulnerable patients with AGC progressing after first-line chemotherapy. However, no predictive biomarkers exist to identify patients with AGC who would benefit from paclitaxel-based chemotherapy. Methods: This study included 288 patients with AGC receiving second-line paclitaxel-based chemotherapy between 2017 and 2022 as part of the K-MASTER project, a nationwide government-funded precision medicine...

10.1186/s12885-024-12268-9 article EN cc-by BMC Cancer 2024-04-20

Does a Large Language Model Really Speak in Human-Like Language?

M. M. Park, Yonghwa Choi, Jiwoon Jeon

Large Language Models (LLMs) have recently emerged, attracting considerable attention due to their ability to generate highly natural, human-like text. This study compares the latent community structures of LLM-generated text and human-written text within a hypothesis testing procedure. Specifically, we analyze three text sets: original human-written texts ($\mathcal{O}$), their LLM-paraphrased versions ($\mathcal{G}$), and a twice-paraphrased set ($\mathcal{S}$) derived from $\mathcal{G}$. Our analysis addresses two key...

10.48550/arxiv.2501.01273 preprint EN arXiv (Cornell University) 2025-01-02

A Pilot Study of Biomedical Text Comprehension using an Attention-Based Deep Neural Reader: Design and Experimental Analysis

Seongsoon Kim, Donghyeon Park, Yonghwa Choi, Kyubum Lee, Byounggun Kim, and 4 more

With the development of artificial intelligence (AI) technology centered on deep learning, the computer has evolved to a point where it can read a given text and answer a question based on the context of the text. Such a specific task is known as the task of machine comprehension. Existing machine comprehension tasks mostly use datasets of general texts, such as news articles or elementary school-level storybooks. However, no attempt has been made to determine whether an up-to-date deep learning-based machine comprehension model can also process scientific literature...

10.2196/medinform.8751 article EN cc-by JMIR Medical Informatics 2018-01-05

HiPub: translating PubMed and PMC texts to networks for knowledge discovery

Kyubum Lee, Won‐Ho Shin, Byounggun Kim, Sunwon Lee, Yonghwa Choi, and 4 more

We introduce HiPub, a seamless Chrome browser plug-in that automatically recognizes, annotates and translates biomedical entities from texts into networks for knowledge discovery. Using a combination of two different named-entity recognition resources, HiPub can recognize genes, proteins, diseases, drugs, mutations and cell lines in texts, and achieve high precision and recall. HiPub extracts biomedical entity-relationships from texts to construct context-specific networks, and integrates existing network data from external databases into the networks. It...

10.1093/bioinformatics/btw511 article EN Bioinformatics 2016-08-02

Evaluation of crowdsourced mortality prediction models as a framework for assessing artificial intelligence in medicine

Timothy Bergquist, Thomas Schaffter, Yao Yan, Thomas Yu, Justin Prosser, and 95 more

Objective: Applications of machine learning in healthcare are of high interest and have the potential to improve patient care. Yet, the real-world accuracy of these models in clinical practice and on different patient subpopulations remains unclear. To address these important questions, we hosted a community challenge to evaluate different methods that predict healthcare outcomes. We focused on the prediction of all-cause mortality as the community challenge question. Materials and Methods: Using a Model-to-Data framework, 345 registered participants, coalescing into 25 independent...

10.1093/jamia/ocad159 article EN Journal of the American Medical Informatics Association 2023-08-08

Deep-Learning-Based Natural Language Processing of Serial Free-Text Radiological Reports for Predicting Rectal Cancer Patient Survival

Sunkyu Kim, Choong‐kun Lee, Yonghwa Choi, Eun Sil Baek, Jeong Eun Choi, and 3 more

Most electronic medical records, such as free-text radiological reports, are unstructured; however, the methodological approaches to analyzing these accumulating unstructured records are limited. This article proposes a deep-transfer-learning-based natural language processing model that analyzes serial magnetic resonance imaging reports of rectal cancer patients and predicts their overall survival. To evaluate the model, a retrospective cohort study of 4,338 rectal cancer patients was conducted. The experimental results...

10.3389/fonc.2021.747250 article EN cc-by Frontiers in Oncology 2021-11-17

Can Machines Learn to Comprehend Scientific Literature?

Donghyeon Park, Yonghwa Choi, Daehan Kim, Minhwan Yu, Seongsoon Kim, and 1 more

To measure the ability of a machine to understand professional-level scientific articles, we construct a scientific question answering task called PaperQA. The PaperQA task is based on more than 80 000 "fill-in-the-blank" type questions on articles from reputed scientific journals such as Nature and Science. We perform a fine-grained linguistic analysis and evaluation to compare PaperQA and other conventional question answering (QA) tasks on general literature (e.g., books, news and Wikipedia texts). The results indicate that the PaperQA task is the most difficult QA task for both humans (lay people)...

10.1109/access.2019.2891666 article EN cc-by-nc-nd IEEE Access 2019-01-01

Answering Questions on COVID-19 in Real-Time

Jinhyuk Lee, Sean S. Yi, Minbyul Jeong, Mujeen Sung, Wonjin Yoon, and 3 more

The recent outbreak of the novel coronavirus is wreaking havoc on the world, and researchers are struggling to effectively combat it. One reason why the fight is difficult is the lack of information and knowledge. In this work, we outline our effort to contribute to shrinking this knowledge vacuum by creating covidAsk, a question answering (QA) system that combines biomedical text mining and QA techniques to provide answers to questions in real-time. Our system also leverages information retrieval (IR) approaches to provide entity-level answers that are complementary to QA models....

10.48550/arxiv.2006.15830 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Evaluation of crowdsourced mortality prediction models as a framework for assessing AI in medicine

Timothy Bergquist, Thomas Schaffter, Yao Yan, Thomas Yu, Justin Prosser, and 16 more

Applications of machine learning in healthcare are of high interest and have the potential to significantly improve patient care. Yet, the real-world accuracy and performance of these models on different patient subpopulations remains unclear. To address these important questions, we hosted a community challenge to evaluate different methods that predict healthcare outcomes. To overcome patient privacy concerns, we employed a Model-to-Data approach, allowing citizen scientists and researchers to train and evaluate machine learning models on private health data without direct access to that data. We...

10.1101/2021.01.18.21250072 preprint EN cc-by-nd medRxiv (Cold Spring Harbor Laboratory) 2021-01-20

Frequency-wavenumber analysis with a sparse array

Yonghwa Choi, Donghyeon Kim, Jea Soo Kim

In underwater acoustics, there have been many studies on finding the target direction using the beamforming technique. When receiving a signal with a frequency higher than the design frequency of the array, it is difficult to estimate the target direction due to spatial aliasing. In this study, we propose a method for estimating the target direction with a sparse array by frequency-wavenumber analysis. When the frequency-wavenumber analysis is performed, a striation pattern appears, and it is confirmed that the slope of the striation remains constant even if spatial aliasing occurs. The target direction was estimated by visual inspection and verified with SAVEX15 data.

10.1121/1.5036081 article EN The Journal of the Acoustical Society of America 2018-03-01

SEMO

Jukyoung Lee, Yonghwa Choi, Suhkyung Kim, Seongsoon Kim, Jaewoo Kang

Many people seek majority opinions by searching for question-answers that are uploaded by others or by uploading their own questions on social media sites. However, people have to read through a large number of documents returned by search services to find the majority opinions. Moreover, even when users upload their questions on social media sites, they cannot immediately obtain answers. To address these problems, we present Searching Majority Opinions System (SEMO), a novel opinion-based QA system that uses QA threads from SNS and cQA websites. SEMO returns...

10.1145/2872518.2890553 article EN 2016-01-01

Prediction of array gain in directional noise field

Jisung Park, Yonghwa Choi, J. S. Kim, Sungho Cho, Jungsoo Park

The Array Gain (AG) is a metric to assess the performance of an array and is dependent on the configuration of the array, the frequency, as well as the directionality of the noise. In this study, the AG is calculated based on the spatial coherence between sensor elements in a directional noise environment for a given array shape. The estimated AG is then compared with the AG derived from sea-going data based on the signal-to-noise ratio. The results are presented and discussed.

10.1121/1.4969958 article EN The Journal of the Acoustical Society of America 2016-10-01