NFDI4DS | UHH-SEMS - Publication Details

Ngan Luu-Thuy Nguyen

ORCID: 0000-0003-3931-849X

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5033137339

Research Areas

Topic Modeling
Natural Language Processing Techniques
Sentiment Analysis and Opinion Mining
Multimodal Machine Learning Applications
Text Readability and Simplification
Advanced Text Analysis Techniques
Text and Document Classification Technologies
Hate Speech and Cyberbullying Detection
Spam and Phishing Detection
Handwritten Text Recognition Techniques
Surgical Simulation and Training
Biomedical Text Mining and Ontologies
Advanced Image and Video Retrieval Techniques
Cancer Immunotherapy and Biomarkers
Domain Adaptation and Few-Shot Learning
Bioinformatics and Genomic Networks
Internet Traffic Analysis and Secure E-voting
Online Learning and Analytics
Simulation-Based Education in Healthcare
Algorithms and Data Compression
Speech Recognition and Synthesis
Misinformation and Its Impacts
Smart Agriculture and AI
Patient Safety and Medication Errors
Human Pose and Action Recognition

Oncology Institute of Hope and Innovation
2024

Vietnam National University Ho Chi Minh City
2015-2024

Teikoku Pharma (United States)
2023-2024

King Abdullah University of Science and Technology
2024

Ho Chi Minh City University of Technology
2019-2023

University Of Information Technology
2016-2022

OhioHealth
2015-2020

Riverside Methodist Hospital
2015-2020

Bridgeport Hospital
2017

Yale New Haven Health System
2017

FormerLeaf: An efficient vision transformer for Cassava Leaf Disease detection

OPENALEX - Publications

Huy-Tan Thai Kim-Hung Le Ngan Luu-Thuy Nguyen

10.1016/j.compag.2022.107518 article EN Computers and Electronics in Agriculture 2022-12-08

The Genia Event and Protein Coreference tasks of the BioNLP Shared Task 2011

OPENALEX - Publications

Jin-Dong Kim Ngan Luu-Thuy Nguyen Yue Wang Jun’ichi Tsujii Toshihisa Takagi and 1 more

The Genia task, when it was introduced in 2009, the first community-wide effort to address a fine-grained, structural information extraction from biomedical literature. Arranged for second time as one of main tasks BioNLP Shared Task 2011, aimed measure progress community since and evaluate generalization technology full text papers. Protein Coreference task arranged supporting tasks, motivated lessons 2009 that abundance coreference structures natural language hinders further improvement...

10.1186/1471-2105-13-s11-s1 article EN cc-by BMC Bioinformatics 2012-06-01

UIT-VSFC: Vietnamese Students’ Feedback Corpus for Sentiment Analysis

OPENALEX - Publications

Kiet Van Nguyen Duc-Vu Nguyen Phu X. V. Nguyen Tham T. H. Truong Ngan Luu-Thuy Nguyen

Students' feedback is a vital resource for the interdisciplinary research combining of two fields: sentiment analysis and education. To strengthen Vietnamese language which low-resource language, we build Feedback Corpus (UIT-VSFC), free high-quality corpus on different tasks: sentiment-based topic-based classifications. In this paper, present methods building annotation guidelines ensure accuracy consistency corpus. The consists over 16,000 sentences are human-annotated tasks. assess...

10.1109/kse.2018.8573337 article EN 2018-11-01

A Vietnamese Dataset for Evaluating Machine Reading Comprehension

OPENALEX - Publications

Kiet Van Nguyen Duc-Vu Nguyen Thi Hong Anh Nguyen Ngan Luu-Thuy Nguyen

Over 97 million inhabitants speak Vietnamese as the native language in world. However, there are few research studies on machine reading comprehension (MRC) Vietnamese, task of understanding a document or text, and answering questions related to it. Due lack benchmark datasets for we present Question Answering Dataset (UIT-ViQuAD), new dataset low-resource evaluate MRC models. This comprises over 23,000 human-generated question-answer pairs based 5,109 passages 174 articles from Wikipedia....

10.18653/v1/2020.coling-main.233 article EN cc-by Proceedings of the 17th international conference on Computational linguistics - 2020-01-01

OpenViVQA: Task, dataset, and multimodal fusion models for visual question answering in Vietnamese

OPENALEX - Publications

Nghia Hieu Nguyen Duong T.D. Vo Kiet Van Nguyen Ngan Luu-Thuy Nguyen

10.1016/j.inffus.2023.101868 article EN Information Fusion 2023-07-04

Lower frequencies of circulating suppressive regulatory T cells and higher frequencies of CD4+naïve T cells at baseline are associated with severe immune-related adverse events in immune checkpoint inhibitor-treated melanoma

OPENALEX - Publications

Magdalena Kovacsovics-Bankowski Johanna M. Sweere Connor P. Healy Natalia Sigal Lichun Cheng and 15 more

Background Immune-related adverse events (irAEs) are major barriers of clinical management and further development immune checkpoint inhibitors (ICIs) for cancer therapy. Therefore, biomarkers associated with the onset severe irAEs needed. In this study, we aimed to identify features detectable in peripheral blood that required intervention. Methods We used a 43-marker mass cytometry panel characterize mononuclear cells from 28 unique patients melanoma across 29 lines ICI therapy before...

10.1136/jitc-2023-008056 article EN cc-by-nc Journal for ImmunoTherapy of Cancer 2024-01-01

Deep Learning versus Traditional Classifiers on Vietnamese Students’ Feedback Corpus

OPENALEX - Publications

Phu X. V. Nguyen Tham T. T. Hong Kiet Van Nguyen Ngan Luu-Thuy Nguyen

Student's feedback is an important source of collecting students' opinions to improve quality training activities. Implementing sentiment analysis into student data, we can determine sentiments polarities which express all problems in the institution since changes necessary will be applied teaching and learning. This study focused on machine learning natural language processing techniques (Naive Bayes, Maximum Entropy, Long Short-Term Memory, Bi-Directional Memory) Vietnamese Students'...

10.1109/nics.2018.8606837 article EN 2021 8th NAFOSTED Conference on Information and Computer Science (NICS) 2018-11-01

Job Prediction: From Deep Neural Network Models to Applications

OPENALEX - Publications

Tin Van Huynh Kiet Van Nguyen Ngan Luu-Thuy Nguyen Anh Gia-Tuan Nguyen

Determining the job is suitable for a student or person looking work based on their descriptions such as knowledge and skills that are difficult, well how employers must find ways to choose candidates match they require. In this paper, we focus studying prediction using different deep neural network models including TextCNN, Bi-GRU-LSTM-CNN, Bi-GRU-CNN with various pre-trained word embeddings IT dataset. addition, proposed simple effective ensemble model combining models. Our experimental...

10.1109/rivf48685.2020.9140760 article EN 2022 RIVF International Conference on Computing and Communication Technologies (RIVF) 2020-07-15

Variants of Long Short-Term Memory for Sentiment Analysis on Vietnamese Students’ Feedback Corpus

OPENALEX - Publications

Duc-Vu Nguyen Kiet Van Nguyen Ngan Luu-Thuy Nguyen

The Long Short-Term Memory (LSTM) and Dependency Tree-LSTM have shown the state-of-the-art results for sentiment analysis task English language. Despite many studies of LSTM approach, there are no approach Vietnamese analysis. In this paper, we conducted experiments with LSTM, Tree-LSTM, our proposed models on Students' Feedback Corpus. According to experimental results, were not better than model. However, when combining final hidden state vectors a Support Vector Machine classifier,...

10.1109/kse.2018.8573351 article EN 2018-11-01

Hate Speech Detection on Vietnamese Social Media Text using the Bi-GRU-LSTM-CNN Model

OPENALEX - Publications

Tin Van Huynh Duc-Vu Nguyen Kiet Van Nguyen Ngan Luu-Thuy Nguyen Anh Gia-Tuan Nguyen

In recent years, Hate Speech Detection has become one of the interesting fields in natural language processing or computational linguistics. this paper, we present description our system to solve problem at VLSP shared task 2019: on Social Networks with corpus which contains 20,345 human-labeled comments/posts for training and 5,086 public-testing. We implement a deep learning method based Bi-GRU-LSTM-CNN classifier into task. Our result is 70.576% F1-score, ranking 5th performance public-test set.

10.48550/arxiv.1911.03644 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Enhancing Lexical-Based Approach With External Knowledge for Vietnamese Multiple-Choice Machine Reading Comprehension

OPENALEX - Publications

Kiet Van Nguyen Khiem Vinh Tran Son T. Luu Anh Gia-Tuan Nguyen Ngan Luu-Thuy Nguyen

Although Vietnamese is the 17 <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">th</sup> most popular native-speaker language in world, there are not many research studies on machine reading comprehension (MRC), task of understanding a text and answering questions about it. One reasons because lack high-quality benchmark datasets for this task. In work, we construct dataset which consists 2,783 pairs multiple-choice answers based 417 texts...

10.1109/access.2020.3035701 article EN cc-by IEEE Access 2020-01-01

New Vietnamese Corpus for Machine Reading Comprehension of Health News Articles

OPENALEX - Publications

Kiet Van Nguyen Tin Van Huynh Duc-Vu Nguyen Anh Gia-Tuan Nguyen Ngan Luu-Thuy Nguyen

Machine reading comprehension is a natural language understanding task where the computing system required to read text and then find answer specific question posed by human. Large-scale high-quality corpora are necessary for evaluating machine models. Furthermore, (MRC) health sector has potential practical applications; nevertheless, MRC research in this domain currently scarce. This article presents UIT-ViNewsQA, new corpus Vietnamese evaluate models healthcare textual domain. The...

10.1145/3527631 article EN ACM Transactions on Asian and Low-Resource Language Information Processing 2022-05-02

Towards sustainable agriculture: A lightweight hybrid model and cloud-based collection of datasets for efficient leaf disease detection

OPENALEX - Publications

Huy-Tan Thai Kim-Hung Le Ngan Luu-Thuy Nguyen

10.1016/j.future.2023.06.016 article EN Future Generation Computer Systems 2023-06-25

EF-CenterNet: An efficient anchor-free model for UAV-based banana leaf disease detection

OPENALEX - Publications

Huy-Tan Thai Kim-Hung Le Ngan Luu-Thuy Nguyen

10.1016/j.compag.2025.109927 article EN Computers and Electronics in Agriculture 2025-01-16

Abstract B034: Overcoming resistance to immunotherapy by targeting CD38 in human tumor explants

OPENALEX - Publications

Or‐Yam Revach Angelina M. Cicerchia Ofir Shorer Boryana Petrova Seth Anderson and 45 more

Abstract T cell exhaustion is a major driver of immune checkpoint blockade (ICB) resistance and clinically effective strategies to prevent or reverse restore ICB sensitivity are lacking. CD38, an ecto-enzyme involved in NAD+ catabolism, highly expressed exhausted CD8+ cells human melanoma, yet its role remains be elucidated. Here we show that CD38+CD8+ enriched during tumor progression following unsuccessful treatment strongly associated with melanoma. Chronic TCR activation type I...

10.1158/1538-7445.genfunc25-b034 article EN Cancer Research 2025-03-11

ViTASA: New benchmark and methods for Vietnamese targeted aspect sentiment analysis for multiple textual domains

OPENALEX - Publications

Khanh Quoc Tran Quang Nhat Huynh Lê Thi Tu Oanh Kiet Van Nguyen Ngan Luu-Thuy Nguyen

10.1016/j.csl.2025.101800 article EN Computer Speech & Language 2025-03-01

Vietnamese Words Are Not Constructed from Syllables: Rethinking the Role of Word Segmentation in Natural Language Processing for Vietnamese Texts

OPENALEX - Publications

Nghia Hieu Nguyen Dat Tien Nguyen Ngan Luu-Thuy Nguyen

The definition of words is the fundamental and crucial linguistic concept. Any changes in word lead to theoretical system respective language. Traditionally, researchers Natural Language Processing (NLP) for Vietnamese texts believe are constructed from syllables. However, their works did not explicitly mention which theory they followed this assumption. Although there no guarantees, most NLP studies accept Consequently, segmentation recognized as one essential stages texts. In study, we...

10.1609/aaai.v39i22.34581 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

Multimodal blood based profiling reveals insights into mechanisms of immunotherapy resistance

OPENALEX - Publications

Samuel J. Wright Izabella Zamora Milan Parikh Deepika Yeramosu Marijana Ručević and 27 more

Many cancer patients treated with immune checkpoint blockade (ICB) do not have durable treatment responses. Circulating biomarkers the potential to identify primary resistance or early progression on therapy alter course and avoid unnecessary toxicity. Unbiased multimodal proteomic profiling in blood has been underexplored due previously limited scalability of multiplexing technologies cohorts lacking time-series sampling. To address this, we performed plasma >2,900 proteins...

10.1101/2025.04.20.25325955 preprint EN cc-by medRxiv (Cold Spring Harbor Laboratory) 2025-04-22

Simulation Improves Nontechnical Skills Performance of Residents During the Perioperative and Intraoperative Phases of Surgery

OPENALEX - Publications

Ngan Luu-Thuy Nguyen John O. Elliott William Watson Edward P. Dominguez

10.1016/j.jsurg.2015.03.005 article EN Journal of surgical education 2015-04-21

Two New Large Corpora for Vietnamese Aspect-based Sentiment Analysis at Sentence Level

OPENALEX - Publications

Dang Van Thin Ngan Luu-Thuy Nguyen Tri Minh Truong Lac Si Le Duy Tin Vo

Aspect-based sentiment analysis has been studied in both research and industrial communities over recent years. For the low-resource languages, standard benchmark corpora play an important role development of methods. In this article, we introduce two with largest sizes at sentence-level for tasks: Aspect Category Detection Polarity Classification Vietnamese. Our are annotated high inter-annotator agreements restaurant hotel domains. The release our would push forward language processing...

10.1145/3446678 article EN ACM Transactions on Asian and Low-Resource Language Information Processing 2021-05-26

ViHOS: Hate Speech Spans Detection for Vietnamese

OPENALEX - Publications

Phu Gia Hoang Canh Duc Luu Khanh Quoc Tran Kiet Van Nguyen Ngan Luu-Thuy Nguyen

The rise in hateful and offensive language directed at other users is one of the adverse side effects increased use social networking platforms. This could make it difficult for human moderators to review tagged comments filtered by classification systems. To help address this issue, we present ViHOS (Vietnamese Hate Offensive Spans) dataset, first human-annotated corpus containing 26k spans on 11k comments. We also provide definitions Vietnamese as well detailed annotation guidelines....

10.18653/v1/2023.eacl-main.47 article EN cc-by 2023-01-01

Coming Soon ...