- Topic Modeling
- Natural Language Processing Techniques
- Sentiment Analysis and Opinion Mining
- Multimodal Machine Learning Applications
- Text Readability and Simplification
- Advanced Text Analysis Techniques
- Text and Document Classification Technologies
- Hate Speech and Cyberbullying Detection
- Spam and Phishing Detection
- Handwritten Text Recognition Techniques
- Surgical Simulation and Training
- Biomedical Text Mining and Ontologies
- Advanced Image and Video Retrieval Techniques
- Cancer Immunotherapy and Biomarkers
- Domain Adaptation and Few-Shot Learning
- Bioinformatics and Genomic Networks
- Internet Traffic Analysis and Secure E-voting
- Online Learning and Analytics
- Simulation-Based Education in Healthcare
- Algorithms and Data Compression
- Speech Recognition and Synthesis
- Misinformation and Its Impacts
- Smart Agriculture and AI
- Patient Safety and Medication Errors
- Human Pose and Action Recognition
Oncology Institute of Hope and Innovation
2024
Vietnam National University Ho Chi Minh City
2015-2024
Teikoku Pharma (United States)
2023-2024
King Abdullah University of Science and Technology
2024
Ho Chi Minh City University of Technology
2019-2023
University Of Information Technology
2016-2022
OhioHealth
2015-2020
Riverside Methodist Hospital
2015-2020
Bridgeport Hospital
2017
Yale New Haven Health System
2017
The Genia task, when it was introduced in 2009, was the first community-wide effort to address fine-grained, structural information extraction from biomedical literature. Arranged for the second time as one of the main tasks of the BioNLP Shared Task 2011, it aimed to measure the progress of the community since 2009 and to evaluate the generalization of the technology to full-text papers. The Protein Coreference task was arranged as one of the supporting tasks, motivated by the lessons from 2009 that the abundance of coreference structures in natural language hinders further improvement...
Students' feedback is a vital resource for interdisciplinary research combining two fields: sentiment analysis and education. To strengthen resources for Vietnamese, which is a low-resource language, we build the Vietnamese Students' Feedback Corpus (UIT-VSFC), a free, high-quality corpus for research on two different tasks: sentiment-based and topic-based classification. In this paper, we present the methods for building the corpus and the annotation guidelines that ensure its accuracy and consistency. The corpus consists of over 16,000 sentences that are human-annotated for the two tasks. To assess...
Over 97 million people worldwide speak Vietnamese as their native language. However, there are few research studies on machine reading comprehension (MRC) for Vietnamese, the task of understanding a document or text and answering questions related to it. Due to the lack of benchmark datasets for Vietnamese, we present the Vietnamese Question Answering Dataset (UIT-ViQuAD), a new dataset for this low-resource language to evaluate MRC models. This dataset comprises over 23,000 human-generated question-answer pairs based on 5,109 passages of 174 Vietnamese articles from Wikipedia....
Background: Immune-related adverse events (irAEs) are major barriers to the clinical management and further development of immune checkpoint inhibitors (ICIs) for cancer therapy. Therefore, biomarkers associated with the onset of severe irAEs are needed. In this study, we aimed to identify features detectable in peripheral blood that are associated with severe irAEs that required intervention. Methods: We used a 43-marker mass cytometry panel to characterize peripheral blood mononuclear cells from 28 unique patients with melanoma across 29 lines of ICI therapy before...
Students' feedback is an important source for collecting students' opinions to improve the quality of training activities. By applying sentiment analysis to student feedback data, we can determine the sentiment polarities that express problems in the institution, so that the necessary changes can be applied to teaching and learning. This study focused on machine learning and natural language processing techniques (Naive Bayes, Maximum Entropy, Long Short-Term Memory, Bi-Directional Long Short-Term Memory) on the Vietnamese Students'...
Determining which job is suitable for a student or a person looking for work, based on job descriptions such as the required knowledge and skills, is difficult, as is how employers must find ways to choose candidates who match the job they require. In this paper, we focus on studying job prediction using different deep neural network models, including TextCNN, Bi-GRU-LSTM-CNN, and Bi-GRU-CNN, with various pre-trained word embeddings on an IT job dataset. In addition, we propose a simple and effective ensemble model combining deep neural network models. Our experimental...
The Long Short-Term Memory (LSTM) and Dependency Tree-LSTM have shown state-of-the-art results for the sentiment analysis task in the English language. Despite many studies of the LSTM approach, there are no studies of this approach for Vietnamese sentiment analysis. In this paper, we conducted experiments with LSTM, Dependency Tree-LSTM, and our proposed models on the Vietnamese Students' Feedback Corpus. According to the experimental results, [...] were not better than [...] model. However, when combining the final hidden state vectors with a Support Vector Machine classifier,...
In recent years, hate speech detection has become one of the interesting fields in natural language processing or computational linguistics. In this paper, we present a description of our system to solve this problem at the VLSP Shared Task 2019: Hate Speech Detection on Social Networks, with a corpus that contains 20,345 human-labeled comments/posts for training and 5,086 for public testing. We implement a deep learning method based on the Bi-GRU-LSTM-CNN classifier for this task. Our result is a 70.576% F1-score, ranking 5th in performance on the public-test set.
Although Vietnamese is the 17th most popular native-speaker language in the world, there are not many research studies on Vietnamese machine reading comprehension (MRC), the task of understanding a text and answering questions about it. One of the reasons is the lack of high-quality benchmark datasets for this task. In this work, we construct a dataset which consists of 2,783 pairs of multiple-choice questions and answers based on 417 Vietnamese texts...
Machine reading comprehension is a natural language understanding task where the computing system is required to read a text and then find the answer to a specific question posed by a human. Large-scale, high-quality corpora are necessary for evaluating machine reading comprehension models. Furthermore, machine reading comprehension (MRC) for the health sector has potential for practical applications; nevertheless, MRC research in this domain is currently scarce. This article presents UIT-ViNewsQA, a new corpus for the Vietnamese language to evaluate MRC models for the healthcare textual domain. The...
T cell exhaustion is a major driver of immune checkpoint blockade (ICB) resistance, and clinically effective strategies to prevent or reverse exhaustion and restore ICB sensitivity are lacking. CD38, an ecto-enzyme involved in NAD+ catabolism, is highly expressed in exhausted CD8+ T cells in human melanoma, yet its role in T cell exhaustion remains to be elucidated. Here we show that CD38+CD8+ T cells are enriched during tumor progression following unsuccessful ICB treatment and are strongly associated with [...] melanoma. Chronic TCR activation and type I...
The definition of words is a fundamental and crucial linguistic concept. Any change in the definition of words leads to changes in the theoretical system of the respective language. Traditionally, researchers in Natural Language Processing (NLP) for Vietnamese texts believe that Vietnamese words are constructed from syllables. However, their works did not explicitly mention which linguistic theory they followed for this assumption. Although there are no guarantees, most Vietnamese NLP studies accept this assumption. Consequently, word segmentation is recognized as one of the essential stages in processing Vietnamese texts. In this study, we...
Many cancer patients treated with immune checkpoint blockade (ICB) do not have durable treatment responses. Circulating biomarkers have the potential to identify primary resistance or early progression on therapy, to alter the course of treatment, and to avoid unnecessary toxicity. Unbiased multimodal proteomic profiling in blood has been underexplored due to the previously limited scalability of multiplexing technologies and cohorts lacking time-series sampling. To address this, we performed plasma proteomic profiling of >2,900 proteins...
Aspect-based sentiment analysis has been studied in both research and industrial communities over recent years. For low-resource languages, standard benchmark corpora play an important role in the development of methods. In this article, we introduce two benchmark corpora with the largest sizes at the sentence level for two tasks: Aspect Category Detection and Aspect Polarity Classification in Vietnamese. Our corpora are annotated with high inter-annotator agreements for the restaurant and hotel domains. The release of our corpora would push forward low-resource language processing...
The rise in hateful and offensive language directed at other users is one of the adverse side effects of the increased use of social networking platforms. This could make it difficult for human moderators to review tagged comments filtered by classification systems. To help address this issue, we present the ViHOS (Vietnamese Hate and Offensive Spans) dataset, the first human-annotated corpus containing 26k spans on 11k comments. We also provide definitions of hateful and offensive spans in Vietnamese comments as well as detailed annotation guidelines....