- Biomedical Text Mining and Ontologies
- Natural Language Processing Techniques
- Topic Modeling
- Galectins and Cancer Biology
- Glycosylation and Glycoproteins Research
- Carbohydrate Chemistry and Synthesis
Case Western Reserve University
2022-2025
Abstract Mucin type O-glycan core elongation is typically performed by the C1GALT1, B3GNT6, and ST6GalNAc-I/-II O-glycosyltransferases. These enzymes target Tn antigen (GalNAc-O-Thr/Ser) dictating fate of elongation, playing important roles in health disease. Changes transferase expression glycan structure are commonly associated with diseases such as cancer, Tn-syndrome, ulcerative colitis. Despite their significance, substrate specificities biological remain elusive. Here, we examine...
Long Phan, Tai Dang, Hieu Tran, Trieu H. Trinh, Vy Lam D. Chau, Minh-Thang Luong. Proceedings of the 17th Conference European Chapter Association for Computational Linguistics. 2023.
Abstract Biomedical data and benchmarks are highly valuable yet very limited in low-resource languages other than English such as Vietnamese. In this paper, we make use of a state-of-theart translation model English-Vietnamese to translate produce both pretrained well supervised the biomedical domains. Thanks large-scale translation, introduce ViPubmedT5, Encoder-Decoder Transformer trained on 20 million translated abstracts from high-quality public PubMed corpus. ViPubMedT5 demonstrates...
Biomedical data and benchmarks are highly valuable yet very limited in low-resource languages other than English such as Vietnamese. In this paper, we make use of a state-of-the-art translation model English-Vietnamese to translate produce both pretrained well supervised the biomedical domains. Thanks large-scale translation, introduce ViPubmedT5, Encoder-Decoder Transformer trained on 20 million translated abstracts from high-quality public PubMed corpus. ViPubMedT5 demonstrates results two...