Nghia Luan Pham

ORCID: 0000-0003-3922-2607
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Natural Language Processing Techniques
  • Topic Modeling
  • Handwritten Text Recognition Techniques
  • Multimodal Machine Learning Applications

Hai Phong University
2020-2023

The translation quality of machine systems depends on the parallel corpus used for training, in particular quantity and corpus.However, building a high-quality large-scale is complex expensive, particularly specific domain corpus.Therefore, data augmentation techniques are widely translation.The input back-translation method monolingual text, which available from many sources, therefore this can be easily effectively implemented to generate synthetic data.In practice, texts collected...

10.1109/access.2023.3252898 article EN cc-by-nc-nd IEEE Access 2023-01-01

The translation quality of machine systems depends on the parallel corpus used for training, including quantity and corpus. However, building a highquality large-scale is complex expensive, particularly specific domain Therefore, data augmentation techniques are widely in translation. back-translation method simple effective when input monolingual text. In practice, texts can be collected from different sources, which sources websites often have errors grammar spelling, sentence mismatch,...

10.2139/ssrn.4216607 article EN SSRN Electronic Journal 2022-01-01
Coming Soon ...