- Natural Language Processing Techniques
- Topic Modeling
- Handwritten Text Recognition Techniques
- Multimodal Machine Learning Applications
Hai Phong University
2020-2023
The translation quality of machine systems depends on the parallel corpus used for training, in particular quantity and corpus.However, building a high-quality large-scale is complex expensive, particularly specific domain corpus.Therefore, data augmentation techniques are widely translation.The input back-translation method monolingual text, which available from many sources, therefore this can be easily effectively implemented to generate synthetic data.In practice, texts collected...
The translation quality of machine systems depends on the parallel corpus used for training, including quantity and corpus. However, building a highquality large-scale is complex expensive, particularly specific domain Therefore, data augmentation techniques are widely in translation. back-translation method simple effective when input monolingual text. In practice, texts can be collected from different sources, which sources websites often have errors grammar spelling, sentence mismatch,...