NFDI4DS | UHH-SEMS - Publication Details

The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding

FOS: Computer and information sciences; Computer Science - Computation and Language; 0202 electrical engineering, electronic engineering, information engineering; 02 engineering and technology; Computation and Language (cs.CL)

DOI: 10.18653/v1/2020.acl-demos.16
Publication Date: 2020-07-29T14:14:43Z

AUTHORS (11)

Xiaodong Liu

Yu Wang

Jianshu Ji

Hao Cheng

Xueyun Zhu

Emmanuel Awa

Pengcheng He

Weizhu Chen

Hoifung Poon

Guihong Cao

Jianfeng Gao

ABSTRACT

9 pages, 3 figures and 3 tables

We present MT-DNN, an open-source natural language understanding (NLU) toolkit that makes it easy for researchers and developers to train customized deep learning models. Built upon PyTorch and Transformers, MT-DNN is designed to facilitate rapid customization for a broad spectrum of NLU tasks, using a variety of objectives (classification, regression, structured prediction) and text encoders (e.g., RNNs, BERT, RoBERTa, UniLM). A unique feature of MT-DNN is its built-in support for robust and transferable learning using the adversarial multi-task learning paradigm. To enable efficient production deployment, MT-DNN supports multi-task knowledge distillation, which can substantially compress a deep neural model without significant performance drop. We demonstrate the effectiveness of MT-DNN on a wide range of NLU applications across general and biomedical domains. The software and pre-trained models will be publicly available at https://github.com/namisan/mt-dnn.
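The abstract mentions multi-task knowledge distillation without detail. As a rough illustration only, the sketch below shows a generic soft-target distillation loss averaged over tasks, written in plain Python. It is not taken from the MT-DNN code base: the function names, the temperature parameter, and the per-task averaging are all assumptions made for this example.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """Cross-entropy between the teacher's soft targets and the student's
    predictions (hypothetical helper, not an MT-DNN API)."""
    teacher_probs = softmax(teacher_logits, temperature)
    student_probs = softmax(student_logits, temperature)
    return -sum(p * math.log(q) for p, q in zip(teacher_probs, student_probs))


def multi_task_distillation_loss(examples_by_task, temperature=2.0):
    """Average the per-example loss within each task, then across tasks.

    examples_by_task maps a task name to a list of
    (student_logits, teacher_logits) pairs. Equal task weighting is an
    assumption of this sketch.
    """
    task_losses = []
    for pairs in examples_by_task.values():
        losses = [distillation_loss(s, t, temperature) for s, t in pairs]
        task_losses.append(sum(losses) / len(losses))
    return sum(task_losses) / len(task_losses)


if __name__ == "__main__":
    # Toy logits for two made-up tasks; the values are illustrative only.
    toy = {
        "task_a": [([1.0, 0.5, -0.2], [2.0, 0.1, -1.0])],
        "task_b": [([0.3, 0.7], [0.2, 1.5])],
    }
    print(multi_task_distillation_loss(toy))
```

In this formulation the loss for a fixed teacher is smallest when the student reproduces the teacher's logits, which is what pushes a smaller student model toward the teacher's behaviour on every task at once.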

CITATIONS (6)
