NFDI4DS | UHH-SEMS - Publication Details

A chatbot for mental health support: exploring the impact of Emohaa on reducing mental distress in China

OPENALEX - Publications

Sahand Sabour Wen Zhang Xiyao Xiao Yuwei Zhang Yinhe Zheng and 3 more

The growing demand for mental health support has highlighted the importance of conversational agents as human supporters worldwide and in China. These could increase availability reduce relative costs support. provided can be divided into two main types: cognitive emotional. Existing work on this topic mainly focuses constructing that adopt Cognitive Behavioral Therapy (CBT) principles. Such operate based pre-defined templates exercises to provide However, research emotional using such is...

10.3389/fdgth.2023.1133987 article EN cc-by Frontiers in Digital Health 2023-05-04

A Pre-Training Based Personalized Dialogue Generation Model with Persona-Sparse Data

OPENALEX - Publications

Yinhe Zheng Rongsheng Zhang Minlie Huang Xiaoxi Mao

Endowing dialogue systems with personas is essential to deliver more human-like conversations. However, this problem still far from well explored due the difficulties of both embodying personalities in natural languages and persona sparsity issue observed most corpora. This paper proposes a pre-training based personalized model that can generate coherent responses using persona-sparse data. In method, pre-trained language used initialize an encoder decoder, personal attribute embeddings are...

10.1609/aaai.v34i05.6518 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Out-of-Domain Detection for Natural Language Understanding in Dialog Systems

OPENALEX - Publications

Yinhe Zheng Guanyi Chen Minlie Huang

Natural Language Understanding (NLU) is a vital component of dialogue systems, and its ability to detect Out-of-Domain (OOD) inputs critical in practical applications, since the acceptance OOD input that unsupported by current system may lead catastrophic failure. However, most existing detection methods rely heavily on manually labeled samples cannot take full advantage unlabeled data. This limits feasibility these models applications. In this paper, we propose novel model generate...

10.1109/taslp.2020.2983593 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2020-01-01

Personalized Dialogue Generation with Diversified Traits

OPENALEX - Publications

Yinhe Zheng Guanyi Chen Minlie Huang Song Liu Xuan Zhu

Endowing a dialogue system with particular personality traits is essential to deliver more human-like conversations. However, due the challenge of embodying via language expression and lack large-scale persona-labeled data, this research problem still far from well-studied. In paper, we investigate incorporating explicit in generation personalized dialogues. To end, firstly, construct PersonalDialog, multi-turn dataset containing various large number speakers. The consists 20.83M sessions...

10.48550/arxiv.1901.09672 preprint EN other-oa arXiv (Cornell University) 2019-01-01

GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-supervised Learning and Explicit Policy Injection

OPENALEX - Publications

Wanwei He Yinpei Dai Yinhe Zheng Yuchuan Wu Zheng Cao and 7 more

Pre-trained models have proved to be powerful in enhancing task-oriented dialog systems. However, current pre-training methods mainly focus on understanding and generation tasks while neglecting the exploitation of policy. In this paper, we propose GALAXY, a novel pre-trained model that explicitly learns policy from limited labeled dialogs large-scale unlabeled corpora via semi-supervised learning. Specifically, introduce act prediction task for optimization during employ consistency...

10.1609/aaai.v36i10.21320 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2022-06-28

Estimation of the REV size for blockiness of fractured rock masses

OPENALEX - Publications

Lu Xia Yinhe Zheng Qingchun Yu

10.1016/j.compgeo.2016.02.016 article EN Computers and Geotechnics 2016-03-05

GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning and Explicit Policy Injection

OPENALEX - Publications

Wanwei He Yinpei Dai Yinhe Zheng Yuchuan Wu Zheng Cao and 7 more

Pre-trained models have proved to be powerful in enhancing task-oriented dialog systems. However, current pre-training methods mainly focus on understanding and generation tasks while neglecting the exploitation of policy. In this paper, we propose GALAXY, a novel pre-trained model that explicitly learns policy from limited labeled dialogs large-scale unlabeled corpora via semi-supervised learning. Specifically, introduce act prediction task for optimization during employ consistency...

10.48550/arxiv.2111.14592 preprint EN cc-by arXiv (Cornell University) 2021-01-01

EVA: An Open-Domain Chinese Dialogue System with Large-Scale Generative Pre-Training

OPENALEX - Publications

Hao Zhou Pei Ke Zheng Zhang Yuxian Gu Yinhe Zheng and 9 more

Although pre-trained language models have remarkably enhanced the generation ability of dialogue systems, open-domain Chinese systems are still limited by data and model size compared with English ones. In this paper, we propose EVA, a system that contains largest 2.8B parameters. To build model, collect dataset named WDC-Dialogue from various public social media. This 1.4B context-response pairs is used as pre-training corpus EVA. Extensive experiments on automatic human evaluation show EVA...

10.48550/arxiv.2108.01547 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Improving Meta-learning for Low-resource Text Classification and Generation via Memory Imitation

OPENALEX - Publications

Yingxiu Zhao Zhiliang Tian Huaxiu Yao Yinhe Zheng Dongkyu Lee and 3 more

Yingxiu Zhao, Zhiliang Tian, Huaxiu Yao, Yinhe Zheng, Dongkyu Lee, Yiping Song, Jian Sun, Nevin Zhang. Proceedings of the 60th Annual Meeting Association for Computational Linguistics (Volume 1: Long Papers). 2022.

10.18653/v1/2022.acl-long.44 article EN cc-by Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2022-01-01

A method for identifying three-dimensional rock blocks formed by curved fractures

OPENALEX - Publications

Yinhe Zheng Lu Xia Qingchun Yu

10.1016/j.compgeo.2014.11.005 article EN Computers and Geotechnics 2014-12-10

Dialogue Distillation: Open-Domain Dialogue Augmentation Using Unpaired Data

OPENALEX - Publications

Rongsheng Zhang Yinhe Zheng Jianzhi Shao Xiaoxi Mao Yadong Xi and 1 more

Recent advances in open-domain dialogue systems rely on the success of neural models that are trained large-scale data. However, collecting data is usually time-consuming and labor-intensive. To address this dilemma, we propose a novel augmentation method for training by utilizing unpaired Specifically, data-level distillation process first proposed to construct augmented dialogues where both post response retrieved from A ranking module employed filter out low-quality dialogues. Further,...

10.18653/v1/2020.emnlp-main.277 article EN cc-by 2020-01-01

Diversifying Dialog Generation via Adaptive Label Smoothing

OPENALEX - Publications

Yida Wang Yinhe Zheng Yong Jiang Minlie Huang

Yida Wang, Yinhe Zheng, Yong Jiang, Minlie Huang. Proceedings of the 59th Annual Meeting Association for Computational Linguistics and 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 2021.

10.18653/v1/2021.acl-long.272 article EN cc-by 2021-01-01

Empathetic Response Generation via Emotion Cause Transition Graph

OPENALEX - Publications

Yushan Qian Bo Wang Ting-En Lin Yinhe Zheng Ying Zhu and 4 more

Empathetic dialogue is a human-like behavior that requires the perception of both affective factors (e.g., emotion status) and cognitive cause emotion). Besides concerning status in early work, latest approaches study causes empathetic dialogue. These focus on understanding duplicating context to show empathy for speaker. However, instead only repeating contextual causes, real empathic response often demonstrate logical emotion-centered transition from those responses. In this we propose an...

10.1109/icassp49357.2023.10095652 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023-05-05

Analysis of removability and stability of rock blocks by considering the rock bridge effect

OPENALEX - Publications

Yinhe Zheng Lu Xia Qingchun Yu

In traditional block theory, the removability and stability of rock blocks are analyzed independently; that is, a removable is in detail, nonremovable regarded as stable. However, practical situations, may pose more danger than blocks. This paper presents unified method for analyzing this method, cracking bridges considered not assumed to be First, possible identified by extending finite-sized fractures comparing boundary surfaces resulting with those original Then, sliding direction...

10.1139/cgj-2014-0503 article EN Canadian Geotechnical Journal 2015-08-21

Blockiness level of rock mass around underground powerhouse of Three Gorges Project

OPENALEX - Publications

Lu Xia Maohua Li Frank Chen Yinhe Zheng Qingchun Yu

10.1016/j.tust.2015.02.002 article EN Tunnelling and Underground Space Technology 2015-03-07

Stylized Dialogue Response Generation Using Stylized Unpaired Texts

OPENALEX - Publications

Yinhe Zheng Zikai Chen Rongsheng Zhang Shilei Huang Xiaoxi Mao and 1 more

Generating stylized responses is essential to build intelligent and engaging dialogue systems. However, this task far from well-explored due the difficulties of rendering a particular style in coherent responses, especially when target embedded only unpaired texts that cannot be directly used train model. This paper proposes generation method can capture stylistic features texts. Specifically, our produce are both given context conform style. In study, an inverse model first introduced...

10.1609/aaai.v35i16.17711 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

Identifying rock blocks based on exact arithmetic

OPENALEX - Publications

Yinhe Zheng Lu Xia Qingchun Yu

10.1016/j.ijrmms.2016.03.020 article EN International Journal of Rock Mechanics and Mining Sciences 2016-04-14

A Survey on Out-of-Distribution Detection in NLP

OPENALEX - Publications

Hao Lang Yinhe Zheng Yixuan Li Jian Sun Fei Huang and 1 more

Out-of-distribution (OOD) detection is essential for the reliable and safe deployment of machine learning systems in real world. Great progress has been made over past years. This paper presents first review recent advances OOD with a particular focus on natural language processing approaches. First, we provide formal definition discuss several related fields. We then categorize algorithms into three classes according to data they used: (1) available, (2) unavailable + in-distribution (ID)...

10.48550/arxiv.2305.03236 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Transferable Persona-Grounded Dialogues via Grounded Minimal Edits

OPENALEX - Publications

Chen Wu Yinhe Zheng Xiaoxi Mao Minlie Huang

Grounded dialogue models generate responses that are grounded on certain concepts. Limited by the distribution of data, trained such data face transferability challenges in terms and type To address challenges, we propose minimal editing framework, which minimally edits existing to be given concept. Focusing personas, Minimal Editor (GME), learns edit disentangling recombining persona-related persona-agnostic parts response. evaluate persona-grounded editing, present PersonaMi-nEdit dataset,...

10.18653/v1/2021.emnlp-main.183 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2021-01-01

Prompt Conditioned VAE: Enhancing Generative Replay for Lifelong Learning in Task-Oriented Dialogue

OPENALEX - Publications

Yingxiu Zhao Yinhe Zheng Zhiliang Tian Chang Gao Jian Sun and 1 more

Lifelong learning (LL) is vital for advanced task-oriented dialogue (ToD) systems. To address the catastrophic forgetting issue of LL, generative replay methods are widely employed to consolidate past knowledge with generated pseudo samples. However, most existing use only a single task-specific token control their models. This scheme usually not strong enough constrain model due insufficient information involved. In this paper, we propose novel method, prompt conditioned VAE lifelong...

10.18653/v1/2022.emnlp-main.766 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2022-01-01

Estimating Soft Labels for Out-of-Domain Intent Detection

OPENALEX - Publications

Hao Lang Yinhe Zheng Jian Sun Fei Huang Luo Si and 1 more

Out-of-Domain (OOD) intent detection is important for practical dialog systems. To alleviate the issue of lacking OOD training samples, some works propose synthesizing pseudo samples and directly assigning one-hot labels to these samples. However, introduce noises process because "hard" may coincide with In-Domain (IND) intents. In this paper, we an adaptive soft labeling (ASoul) method that can estimate when detectors. Semantic connections between IND intents are captured using embedding...

10.18653/v1/2022.emnlp-main.18 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2022-01-01

MMChat: Multi-Modal Chat Dataset on Social Media

OPENALEX - Publications

Yinhe Zheng Guanyi Chen Xin Liu Ke Lin

Incorporating multi-modal contexts in conversation is important for developing more engaging dialogue systems. In this work, we explore direction by introducing MMChat: a large-scale Chinese corpus (32.4M raw dialogues and 120.84K filtered dialogues). Unlike previous corpora that are crowd-sourced or collected from fictitious movies, MMChat contains image-grounded real conversations on social media, which the sparsity issue observed. Specifically, image-initiated common communications may...

10.48550/arxiv.2108.07154 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Unsupervised Domain Adaptation with Adapter

OPENALEX - Publications

Rongsheng Zhang Yinhe Zheng Xiaoxi Mao Minlie Huang

Unsupervised domain adaptation (UDA) with pre-trained language models (PrLM) has achieved promising results since these embed generic knowledge learned from various domains. However, fine-tuning all the parameters of PrLM on a small domain-specific corpus distort knowledge, and it is also expensive to deployment whole fine-tuned for each domain. This paper explores an adapter-based approach unsupervised adaptation. Specifically, several trainable adapter modules are inserted in PrLM,...

10.48550/arxiv.2111.00667 preprint EN other-oa arXiv (Cornell University) 2021-01-01

DS-UI: Dual-Supervised Mixture of Gaussian Mixture Models for Uncertainty Inference in Image Recognition

OPENALEX - Publications

Jiyang Xie Zhanyu Ma Jing‐Hao Xue Guoqiang Zhang Jian Sun and 2 more

This paper proposes a dual-supervised uncertainty inference (DS-UI) framework for improving Bayesian estimation-based UI in DNN-based image recognition. In the DS-UI, we combine classifier of DNN, i.e., last fully-connected (FC) layer, with mixture Gaussian models (MoGMM) to obtain an MoGMM-FC layer. Unlike existing methods DNNs, which only calculate means or modes DNN outputs' distributions, proposed layer acts as probabilistic interpreter features that are inputs directly probabilities...

10.1109/tip.2021.3123555 article EN IEEE Transactions on Image Processing 2021-01-01