- Topic Modeling
- Natural Language Processing Techniques
- Multimodal Machine Learning Applications
- Text Readability and Simplification
- Teaching and Learning Programming
- Speech and Dialogue Systems
- Intelligent Tutoring Systems and Adaptive Learning
- Distributed and Parallel Computing Systems
- Genetics, Bioinformatics, and Biomedical Research
- Advanced Text Analysis Techniques
- Advanced Computational Techniques and Applications
- Online Learning and Analytics
- Service-Oriented Architecture and Web Services
- University of Macau, 2023
- Southern University of Science and Technology, 2022-2023
- South China Normal University, 2023
- Chinese University of Hong Kong, 2021
- University of Hong Kong, 2021
- University of Miami, 2017
- Peking University, 2005
Previous work mainly focuses on improving cross-lingual transfer for NLU tasks with a multilingual pretrained encoder (MPE), or on improving the performance of supervised machine translation with BERT. However, it is under-explored whether an MPE can help to facilitate the cross-lingual transferability of an NMT model. In this paper, we focus on a zero-shot cross-lingual transfer task in NMT. In this task, the NMT model is trained with a parallel dataset of only one language pair and an off-the-shelf MPE, and is then directly tested on zero-shot language pairs. We propose SixT, a simple yet effective model for this task. SixT...
Guanhua Chen, Lu Hou, Yun Chen, Wenliang Dai, Lifeng Shang, Xin Jiang, Qun Liu, Jia Pan, Wenping Wang. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2023.
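A minimal sketch of the zero-shot transfer setup described in the abstract above, assuming the off-the-shelf MPE is XLM-R and the Hugging Face transformers library; this illustrates the general recipe, not the released SixT code.

```python
# Sketch: wrap an off-the-shelf multilingual pretrained encoder (XLM-R) inside
# an encoder-decoder NMT model, train on ONE language pair, and reuse the same
# model unchanged on unseen (zero-shot) source languages.
from transformers import AutoTokenizer, EncoderDecoderModel

tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-base")

# Warm-start both sides from XLM-R; the decoder gets cross-attention added
# and is trained from this initialization on the single supervised pair.
model = EncoderDecoderModel.from_encoder_decoder_pretrained(
    "xlm-roberta-base", "xlm-roberta-base"
)
model.config.decoder_start_token_id = tokenizer.cls_token_id
model.config.pad_token_id = tokenizer.pad_token_id

# After supervised training (e.g. on De-En only), sentences from languages
# never seen in the parallel data are fed to the same model at test time.
src = tokenizer("Ein Beispielsatz.", return_tensors="pt")
generated = model.generate(**src, max_length=40)
print(tokenizer.decode(generated[0], skip_special_tokens=True))
```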
Multilingual pretrained language models (mPLMs) have shown their effectiveness in multilingual word alignment induction. However, these methods usually start from mBERT or XLM-R. In this paper, we investigate whether the multilingual sentence Transformer LaBSE is a strong multilingual word aligner. This idea is non-trivial, as LaBSE is trained to learn language-agnostic sentence-level embeddings, while the alignment extraction task requires the more fine-grained word-level embeddings to be language-agnostic. We demonstrate that vanilla LaBSE outperforms other...
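A rough sketch of how word alignments can be extracted from token-level LaBSE embeddings via a similarity matrix and bidirectional argmax; the checkpoint name and the subword-level simplification are assumptions, not the paper's exact procedure.

```python
# Sketch: align subwords of a sentence pair by taking the intersection of
# forward/backward argmax over a cosine-similarity matrix of LaBSE states.
import torch
from transformers import AutoModel, AutoTokenizer

name = "sentence-transformers/LaBSE"  # assumed public checkpoint
tok = AutoTokenizer.from_pretrained(name)
model = AutoModel.from_pretrained(name).eval()

def token_embeddings(sentence: str) -> torch.Tensor:
    enc = tok(sentence, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**enc).last_hidden_state[0]
    return hidden[1:-1]  # drop [CLS] and [SEP]

e_src = token_embeddings("the cat sleeps")
e_tgt = token_embeddings("die Katze schläft")

# Cosine similarity between every source and target subword.
sim = torch.nn.functional.normalize(e_src, dim=-1) @ \
      torch.nn.functional.normalize(e_tgt, dim=-1).T

# Keep (i, j) only if each token is the other's best match.
fwd = sim.argmax(dim=1)
bwd = sim.argmax(dim=0)
alignments = [(i, j.item()) for i, j in enumerate(fwd) if bwd[j] == i]
print(alignments)
```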
Since deep learning is the dominant paradigm in the multi-turn dialogue generation task, large-scale training data is a key factor affecting model performance. To make full use of the training data, existing work directly applies curriculum learning to dialogue generation in an “easy-to-hard” way. But the design of the current methodology does not consider dialogue-specific features. To close this gap, we propose a Multi-Level Curriculum Learning (MLCL) method for multi-turn dialogue generation by considering the word-level linguistic feature and the utterance-level semantic relation within a dialogue. The...
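A simplified sketch of the generic easy-to-hard curriculum this abstract builds on, using a made-up word-rarity difficulty score and a linear competence schedule; the actual MLCL difficulty measures and schedule are defined in the paper.

```python
# Sketch: score dialogue samples by difficulty, sort them, and sample batches
# from a gradually growing "easy" prefix as training proceeds.
import random
from collections import Counter

def difficulty(dialogue: list[str], word_freq: Counter) -> float:
    # Placeholder score: rarer words make a sample harder.
    words = [w for turn in dialogue for w in turn.split()]
    return sum(1.0 / (word_freq[w] + 1) for w in words) / max(len(words), 1)

def curriculum_batches(dialogues, num_steps, batch_size=2):
    freq = Counter(w for d in dialogues for turn in d for w in turn.split())
    ordered = sorted(dialogues, key=lambda d: difficulty(d, freq))
    for step in range(1, num_steps + 1):
        # Competence grows linearly: early steps only see the easiest samples.
        limit = max(batch_size, int(len(ordered) * step / num_steps))
        pool = ordered[:limit]
        yield random.sample(pool, min(batch_size, len(pool)))

dialogues = [
    ["hi", "hello there"],
    ["how are you", "fine thanks and you"],
    ["what is the eigendecomposition", "it factorizes a matrix"],
]
for batch in curriculum_batches(dialogues, num_steps=3):
    print(batch)
```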
With the evolution of GIS from stand-alone systems with tightly coupled geo-data to an increasingly distributed model based on independently provided, interoperable Web services, much research has focused on service composition. However, few works concern control mechanisms for improving availability and reliability, even though services are in essence loosely coupled and hosted by different providers; as a result, any update might critically affect the overall composition consistency...
Instruction tuning has been demonstrated to significantly improve the zero-shot generalization capability to unseen tasks. By incorporating additional context (e.g., task definition, examples) during the fine-tuning process, Large Language Models (LLMs) achieve much higher performance than before. However, recent work reported that delusive examples can achieve almost the same performance as correct examples, indicating that the input-label correspondence is less important than previously thought...
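For concreteness, a small sketch of how an instruction-tuning instance is typically serialized into a single training prompt from a task definition plus in-context examples; the field names and template are illustrative assumptions, not a specific benchmark's schema.

```python
# Sketch: build one instruction-tuning prompt from a task definition,
# demonstration examples, and the query input. Field names are illustrative.
def build_prompt(definition: str, examples: list[dict], query: str) -> str:
    parts = [f"Definition: {definition}"]
    for ex in examples:
        parts.append(f"Input: {ex['input']}\nOutput: {ex['output']}")
    parts.append(f"Input: {query}\nOutput:")
    return "\n\n".join(parts)

prompt = build_prompt(
    definition="Classify the sentiment of the sentence as positive or negative.",
    examples=[
        {"input": "I loved this movie.", "output": "positive"},
        {"input": "The plot was a mess.", "output": "negative"},
    ],
    query="The soundtrack was wonderful.",
)
print(prompt)
# The model is fine-tuned to produce the gold output given such prompts; the
# cited finding is that corrupting the example outputs changes performance little.
```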
Stylistic headline generation is the task of generating a headline that not only summarizes the content of an article but also reflects a desired style that attracts users. As style-specific article-headline pairs are scarce, previous research focuses on unsupervised approaches with a standard headline generation dataset and mono-style corpora. In this work, we follow this line and propose StyleBART, an unsupervised approach for stylistic headline generation. Our method decorates the pretrained BART model with adapters that are responsible for different styles, allowing the generation of headlines with diverse styles by...
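A minimal sketch of the per-style adapter idea: a small bottleneck module per style applied on top of a frozen pretrained layer output. Sizes, style names, and placement are illustrative assumptions, not StyleBART's exact architecture.

```python
# Sketch: one bottleneck adapter per style; the backbone (e.g. BART) stays
# frozen and only the adapter matching the requested style is applied.
import torch
import torch.nn as nn

class StyleAdapter(nn.Module):
    def __init__(self, hidden_size: int = 768, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden_size, bottleneck)
        self.up = nn.Linear(bottleneck, hidden_size)

    def forward(self, hidden: torch.Tensor) -> torch.Tensor:
        # Residual bottleneck transformation of the layer output.
        return hidden + self.up(torch.relu(self.down(hidden)))

# One adapter per target style (names are placeholders).
adapters = nn.ModuleDict({s: StyleAdapter() for s in ["humor", "romance", "clickbait"]})

hidden_states = torch.randn(1, 16, 768)        # stand-in for a BART layer output
styled = adapters["humor"](hidden_states)      # select the adapter for the desired style
print(styled.shape)
```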
This paper demonstrates that multilingual pretraining and multilingual fine-tuning are both critical for facilitating cross-lingual transfer in zero-shot translation, where the neural machine translation (NMT) model is tested on source languages unseen during supervised training. Following this idea, we present SixT+, a strong many-to-English NMT model that supports 100 source languages but is trained with a parallel dataset of only six source languages. SixT+ initializes the decoder embedding and the full encoder with XLM-R large and then trains the encoder and decoder layers with a simple...
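A rough sketch of the initialization step described above, assuming the Hugging Face XLM-R large checkpoint: the full encoder is taken from XLM-R and the decoder's embedding table is copied from XLM-R's word embeddings; the subsequent staged training is omitted.

```python
# Sketch: initialize an NMT encoder and the decoder embedding from XLM-R large.
# Only the initialization is shown; the staged training schedule is not.
import torch.nn as nn
from transformers import XLMRobertaModel

xlmr = XLMRobertaModel.from_pretrained("xlm-roberta-large")
cfg = xlmr.config

# Encoder: reuse the full pretrained XLM-R stack as the NMT encoder.
encoder = xlmr

# Decoder embedding: copy XLM-R's word-embedding table into a fresh table that
# a randomly initialized Transformer decoder (not shown) would use.
decoder_embed = nn.Embedding(cfg.vocab_size, cfg.hidden_size, padding_idx=cfg.pad_token_id)
decoder_embed.weight.data.copy_(xlmr.embeddings.word_embeddings.weight.data)

print(decoder_embed.weight.shape)  # (vocab_size, hidden_size), e.g. (250002, 1024)
```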