- Topic Modeling
- Natural Language Processing Techniques
- Multimodal Machine Learning Applications
- Text Readability and Simplification
- Speech and Dialogue Systems
- Computational Drug Discovery Methods
- Speech Recognition and Synthesis
- Video Analysis and Summarization
- Software Engineering Research
- Web Data Mining and Analysis
- Text and Document Classification Technologies
- Biomedical Text Mining and Ontologies
- Human Motion and Animation
- Machine Learning in Materials Science
- Bioinformatics and Genomic Networks
- Domain Adaptation and Few-Shot Learning
- Multi-Agent Systems and Negotiation
- Semantic Web and Ontologies
- Image Retrieval and Classification Techniques
- Handwritten Text Recognition Techniques
- Advanced Graph Neural Networks
- Human Pose and Action Recognition
- Artificial Intelligence in Games
- Science Education and Pedagogy
- Gastric Cancer Management and Outcomes
- Tencent (China): 2018-2025
- Alibaba Group (China): 2025
- Zhejiang University: 2024
- Dublin City University: 2015-2024
- Beijing Institute of Technology: 2024
- Xiangtan University: 2024
- University of Illinois Chicago: 2024
- Hunan University: 2024
- Macao Polytechnic University: 2024
- University of Hong Kong: 2020-2023
In translation, considering the document as a whole can help to resolve ambiguities and inconsistencies. In this paper, we propose a cross-sentence context-aware approach and investigate the influence of historical contextual information on the performance of neural machine translation (NMT). First, this history is summarized in a hierarchical way. We then integrate the historical representation into NMT in two strategies: 1) a warm-start of encoder and decoder states, and 2) an auxiliary context source for updating decoder states. Experimental results...
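The hierarchical summarization described in this abstract can be pictured as a sentence-level encoder stacked on a word-level encoder. The following is a minimal PyTorch sketch of that idea, not the authors' implementation; the GRU choice, module names, and dimensions are illustrative assumptions.

```python
import torch
import torch.nn as nn

class HierarchicalContextEncoder(nn.Module):
    """Illustrative sketch: summarize preceding sentences hierarchically.

    A word-level GRU encodes each history sentence into a vector; a
    sentence-level GRU then summarizes those vectors into one context
    vector, which could warm-start the NMT encoder/decoder states or
    serve as an auxiliary input when updating decoder states.
    """

    def __init__(self, vocab_size: int, emb_dim: int = 256, hid_dim: int = 512):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.word_rnn = nn.GRU(emb_dim, hid_dim, batch_first=True)
        self.sent_rnn = nn.GRU(hid_dim, hid_dim, batch_first=True)

    def forward(self, history: torch.Tensor) -> torch.Tensor:
        # history: (num_sents, sent_len) token ids of the preceding sentences
        _, sent_vecs = self.word_rnn(self.embed(history))  # (1, num_sents, hid)
        _, context = self.sent_rnn(sent_vecs)              # (1, 1, hid)
        return context.view(-1)                            # flat context vector

enc = HierarchicalContextEncoder(vocab_size=10000)
ctx = enc(torch.randint(0, 10000, (3, 12)))  # 3 history sentences, 12 tokens each
```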
While large language models (LLMs) have demonstrated remarkable capabilities across a range of downstream tasks, a significant concern revolves around their propensity to exhibit hallucinations: LLMs occasionally generate content that diverges from the user input, contradicts previously generated context, or misaligns with established world knowledge. This phenomenon poses a substantial challenge to their reliability in real-world scenarios. In this paper, we survey recent efforts on the detection,...
Large language models (LLMs) such as ChatGPT can produce coherent, cohesive, relevant, and fluent answers for various natural language processing (NLP) tasks. Taking document-level machine translation (MT) as a testbed, this paper provides an in-depth evaluation of LLMs’ ability on discourse modeling. The study focuses on three aspects: 1) Effects of Context-Aware Prompts, where we investigate the impact of different prompts on document-level translation quality and discourse phenomena; 2) Comparison of Translation Models, where we compare the translation performance of LLMs with commercial...
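A context-aware prompt of the kind examined here can be built by prepending previously translated sentences to the current request. The snippet below is a hypothetical sketch of one such template; the wording and the `build_context_prompt` helper are assumptions, not the paper's exact prompts.

```python
def build_context_prompt(history: list[tuple[str, str]], source: str,
                         src_lang: str = "Chinese", tgt_lang: str = "English") -> str:
    """Assemble a document-level MT prompt that carries discourse context.

    `history` holds (source, translation) pairs of earlier sentences so the
    model can keep pronouns, ellipsis, and terminology consistent.
    """
    lines = [f"Translate the following {src_lang} sentences into {tgt_lang}, "
             "staying consistent with the earlier translations."]
    for i, (src, tgt) in enumerate(history, 1):
        lines.append(f"Sentence {i}: {src}\nTranslation {i}: {tgt}")
    n = len(history) + 1
    lines.append(f"Sentence {n}: {source}\nTranslation {n}:")
    return "\n\n".join(lines)
```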
Baosong Yang, Longyue Wang, Derek F. Wong, Lidia S. Chao, Zhaopeng Tu. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 2019.
Jie Hao, Xing Wang, Baosong Yang, Longyue Wang, Jinfeng Zhang, Zhaopeng Tu. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 2019.
This work evaluates GPT-4V's multimodal capability for medical image analysis, focusing on three representative tasks: radiology report generation, medical visual question answering, and visual grounding. For the evaluation, a set of prompts is designed for each task to induce the corresponding GPT-4V model to produce sufficiently good outputs. Three evaluation ways, including quantitative analysis, human evaluation, and case study, are employed to achieve an in-depth and extensive evaluation. Our evaluation shows that GPT-4V excels in understanding medical images and can generate...
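Designing one prompt per task, as described, can be organized as a small template table. The mapping below is a hypothetical illustration of that setup; the study's actual prompts are not reproduced here.

```python
# Hypothetical per-task prompt templates for a multimodal medical evaluation;
# task names, wording, and placeholders are invented for the example.
TASK_PROMPTS = {
    "report_generation": (
        "You are a radiologist. Write the findings and impression for the "
        "attached chest X-ray image."
    ),
    "vqa": (
        "Answer the question about the attached medical image as briefly as "
        "possible.\nQuestion: {question}"
    ),
    "grounding": (
        "Locate the region described below in the attached image and return "
        "a bounding box as [x1, y1, x2, y2].\nDescription: {phrase}"
    ),
}

prompt = TASK_PROMPTS["vqa"].format(question="Is there pleural effusion?")
```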
Xing Wang, Zhaopeng Tu, Longyue Wang, Shuming Shi. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 2019.
Machine Translation (MT) has greatly advanced over the years due to developments in deep neural networks. However, the emergence of Large Language Models (LLMs) like GPT-4 and ChatGPT is introducing a new phase in the MT domain. In this context, we believe that the future of MT is intricately tied to the capabilities of LLMs. These models not only offer vast linguistic understanding but also bring innovative methodologies, such as prompt-based techniques, that have the potential to further elevate MT. In this paper, we provide an overview...
Although instruction-tuned large language models (LLMs) have exhibited remarkable capabilities across various NLP tasks, their effectiveness on other data modalities beyond text has not been fully studied. In this work, we propose Macaw-LLM, a novel multi-modal LLM that seamlessly integrates visual, audio, and textual information. Macaw-LLM consists of three main components: a modality module for encoding multi-modal data, a cognitive module for harnessing pretrained LLMs, and an alignment module for harmonizing diverse...
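The alignment idea, projecting non-text encoder outputs into the LLM's token-embedding space so all modalities share one input sequence, can be sketched as below. This is a simplified illustration under assumed dimensions and an assumed attention-pooling design, not the Macaw-LLM code.

```python
import torch
import torch.nn as nn

class ModalityAligner(nn.Module):
    """Sketch: map image/audio encoder features into the LLM embedding space."""

    def __init__(self, feat_dim: int, llm_dim: int, num_tokens: int = 8):
        super().__init__()
        # Compress a variable-length feature sequence into a fixed number of
        # "soft tokens" via learned queries, then project to the LLM width.
        self.queries = nn.Parameter(torch.randn(num_tokens, feat_dim))
        self.attn = nn.MultiheadAttention(feat_dim, num_heads=8, batch_first=True)
        self.proj = nn.Linear(feat_dim, llm_dim)

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        # feats: (batch, seq, feat_dim) from a frozen image/audio encoder
        q = self.queries.unsqueeze(0).expand(feats.size(0), -1, -1)
        aligned, _ = self.attn(q, feats, feats)
        return self.proj(aligned)  # (batch, num_tokens, llm_dim)

# The aligned tokens can then be concatenated with text token embeddings
# and fed to the pretrained LLM as one multimodal sequence.
aligner = ModalityAligner(feat_dim=512, llm_dim=4096)
```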
This paper presents a comprehensive evaluation of GPT-4V’s capabilities across diverse medical imaging tasks, including Radiology Report Generation, Medical Visual Question Answering (VQA), and Grounding. While prior efforts have explored GPT-4V’s performance in medical imaging, to the best of our knowledge, our study represents the first quantitative evaluation on publicly available benchmarks. Our findings highlight GPT-4V’s potential in generating descriptive reports for chest X-ray images, particularly when guided by...
Shilin He, Zhaopeng Tu, Xing Wang, Longyue Wang, Michael Lyu, Shuming Shi. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 2019.
Knowledge distillation (KD) is essential for training non-autoregressive translation (NAT) models by reducing the complexity of the raw data with an autoregressive teacher model. In this study, we empirically show that as a side effect of this training, lexical choice errors on low-frequency words are propagated to the NAT model from the teacher model. To alleviate this problem, we propose to expose the raw data to NAT models to restore the useful information of low-frequency words, which is missed in the distilled data. To this end, we introduce an extra Kullback-Leibler divergence term derived by comparing...
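One way such an extra KL term could look is a penalty, added to the standard NAT loss on distilled data, that pulls the model's lexical predictions toward a raw-data distribution at low-frequency positions. The loss below is a hedged sketch of that general idea, not the paper's exact formulation; the `raw_dist` prior and the frequency mask are assumptions.

```python
import torch
import torch.nn.functional as F

def nat_loss_with_lexical_prior(logits, distilled_tgt, raw_dist, low_freq_mask,
                                alpha: float = 0.5):
    """Sketch: NAT cross-entropy on distilled data + KL toward a raw-data prior.

    logits:        (batch, len, vocab) NAT model outputs
    distilled_tgt: (batch, len) target ids from the teacher's distilled data
    raw_dist:      (batch, len, vocab) lexical distribution estimated from
                   the raw (pre-distillation) parallel data
    low_freq_mask: (batch, len) 1.0 where the target word is low-frequency
    """
    ce = F.cross_entropy(logits.transpose(1, 2), distilled_tgt, reduction="none")
    log_p = F.log_softmax(logits, dim=-1)
    # KL(raw || model), applied only at low-frequency positions
    kl = F.kl_div(log_p, raw_dist, reduction="none").sum(-1)
    return (ce + alpha * low_freq_mask * kl).mean()
```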
Position encoding (PE), an essential part of self-attention networks (SANs), is used to preserve the word order information for natural language processing tasks, generating fixed position indices for input sequences. However, in cross-lingual scenarios, e.g., machine translation, the PEs of source and target sentences are modeled independently. Due to word order divergences in different languages, modeling the cross-lingual positional relationships might help SANs tackle this problem. In this paper, we augment SANs with cross-lingual position representations to model...
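Concretely, one can add a second embedding table indexed by estimated target-side positions alongside the usual source-side indices. The sketch below illustrates this under the assumption that some reordering model has already produced the cross-lingual indices; it is not the paper's implementation.

```python
import torch
import torch.nn as nn

class CrossLingualPositionEmbedding(nn.Module):
    """Sketch: combine monolingual and cross-lingual position embeddings."""

    def __init__(self, max_len: int, d_model: int):
        super().__init__()
        self.mono_pe = nn.Embedding(max_len, d_model)   # usual source order
        self.cross_pe = nn.Embedding(max_len, d_model)  # estimated target order

    def forward(self, token_emb: torch.Tensor, cross_pos: torch.Tensor) -> torch.Tensor:
        # token_emb: (batch, len, d_model); cross_pos: (batch, len) indices
        # giving each source word's predicted position on the target side.
        batch, length, _ = token_emb.shape
        mono_pos = torch.arange(length, device=token_emb.device).expand(batch, length)
        return token_emb + self.mono_pe(mono_pos) + self.cross_pe(cross_pos)
```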
Liang Ding, Longyue Wang, Xuebo Liu, Derek F. Wong, Dacheng Tao, Zhaopeng Tu. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 2021.
Virtual film production requires intricate decision-making processes, including scriptwriting, virtual cinematography, and precise actor positioning and actions. Motivated by recent advances in automated film production with language agent-based societies, this paper introduces FilmAgent, a novel LLM-based multi-agent collaborative framework for end-to-end film automation in our constructed 3D virtual spaces. FilmAgent simulates various crew roles, including directors, screenwriters, actors, and cinematographers, and covers key stages of...
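A minimal way to picture such a crew-of-agents setup is a loop in which role-conditioned LLM calls hand drafts to one another. The sketch below is purely illustrative and not FilmAgent's design; `chat` stands in for any real LLM API and the role prompts are invented for the example.

```python
# Purely illustrative role-based multi-agent collaboration sketch.
ROLES = {
    "screenwriter": "You write a short scene script for the given idea.",
    "director": "You review the script and give concrete revision notes.",
    "cinematographer": "You annotate each script line with camera setups.",
}

def chat(system: str, user: str) -> str:
    # Placeholder for an actual LLM client; echoes inputs so the sketch runs.
    return f"[response to: {user[:40]}...]"

def produce_scene(idea: str, rounds: int = 2) -> str:
    script = chat(ROLES["screenwriter"], idea)
    for _ in range(rounds):  # iterative critique-and-revise loop
        notes = chat(ROLES["director"], script)
        script = chat(ROLES["screenwriter"], f"Revise per notes:\n{notes}\n\n{script}")
    return chat(ROLES["cinematographer"], script)

print(produce_scene("Two astronauts argue over the last coffee pod."))
```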
Recent advancements in Multimodal Large Language Models (MLLMs) underscore the significance of scalable models and data to boost performance, yet this often incurs substantial computational costs. Although the Mixture of Experts (MoE) architecture has been employed to scale large language or visual-language models efficiently, these efforts typically involve fewer experts and limited modalities. To address this, our work presents a pioneering attempt to develop a unified MLLM with a MoE architecture, named Uni-MoE, that...
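The core MoE building block, a router that sends each token to its top-k experts and mixes their outputs, can be sketched as follows. The expert count, k, and dimensions are illustrative, and this simplified version computes every expert densely and omits the load-balancing losses real systems use; it is not Uni-MoE's implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoE(nn.Module):
    """Sketch of a sparsely-gated MoE feed-forward layer with top-k routing."""

    def __init__(self, d_model: int, d_ff: int, num_experts: int = 4, k: int = 2):
        super().__init__()
        self.k = k
        self.router = nn.Linear(d_model, num_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(num_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, len, d_model)
        gates = F.softmax(self.router(x), dim=-1)        # (b, l, num_experts)
        topv, topi = gates.topk(self.k, dim=-1)          # keep k experts per token
        topv = topv / topv.sum(-1, keepdim=True)         # renormalize gate weights
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):        # naive dense loop
            weight = (topv * (topi == e)).sum(-1, keepdim=True)  # (b, l, 1)
            out = out + weight * expert(x)
        return out
```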
Despite being pretrained on multilingual corpora, large language models (LLMs) exhibit suboptimal performance on low-resource languages. Recent approaches have leveraged multilingual encoders alongside LLMs by introducing trainable parameters connecting the two models. However, these methods typically focus on the encoder's output, overlooking valuable information from other layers. We propose \aname (\mname), a framework that integrates representations from all encoder layers, coupled with the \attaname mechanism to...
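Since the method's name and mechanism appear here only as macros, the sketch below shows a generic layer-wise fusion: a learned softmax weighting over all encoder layers followed by a projection into the LLM's embedding width. All names and dimensions are assumptions for illustration, not the paper's mechanism.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LayerwiseFusion(nn.Module):
    """Sketch: fuse hidden states from every encoder layer, not just the last."""

    def __init__(self, num_layers: int, enc_dim: int, llm_dim: int):
        super().__init__()
        self.layer_weights = nn.Parameter(torch.zeros(num_layers))
        self.proj = nn.Linear(enc_dim, llm_dim)

    def forward(self, all_hidden: torch.Tensor) -> torch.Tensor:
        # all_hidden: (num_layers, batch, len, enc_dim) from the encoder
        w = F.softmax(self.layer_weights, dim=0).view(-1, 1, 1, 1)
        fused = (w * all_hidden).sum(0)      # learned weighted mix of all layers
        return self.proj(fused)              # (batch, len, llm_dim) for the LLM
```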
Multi-aspect controllable text generation aims to control generated text in attributes from multiple aspects, making it a complex but powerful task in natural language processing. Supervised fine-tuning methods are often employed for this task due to their simplicity and effectiveness. However, they still have some limitations: low rank adaptation (LoRA) only fine-tunes a few parameters and has suboptimal control effects, while full fine-tuning (FFT) requires significant computational resources and is susceptible to overfitting, particularly when...
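For reference, the LoRA adaptation contrasted with FFT above replaces a dense weight update with a low-rank product, so only two small matrices are trained. Below is a minimal generic sketch of such a layer; the rank and scaling defaults are illustrative and the code is not tied to the paper's method.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Sketch: frozen base linear layer plus a trainable low-rank update A @ B."""

    def __init__(self, in_dim: int, out_dim: int, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = nn.Linear(in_dim, out_dim)
        self.base.weight.requires_grad_(False)   # base weights stay frozen
        self.base.bias.requires_grad_(False)
        self.A = nn.Parameter(torch.randn(in_dim, rank) * 0.01)
        self.B = nn.Parameter(torch.zeros(rank, out_dim))  # zero init: no-op at start
        self.scale = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scale * (x @ self.A @ self.B)
```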