- Topic Modeling
- Natural Language Processing Techniques
- Advanced Text Analysis Techniques
- Text Readability and Simplification
- Multimodal Machine Learning Applications
- Artificial Intelligence in Healthcare and Education
- Domain Adaptation and Few-Shot Learning
- Educational Technology and Assessment
- Intelligent Tutoring Systems and Adaptive Learning
- Human Pose and Action Recognition
- Generative Adversarial Networks and Image Synthesis
- Chronic Disease Management Strategies
- Speech Recognition and Synthesis
- Image Enhancement Techniques
- Biomedical Text Mining and Ontologies
- Anomaly Detection Techniques and Applications
- Advanced Neural Network Applications
- Web Data Mining and Analysis
- Nutrition and Health in Aging
- Epigenetics and DNA Methylation
- Speech and Dialogue Systems
- Context-Aware Activity Recognition Systems
- Online Learning and Analytics
- Machine Learning and Data Classification
- Handwritten Text Recognition Techniques
- Zhejiang Chinese Medical University, 2025
- University of Science and Technology of China, 2020-2025
- Baidu (China), 2023
- China Tourism Academy, 2022
Entities, as the essential elements in relation extraction tasks, exhibit certain structure. In this work, we formulate such entity structure as distinctive dependencies between mention pairs. We then propose SSAN, which incorporates these structural dependencies within the standard self-attention mechanism and throughout the overall encoding stage. Specifically, we design two alternative transformation modules inside each self-attention building block to produce attentive biases so as to adaptively regularize its attention flow. Our...
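As a rough illustration of the mechanism described above, the sketch below adds a learnable, dependency-type-conditioned bias to single-head self-attention scores. The module name, the per-type scalar parameterization, and all dimensions are illustrative assumptions, not the paper's implementation.

```python
import torch
import torch.nn as nn

class StructuralSelfAttention(nn.Module):
    """Single-head self-attention with an additive, structure-conditioned bias
    (a sketch of the idea behind SSAN, not the authors' code)."""

    def __init__(self, dim, num_dep_types):
        super().__init__()
        self.q = nn.Linear(dim, dim)
        self.k = nn.Linear(dim, dim)
        self.v = nn.Linear(dim, dim)
        # one learnable scalar bias per dependency type (hypothetical parameterization)
        self.dep_bias = nn.Embedding(num_dep_types, 1)
        self.scale = dim ** -0.5

    def forward(self, x, dep_ids):
        # x: (batch, seq, dim); dep_ids: (batch, seq, seq) dependency type per token pair
        scores = self.q(x) @ self.k(x).transpose(-2, -1) * self.scale
        scores = scores + self.dep_bias(dep_ids).squeeze(-1)  # regularize attention flow
        return torch.softmax(scores, dim=-1) @ self.v(x)

x = torch.randn(2, 8, 64)
dep_ids = torch.randint(0, 3, (2, 8, 8))
out = StructuralSelfAttention(64, num_dep_types=3)(x, dep_ids)
print(out.shape)  # torch.Size([2, 8, 64])
```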
With the great success of pre-trained language models, the pretrain-finetune paradigm has become the dominant solution for natural language understanding (NLU) tasks. At the fine-tuning stage, target task data is usually introduced in a completely random order and treated equally. However, examples in NLU tasks can vary greatly in difficulty, and, similar to the human learning procedure, language models can benefit from an easy-to-difficult curriculum. Based on this idea, we propose our Curriculum Learning approach. By reviewing...
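A minimal sketch of the ordering step: sort fine-tuning instances by a difficulty score before training. The token-count proxy used here is an assumption; the paper derives difficulty differently.

```python
# Curriculum ordering: sort fine-tuning instances easy-to-difficult by a proxy score.
examples = [
    {"text": "Great movie!", "label": 1},
    {"text": "The plot meanders and the pacing undercuts an otherwise fine cast.", "label": 0},
    {"text": "I liked it.", "label": 1},
]

def difficulty(ex):
    return len(ex["text"].split())  # stand-in for a model- or loss-based score

curriculum = sorted(examples, key=difficulty)  # easy-to-difficult order
for ex in curriculum:
    print(difficulty(ex), ex["text"])
```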
The answering quality of an aligned large language model (LLM) can be drastically improved if it is treated with properly crafted prompts. In this paper, we propose ExpertPrompting to elicit the potential of LLMs to answer as distinguished experts. We first utilize In-Context Learning to automatically synthesize detailed and customized descriptions of the expert identity for each specific instruction, and then ask the LLM to provide an answer conditioned on such agent background. Based on this augmented prompting strategy, we produce a new set...
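A hedged sketch of the two-step recipe described above; `call_llm` is a placeholder for whatever completion API is available, and the prompt wording is assumed, not taken from the paper (which also prepends in-context exemplars when synthesizing identities).

```python
def call_llm(prompt: str) -> str:
    raise NotImplementedError("plug in your LLM client here")

def expert_prompting(instruction: str) -> str:
    # Step 1: synthesize a customized expert identity for this instruction.
    identity_prompt = (
        "Write a detailed description of an expert who is ideally suited to "
        f"answer the following instruction.\nInstruction: {instruction}\nExpert:"
    )
    expert_identity = call_llm(identity_prompt)
    # Step 2: answer conditioned on that agent background.
    answer_prompt = f"{expert_identity}\n\nNow, as this expert, answer:\n{instruction}"
    return call_llm(answer_prompt)
```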
Large language models (LLMs) have revolutionized the field of artificial intelligence, enabling natural language processing tasks that were previously thought to be exclusive to humans. In this work, we introduce Qwen, the first installment of our large language model series. Qwen is a comprehensive series that encompasses distinct models with varying parameter counts. It includes the base pretrained language models and Qwen-Chat, chat models finetuned with human alignment techniques. The base models consistently demonstrate superior performance across a multitude...
Relational triple extraction is challenging for its difficulty in capturing rich correlations between entities and relations. Existing works suffer from 1) heterogeneous representations of entities and relations, and 2) heterogeneous modeling of entity-entity interactions and entity-relation interactions. Therefore, the rich correlations are not fully exploited by existing works. In this paper, we propose UniRel to address these challenges. Specifically, we unify the representations of entities and relations by jointly encoding them within a concatenated natural language sequence, with...
Benfeng Xu, Quan Wang, Yajuan Lyu, Yabing Shi, Yong Zhu, Jie Gao, Zhendong Mao. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022.
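A toy sketch of the UniRel abstract's core idea: encode sentence tokens and relation verbalizations in one concatenated sequence, then score every token pair with a single interaction map. The sigmoid-of-dot-product scoring and all shapes are illustrative assumptions.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
sent_len, num_rels, dim = 10, 4, 32
encoder = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(dim, nhead=4, batch_first=True), num_layers=1
)

# [sentence tokens ; relation verbalizations] jointly encoded as one sequence
tokens = torch.randn(1, sent_len + num_rels, dim)
h = encoder(tokens)
interaction_map = torch.sigmoid(h @ h.transpose(-2, -1) / dim ** 0.5)

# entity-entity and entity-relation interactions are blocks of the same map
ee = interaction_map[0, :sent_len, :sent_len]
er = interaction_map[0, :sent_len, sent_len:]
print(ee.shape, er.shape)  # torch.Size([10, 10]) torch.Size([10, 4])
```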
Introduction: Epigenetic biomarkers are molecular indicators of epigenetic changes, and some studies have suggested that these biomarkers have predictive power for disease risk. This study aims to analyze the relationship between 30 epigenetic biomarkers and the risk of diabetes and cancer using machine learning modeling. Methods: The data for this study were sourced from the NHANES database, which includes DNA methylation arrays and biomarker datasets. Nine algorithms were used to build the models: AdaBoost, GBM, KNN, LightGBM, MLP, RF, SVM, XGBoost, and logistic regression. Model...
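A sketch of that model comparison using scikit-learn equivalents on toy data with 30 features standing in for the biomarkers; LightGBM and XGBoost live in their own packages and are omitted here, and the AUC comparison is illustrative, not the study's pipeline.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.metrics import roc_auc_score
from sklearn.ensemble import AdaBoostClassifier, GradientBoostingClassifier, RandomForestClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.svm import SVC
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=500, n_features=30, random_state=0)  # 30 "biomarkers"
Xtr, Xte, ytr, yte = train_test_split(X, y, random_state=0)

models = {
    "AdaBoost": AdaBoostClassifier(),
    "GBM": GradientBoostingClassifier(),
    "KNN": KNeighborsClassifier(),
    "MLP": MLPClassifier(max_iter=500),
    "RF": RandomForestClassifier(),
    "SVM": SVC(probability=True),
    "Logistic": LogisticRegression(max_iter=1000),
}
for name, model in models.items():
    model.fit(Xtr, ytr)
    auc = roc_auc_score(yte, model.predict_proba(Xte)[:, 1])
    print(f"{name}: AUC = {auc:.3f}")
```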
In-Context Learning (ICL), which formulates target tasks as prompt completion conditioned on in-context demonstrations, has become the prevailing way of utilizing LLMs. In this paper, we first disclose an actual predicament for this typical usage: it cannot scale up with training data due to the context length restriction. Besides, existing works have shown that ICL also suffers from various biases and requires delicate calibration treatment. To address both challenges, we advocate a simple and effective...
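A minimal sketch of the vanilla ICL setup the abstract describes: the prompt is a few demonstrations followed by the query, and the model completes the label. The sentiment template is an illustrative assumption.

```python
demos = [
    ("The film was a delight.", "positive"),
    ("A tedious, joyless slog.", "negative"),
]
query = "An uneven but charming debut."

prompt = "".join(f"Review: {x}\nSentiment: {y}\n\n" for x, y in demos)
prompt += f"Review: {query}\nSentiment:"
print(prompt)
# The scaling problem the paper points out is visible here: the prompt grows
# linearly with the number of demonstrations, so it cannot absorb a full training set.
```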
While large language models (LLMs) have exhibited impressive instruction-following capabilities, it is still unclear whether and to what extent they can respond to explicit constraints that might be entailed in various instructions. As a significant aspect of LLM alignment, it is thus important to formulate such a specialized set of instructions as well as to investigate the resulting behavior of LLMs. To address this vacancy, we propose a new benchmark, CoDI-Eval, to systematically and comprehensively evaluate LLMs' responses...
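A hypothetical sketch in the spirit of such a benchmark: pair each constrained instruction with a programmatic checker and score an LLM's responses by pass rate. CoDI-Eval's actual evaluation pipeline may well differ; the checkers and instructions below are invented for illustration.

```python
# Constraint-following evaluation sketch: instruction -> checker over the response.
checks = {
    "Answer in at most 10 words.": lambda r: len(r.split()) <= 10,
    "Mention the word 'safety'.": lambda r: "safety" in r.lower(),
}

def evaluate(responses: dict) -> float:
    passed = sum(check(responses[instr]) for instr, check in checks.items())
    return passed / len(checks)

print(evaluate({
    "Answer in at most 10 words.": "Large models follow most explicit constraints.",
    "Mention the word 'safety'.": "Alignment work often emphasizes safety and honesty.",
}))  # 1.0
```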
With the notable success of pretrained language models, the pretraining-fine-tuning paradigm has become a dominant solution for natural language understanding (NLU) tasks. Typically, the training instances of a target NLU task are introduced in a completely random order and treated equally at the fine-tuning stage. However, these instances can vary greatly in difficulty, and, similar to human learning procedures, language models can benefit from an easy-to-difficult curriculum. Based on this concept, we propose a curriculum learning (CL) framework. Our...
Pre-trained language models (PLMs), such as BERT and GPT, have revolutionized the field of NLP, not only in the general domain but also in the biomedical domain. Most prior efforts in building biomedical PLMs resorted simply to domain adaptation and focused mainly on English. In this work we introduce eHealth, a Chinese biomedical PLM built from scratch with a new pre-training framework. This framework pre-trains eHealth as a discriminator through both token-level and sequence-level discrimination. The former is to detect input tokens corrupted by...
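A toy sketch of the token-level half of such discriminative pre-training (ELECTRA-style replaced-token detection): the discriminator predicts, per position, whether a token was corrupted. The random corruption step stands in for the paper's generator, and all dimensions are toy values.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
vocab, dim, seq = 100, 32, 12

orig = torch.randint(0, vocab, (1, seq))
corrupted = orig.clone()
mask = torch.rand(1, seq) < 0.15                 # positions a "generator" would replace
corrupted[mask] = torch.randint(0, vocab, (int(mask.sum()),))
labels = (corrupted != orig).float()             # 1 = corrupted, 0 = original

embed = nn.Embedding(vocab, dim)
head = nn.Linear(dim, 1)                         # per-token corruption detector
logits = head(embed(corrupted)).squeeze(-1)
loss = nn.BCEWithLogitsLoss()(logits, labels)
print(loss.item())
```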
Current relation extraction methods suffer from the inadequacy of large-scale annotated data. While distant supervision alleviates the problem of data quantities, there still exists domain disparity in data qualities due to its reliance on domain-restrained knowledge bases. In this work, we propose S2ynRE, a framework of two-stage Self-training with Synthetic data for Relation Extraction. We first leverage the capability of large language models to adapt to the target domain and automatically synthesize large quantities of coherent, realistic...
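A high-level sketch of one plausible reading of that recipe: synthesize unlabeled in-domain data with an LLM, then alternate pseudo-labeling and retraining. `synthesize_with_llm` and the `model` interface are placeholders, not the authors' API.

```python
def synthesize_with_llm(domain_examples, n):
    """Prompt an LLM with in-domain examples to generate n synthetic sentences."""
    raise NotImplementedError("plug in an LLM client here")

def self_train(model, labeled, synthetic, rounds=2):
    for _ in range(rounds):
        pseudo = [(s, model.predict(s)) for s in synthetic]  # pseudo-label synthetic data
        model.fit(labeled + pseudo)                          # retrain on the union
    return model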
Language models pretrained on general-domain corpora usually exhibit considerable degradation when generalizing to downstream tasks of specialized domains. Existing approaches try to construct PLMs for each specific domain either from scratch or through further pretraining, which not only costs substantial resources but also fails to cover all target domains at various granularities. In this work, we propose RADA, a novel Retrieval-Augmented framework for Domain Adaptation. We first construct a textual corpus that covers the...
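An illustrative retrieval-augmentation step in this spirit: fetch the most relevant in-domain passages and attach them to the task input, instead of pretraining a separate PLM per domain. TF-IDF retrieval and the `[SEP]` concatenation are assumptions made for the sketch.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

corpus = [
    "Myocardial infarction is commonly treated with antiplatelet therapy.",
    "The court held that the contract was void for lack of consideration.",
    "Beta blockers reduce heart rate and myocardial oxygen demand.",
]
query = "What drugs are used after a heart attack?"

vec = TfidfVectorizer().fit(corpus + [query])
sims = cosine_similarity(vec.transform([query]), vec.transform(corpus))[0]
top = sims.argsort()[::-1][:2]                       # two most relevant passages
augmented_input = query + " [SEP] " + " ".join(corpus[i] for i in top)
print(augmented_input)
```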
Although most existing works for sensor-based Human Activity Recognition rely on the temporal view, we argue that the spectral view also provides a complementary prior, and we accordingly benchmark a standard multi-view framework with extensive experiments to demonstrate its consistent superiority over single-view opponents. We then delve into the intrinsic mechanism of representation fusion and propose ModalDrop as a novel modality-aware regularization method to learn and exploit representations of both views...
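A sketch of the two views on a sensor window: the raw temporal signal and its spectral counterpart via FFT. The random zeroing of one view imitates modality-aware dropout in spirit only; it is not the paper's ModalDrop formulation.

```python
import numpy as np

rng = np.random.default_rng(0)
window = rng.standard_normal(128)            # one accelerometer channel, temporal view
spectral = np.abs(np.fft.rfft(window))       # spectral view (magnitude spectrum)

def fuse(temporal, spectral, p_drop=0.2, train=True):
    feats = [temporal, spectral]
    if train and rng.random() < p_drop:
        i = rng.integers(2)                  # drop one modality at random
        feats[i] = np.zeros_like(feats[i])
    return np.concatenate(feats)

print(fuse(window, spectral).shape)          # (193,) = 128 temporal + 65 spectral bins
```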
Enhancing the instruction-following ability of Large Language Models (LLMs) primarily demands substantial instruction-tuning datasets. However, the sheer volume of these datasets imposes a considerable computational burden and annotation cost. To investigate a label-efficient instruction tuning method that allows the model itself to actively sample subsets that are equally or even more effective, we introduce a self-evolving mechanism, DiverseEvol. In this process, a model iteratively augments its training subset to refine its own...
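One plausible reading of the iterative sampling step is diversity-driven subset growth over instruction embeddings; the k-center-greedy sketch below illustrates that reading and is not the authors' code.

```python
import numpy as np

rng = np.random.default_rng(0)
pool = rng.standard_normal((200, 16))         # embeddings of candidate instructions

def grow_subset(pool, k):
    chosen = [0]                              # seed with an arbitrary point
    dists = np.linalg.norm(pool - pool[0], axis=1)
    while len(chosen) < k:
        nxt = int(dists.argmax())             # farthest point from the current subset
        chosen.append(nxt)
        dists = np.minimum(dists, np.linalg.norm(pool - pool[nxt], axis=1))
    return chosen

print(grow_subset(pool, 10))
```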
As large language models attract increasing attention and find widespread application, concurrent challenges of reliability also arise. Confidence calibration, an effective analysis method for gauging the reliability of deep models, serves as a crucial tool for assessing and improving model reliability. However, such investigation has been comparatively underexplored for LLMs. In this work, we conduct a systematic examination of the calibration of aligned language models throughout the entire construction process, including pretraining...
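For reference, the standard metric behind such calibration studies is expected calibration error (ECE): bin predictions by confidence and average the gap between per-bin accuracy and confidence. The binning scheme below is the common equal-width variant.

```python
import numpy as np

def ece(confidences, correct, n_bins=10):
    """Expected calibration error with equal-width confidence bins."""
    confidences, correct = np.asarray(confidences), np.asarray(correct)
    bins = np.linspace(0.0, 1.0, n_bins + 1)
    total = 0.0
    for lo, hi in zip(bins[:-1], bins[1:]):
        in_bin = (confidences > lo) & (confidences <= hi)
        if in_bin.any():
            gap = abs(correct[in_bin].mean() - confidences[in_bin].mean())
            total += in_bin.mean() * gap      # weight by the fraction of samples in the bin
    return total

print(ece([0.9, 0.8, 0.6, 0.95], [1, 1, 0, 0]))
```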