NFDI4DS | UHH-SEMS - Publication Details

Li Du

ORCID: 0000-0003-2533-2426

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5023705881

Research Areas

Natural Language Processing Techniques
Topic Modeling
Auction Theory and Applications
Consumer Market Behavior and Pricing
Scheduling and Optimization Algorithms
Supply Chain and Inventory Management
Semantic Web and Ontologies
Machine Learning and Algorithms
Industrial Technology and Control Systems
Business Process Modeling and Analysis
Resource-Constrained Project Scheduling
Evaluation and Optimization Models
Biomedical Text Mining and Ontologies
Scheduling and Timetabling Solutions
Prosthetics and Rehabilitation Robotics
Advanced Computational Techniques and Applications
Simulation and Modeling Applications
Muscle activation and electromyography studies
Intelligent Tutoring Systems and Adaptive Learning
Advanced Graph Neural Networks
Bayesian Modeling and Causal Inference
Optimization and Search Problems
Software Reliability and Analysis Research
Educational Technology and Assessment
Text Readability and Simplification

Fiberhome Technology Group (China)
2024

Harbin Institute of Technology
2019-2024

North China Electric Power University
2021-2024

Northwestern Polytechnical University
2012-2021

Xi'an Aeronautical University
2021

Xidian University
2004-2020

University of Washington
2019

Jilin Medical University
2014

Changchun University
2014

University of Glasgow
2013

The Curious Case of Neural Text Degeneration

OPENALEX - Publications

Ari Holtzman Jan Buys Li Du Maxwell Forbes Yejin Choi

Despite considerable advancements with deep neural language models, the enigma of text degeneration persists when these models are tested as generators. The counter-intuitive empirical observation is that even though use likelihood training objective leads to high quality for a broad range understanding tasks, using decoding bland and strangely repetitive. In this paper, we reveal surprising distributional differences between human machine text. addition, find strategies alone can...

10.48550/arxiv.1904.09751 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Multi-modal fusion for business process prediction in call center scenarios

OPENALEX - Publications

Long Cheng Li Du Cong Liu Yang Hu Fang Fang and 1 more

10.1016/j.inffus.2024.102362 article EN Information Fusion 2024-03-16

Beyond Similarity: A Gradient-based Graph Method for Instruction Tuning Data Selection

OPENALEX - Publications

Zhao Yang Li Du Xiao Ding Yunwei Ouyang Hepeng Wang and 8 more

Large language models (LLMs) have shown great potential across various industries due to their remarkable ability generalize through instruction tuning. However, the limited availability of domain-specific data significantly hampers performance on specialized tasks. While existing methods primarily focus selecting training from general datasets that are similar target domain, they often fail consider joint distribution instructions, resulting in inefficient learning and suboptimal knowledge...

10.48550/arxiv.2502.11062 preprint EN arXiv (Cornell University) 2025-02-16

e-CARE: a New Dataset for Exploring Explainable Causal Reasoning

OPENALEX - Publications

Li Du Xiao Ding Kai Xiong Ting Liu Bing Qin

Understanding causality has vital importance for various Natural Language Processing (NLP) applications. Beyond the labeled instances, conceptual explanations of can provide deep understanding causal fact to facilitate reasoning process. However, such explanation information still remains absent in existing resources. In this paper, we fill gap by presenting a human-annotated explainable CAusal REasoning dataset (e-CARE), which contains over 20K questions, together with natural language...

10.18653/v1/2022.acl-long.33 article EN cc-by Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2022-01-01

File processing security detection in multi-cloud environments: a process mining approach

OPENALEX - Publications

Xiaolu Zhang Lei Cui Wuqiang Shen Jijun Zeng Li Du and 2 more

Abstract Cloud computing has gained popularity in recent years, but with its rise comes concerns about data security. Unauthorized access and attacks on cloud-based data, applications, infrastructure are major challenges that must be addressed. While machine learning algorithms have improved intrusion detection systems cloud security, they often fail to consider the entire life cycle of file processing, making it difficult detect certain issues, especially insider attacks. To address these...

10.1186/s13677-023-00474-y article EN cc-by Journal of Cloud Computing Advances Systems and Applications 2023-07-06

ExCAR: Event Graph Knowledge Enhanced Explainable Causal Reasoning

OPENALEX - Publications

Li Du Xiao Ding Kai Xiong Ting Liu Bing Qin

Li Du, Xiao Ding, Kai Xiong, Ting Liu, Bing Qin. Proceedings of the 59th Annual Meeting Association for Computational Linguistics and 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 2021.

10.18653/v1/2021.acl-long.183 article EN cc-by 2021-01-01

Learning Event Graph Knowledge for Abductive Reasoning

OPENALEX - Publications

Li Du Xiao Ding Ting Liu Bing Qin

Li Du, Xiao Ding, Ting Liu, Bing Qin. Proceedings of the 59th Annual Meeting Association for Computational Linguistics and 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 2021.

10.18653/v1/2021.acl-long.403 article EN cc-by 2021-01-01

Enhancing pretrained language models with structured commonsense knowledge for textual inference

OPENALEX - Publications

Li Du Xiao Ding Kai Xiong Ting Liu Bing Qin

10.1016/j.knosys.2022.109488 article EN Knowledge-Based Systems 2022-08-02

Causal-Guided Active Learning for Debiasing Large Language Models

OPENALEX - Publications

Zhouhao Sun Li Du Xiao Ding Yixuan Ma Yang Zhao and 3 more

10.18653/v1/2024.acl-long.778 article EN 2024-01-01

The effects of an undisclosed regular price and a positive leadtime in a presale mechanism

OPENALEX - Publications

Wanxia Mei Li Du Baozhuang Niu Jincheng Wang Jiejian Feng

10.1016/j.ejor.2015.10.033 article EN European Journal of Operational Research 2015-10-27

Reliability optimization of linear consecutive k-out-of-n: F systems with Birnbaum importance–based quantum genetic algorithm

OPENALEX - Publications

Shuai Zhang Jiangbin Zhao Wenjin Zhu Li Du

Component assignment problem is a common challenge of reliability optimization, which non-deterministic polynomial hard widely used in the linear consecutive k-out-of-n systems. In consideration advantages quantum computing and importance measure, this article proposed novel algorithm, Birnbaum importance–based genetic to improve efficiency accuracy for solving component problem. First, model optimization systems established. Second, detailed procedure algorithm introduced solve Moreover,...

10.1177/1687814019842996 article EN cc-by Advances in Mechanical Engineering 2019-04-01

Word of mouth: How upward social comparisons influence the sharing of consumption experiences

OPENALEX - Publications

Ana Suárez Vázquez Li Du Ana Belén del Río Lanza

Abstract An outcome of upward social comparisons that has been largely overlooked is its effect on non‐transactional behaviours (i.e., word mouth). Previous research identified three different emotional reactions to comparisons: admiration, benign envy and malicious envy. Despite the fact their consumption previously analysed, it remains unclear how these affect mouth intention. This study carries out an experimental design demonstrates admiration positively influence behaviour. However,...

10.1002/cb.1902 article EN Journal of Consumer Behaviour 2020-11-09

Modeling Event Background for If-Then Commonsense Reasoning Using Context-aware Variational Autoencoder

OPENALEX - Publications

Li Du Xiao Ding Ting Liu Zhongyang Li

Li Du, Xiao Ding, Ting Liu, Zhongyang Li. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint (EMNLP-IJCNLP). 2019.

10.18653/v1/d19-1270 article EN cc-by 2019-01-01

Heterogeneous graph knowledge enhanced stock market prediction

OPENALEX - Publications

Kai Xiong Xiao Ding Li Du Ting Liu Bing Qin

We focus on the task of stock market prediction based financial text which contains information that could influence movement market. Previous works mainly utilize a single semantic unit text, such as words, events, sentences, to predict tendency However, interaction different-grained within can be useful for context knowledge supplement and predictive selection, then improve performance prediction. To facilitate this, we propose constructing heterogeneous graph with nodes from task. A novel...

10.1016/j.aiopen.2021.09.001 article EN cc-by-nc-nd AI Open 2021-01-01

Mitigating Reporting Bias in Semi-supervised Temporal Commonsense Inference with Probabilistic Soft Logic

OPENALEX - Publications

Bibo Cai Xiao Ding Bowen Chen Li Du Ting Liu

Acquiring high-quality temporal common sense (TCS) knowledge from free-form text is a crucial but challenging problem for event-centric natural language understanding, due to the reporting bias problem: people rarely report commonly observed events highlight special cases. For example, one may "I get up bed in 1 minute", we can observe "It takes me an hour every morning'' text. Models directly trained upon such corpus would capture distorted TCS knowledge, which could influence model...

10.1609/aaai.v36i10.21288 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2022-06-28

Neural Natural Logic Inference for Interpretable Question Answering

OPENALEX - Publications

Jihao Shi Xiao Ding Li Du Ting Liu Bing Qin

Many open-domain question answering problems can be cast as a textual entailment task, where and candidate answers are concatenated to form hypotheses. A QA system then determines if the supporting knowledge bases, regarded potential premises, entail In this paper, we investigate neural-symbolic approach that integrates natural logic reasoning within deep learning architectures, towards developing effective yet explainable models. The proposed model gradually bridges hypothesis premises...

10.18653/v1/2021.emnlp-main.298 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2021-01-01

A profit sharing scheme for a two-firm joint venture

OPENALEX - Publications

Li Du Qiying Hu Liming Liu

10.1016/j.ejor.2004.06.025 article EN European Journal of Operational Research 2004-09-21

An Empirical Analysis of Bidding Behavior in Simultaneous Ascending-Bid Auctions

OPENALEX - Publications

Li Du Qian Chen Na Bian

Collecting auction data on jinmajia.com, we make an analysis bidding behaviors in simultaneous ascending-bid auctions. Our results indicate that it is not optimal for bidders to use the late- and cross-bidding strategies large auctions with soft ending rule.

10.1109/icee.2010.70 article EN International Conference on E-Business and E-Government 2010-05-01

Process Mining for Wind Turbine Maintenance Process Analysis: A Case Study

OPENALEX - Publications

Li Du Long Cheng Cong Liu

The maintenance of the entire life cycle wind turbines is important for a power project. main reason that whether can perform their best during operation period one key factors to measure success or failure farm investment. However, with increasing number installed turbines, how in an efficient way will become challenging problem. To lead better services, first step analyze process workflow really works real-life scenarios. Process mining such technique which analysis over event logs...

10.1109/ei252483.2021.9713329 article EN 2021 IEEE 5th Conference on Energy Internet and Energy System Integration (EI2) 2021-10-22

Deciphering the lmpact of Pretraining Data on Large Language Models through Machine Unlearning

OPENALEX - Publications

Yang Zhao Li Du Xiao Ding Kai Xiong Zhouhao Sun and 3 more

Through pretraining on a corpus with various sources, Large Language Models (LLMs) have gained impressive performance. However, the impact of each component remains opaque. As result, organization is still empirical and may deviate from optimal. To address this issue, we systematically analyze 48 datasets 5 major categories data LLMs measure their impacts using benchmarks about nine model capabilities. Our analyses provide results contribution multiple corpora performances LLMs, along joint...

10.48550/arxiv.2402.11537 preprint EN arXiv (Cornell University) 2024-02-18

Reasoning subevent relation over heterogeneous event graph

OPENALEX - Publications

Ting‐Ting Wu Xiao Ding Li Du Bing Qin Ting Liu

10.1007/s10115-024-02124-4 article EN Knowledge and Information Systems 2024-05-23

Deciphering the Impact of Pretraining Data on Large Language Models through Machine Unlearning

OPENALEX - Publications

Yang Zhao Li Du Xiao Ding Kai Xiong Zhouhao Sun and 3 more

10.18653/v1/2024.findings-acl.559 article EN Findings of the Association for Computational Linguistics: ACL 2022 2024-01-01

When is a Language Process a Language Model?

OPENALEX - Publications

Li Du H K Lee Jason Eisner Ryan Cotterell

10.18653/v1/2024.findings-acl.659 article EN Findings of the Association for Computational Linguistics: ACL 2022 2024-01-01

Causal-Guided Active Learning for Debiasing Large Language Models

OPENALEX - Publications

Li Du Zhouhao Sun Xiao Ding Yixuan Ma Yang Zhao and 3 more

Although achieving promising performance, recent analyses show that current generative large language models (LLMs) may still capture dataset biases and utilize them for generation, leading to poor generalizability harmfulness of LLMs. However, due the diversity over-optimization problem, previous prior-knowledge-based debiasing methods fine-tuning-based not be suitable To address this issue, we explore combining active learning with causal mechanisms propose a casual-guided (CAL) framework,...

10.48550/arxiv.2408.12942 preprint EN arXiv (Cornell University) 2024-08-23

AmpAgent: An LLM-based Multi-Agent System for Multi-stage Amplifier Schematic Design from Literature for Process and Performance Porting

OPENALEX - Publications

Chengjie Liu Wei-Yu Chen A Peng Du Yuan Li Du and 1 more

Multi-stage amplifiers are widely applied in analog circuits. However, their large number of components, complex transfer functions, and intricate pole-zero distributions necessitate extensive manpower for derivation param sizing to ensure stability. In order achieve efficient the function simplify difficulty circuit design, we propose AmpAgent: a multi-agent system based on language models (LLMs) efficiently designing such from literature with process performance porting. AmpAgent is...

10.48550/arxiv.2409.14739 preprint EN arXiv (Cornell University) 2024-09-23

Coming Soon ...