- Topic Modeling
- Natural Language Processing Techniques
- Advanced Graph Neural Networks
- Multimodal Machine Learning Applications
- Semantic Web and Ontologies
- Privacy-Preserving Technologies in Data
- Intelligent Tutoring Systems and Adaptive Learning
- Software Engineering Research
- Text Readability and Simplification
- Domain Adaptation and Few-Shot Learning
- Expert Finding and Q&A Systems
- Speech Recognition and Synthesis
- Advanced Text Analysis Techniques
- Geophysical Methods and Applications
- Adversarial Robustness in Machine Learning
- Mental Health via Writing
- Stochastic Gradient Optimization Techniques
- Innovative Teaching and Learning Methods
- Speech and Dialogue Systems
- Mathematics, Computing, and Information Processing
- Advanced Neural Network Applications
- Complex Network Analysis Techniques
- Sentiment Analysis and Opinion Mining
- Anomaly Detection Techniques and Applications
- Generative Adversarial Networks and Image Synthesis
East China Normal University
2022-2024
Alibaba Group (United States)
2023
Singapore Management University
2016-2021
Previous work on answering complex questions from knowledge bases usually separately addresses two types of complexity: questions with constraints and questions with multiple hops of relations. In this paper, we handle both types of complexity at the same time. Motivated by the observation that early incorporation of constraints into query graphs can more effectively prune the search space, we propose a modified staged query graph generation method with more flexible ways to generate query graphs. Our experiments clearly show that our method achieves the state of the art on three benchmark KBQA datasets.
Multi-hop Knowledge Base Question Answering (KBQA) aims to find the answer entities that are multiple hops away in the Knowledge Base (KB) from the entities mentioned in the question. A major challenge is the lack of supervision signals at intermediate reasoning steps. Therefore, multi-hop KBQA algorithms can only receive feedback from the final answer, which makes the learning unstable or ineffective. To address this challenge, we propose a novel teacher-student approach for the multi-hop KBQA task. In our approach, the student network aims to find the correct answer to the query, while the teacher network tries...
Knowledge base question answering (KBQA) aims to answer a question over a knowledge base (KB). Recently, a large number of studies focus on semantically or syntactically complicated questions. In this paper, we elaborately summarize the typical challenges and solutions for complex KBQA. We begin with introducing the background of the KBQA task. Next, we present the two mainstream categories of methods for complex KBQA, namely semantic parsing-based (SP-based) methods and information retrieval-based (IR-based) methods. We then review the advanced...
Lei Wang, Wanyu Xu, Yihuai Lan, Zhiqiang Hu, Yunshi Lan, Roy Ka-Wei Lee, Ee-Peng Lim. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2023.
Knowledge base question answering (KBQA) aims to answer a question over a knowledge base (KB). Early studies mainly focused on simple questions over KBs and achieved great success. However, their performances on complex questions are still far from satisfactory. Therefore, in recent years, researchers have proposed a large number of novel methods, which looked into the challenges of answering complex questions. In this survey, we review recent advances in KBQA with a focus on solving complex questions, which usually contain multiple subjects, express compound relations, or involve...
Math word problem (MWP) solving faces a dilemma in number representation learning. In order to avoid the number representation issue and reduce the search space of feasible solutions, existing works striving for MWP solving usually replace real numbers with symbolic placeholders and focus on logic reasoning. However, different from common symbolic reasoning tasks like program synthesis and knowledge graph reasoning, MWP solving has extra requirements in numerical reasoning. In other words, instead of the number value itself, it is the reusable numerical property that matters more. Therefore, we...
Making use of knowledge bases to answer questions (KBQA) is a key direction in question answering systems. Researchers have developed a diverse range of methods to address this problem, but there are still some limitations with the existing methods. Specifically, existing neural network-based methods for KBQA have not taken advantage of the recent "matching-aggregation" framework for sequence matching, and when representing a candidate answer entity, they may not choose the most useful context for matching. In this paper, we explore how to match answers to questions...
Knowledge base question answering (KBQA) is an important task in natural language processing. Existing methods for KBQA usually start with entity linking, which considers mostly named entities found in a question as the starting points in the KB to search for answers to the question. However, relying only on entity linking to look for answer candidates may not be sufficient. In this paper, we propose to perform topic unit linking, where topic units cover a wider range of units of a KB. We use a generation-and-scoring approach to gradually refine the set of topic units. Furthermore,...
While Math Word Problem (MWP) solving has emerged as a popular field of study and made great progress in recent years, most existing methods are benchmarked solely on one or two datasets and implemented with different configurations. In this paper, we introduce the first open-source library for MWPs called MWPToolkit, which provides a unified, comprehensive, and extensible framework for research purposes. Specifically, we deploy 17 deep learning-based MWP solvers and 6 MWP datasets in our toolkit. These advanced models for MWP solving,...
Federated Learning (FL) has emerged as a de facto machine learning area and received rapidly increasing research interest from the community. However, catastrophic forgetting caused by data heterogeneity and partial participation poses distinctive challenges for FL, which are detrimental to the performance. To tackle the problems, we propose a new FL approach (namely GradMA), which takes inspiration from continual learning to simultaneously correct the server-side and worker-side update directions as well as take full advantage of the server's...
Knowledge Base Question Answering (KBQA) has attracted much attention, and recently there has been more interest in multi-hop KBQA. In this paper, we propose a novel iterative sequence matching model to address several limitations of previous methods for multi-hop KBQA. Our method iteratively grows the candidate relation paths that may lead to the answer entities. The method prunes away less relevant branches and incrementally assigns scores to the paths. Empirical results demonstrate that our method can significantly outperform existing methods on three...
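The iterative grow-and-prune procedure this abstract describes can be pictured as a beam search over relation paths. The following is a minimal sketch under assumed names (a toy triple KB, a caller-supplied `score` function, and an illustrative beam size), not the paper's implementation:

```python
# Toy KB of (subject, relation, object) triples; entities and
# relations here are placeholders for illustration only.
TOY_KB = [
    ("q1", "directed_by", "d1"),
    ("d1", "born_in", "c1"),
    ("q1", "starring", "a1"),
    ("a1", "born_in", "c2"),
]

def expand(path):
    """One-hop expansion from the path's frontier entity."""
    frontier = path[-1]
    return [path + [rel, obj] for (subj, rel, obj) in TOY_KB if subj == frontier]

def grow_paths(topic_entity, score, hops=2, beam=2):
    """Iteratively extend candidate relation paths, keeping only the
    top-`beam` scored paths after each hop (branch pruning)."""
    candidates = [[topic_entity]]
    for _ in range(hops):
        grown = [p for c in candidates for p in expand(c)]
        if not grown:
            break
        candidates = sorted(grown, key=score, reverse=True)[:beam]
    return candidates
```

In the actual model the `score` function would be a learned sequence matcher between the question and the partial path; here any callable (even `len`) works for demonstration.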
Zero-shot Visual Question Answering (VQA) is a prominent vision-language task that examines both the visual and textual understanding capability of systems in the absence of training data. Recently, by converting images into captions, information across modalities is bridged, and Large Language Models (LLMs) can apply their strong zero-shot generalization capability to unseen questions. To design ideal prompts for solving VQA via LLMs, several studies have explored different strategies to select or generate...
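Once an image has been captioned, caption-mediated zero-shot VQA reduces to prompt assembly. A minimal sketch, assuming a hypothetical function name and a plain-text prompt layout (the surveyed strategies differ mainly in how the exemplars are selected or generated):

```python
# Hypothetical caption-mediated VQA prompt builder: the image is
# replaced by its caption, and optional in-context exemplars are
# (caption, question, answer) triples. The layout is an assumption,
# not a specific surveyed method.
def vqa_prompt(caption, question, exemplars=()):
    parts = [
        f"Context: {c}\nQuestion: {q}\nAnswer: {a}"
        for c, q, a in exemplars
    ]
    parts.append(f"Context: {caption}\nQuestion: {question}\nAnswer:")
    return "\n\n".join(parts)
```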
This paper proposes a new depression detection system based on LLMs that is both interpretable and interactive. It not only provides a diagnosis, but also diagnostic evidence and personalized recommendations via natural language dialogue with the user. We address challenges such as the processing of large amounts of text and integrate professional diagnostic criteria. Our system outperforms traditional methods across various settings, as demonstrated through case studies.
The task of Question Generation over Knowledge Bases (KBQG) aims to convert a logical form into a natural language question. Owing to the expensive cost of large-scale question annotation, methods for KBQG under low-resource scenarios urgently need to be developed. However, current methods heavily rely on annotated data for fine-tuning, which is not well-suited for few-shot question generation. The emergence of Large Language Models (LLMs) has shown their impressive generalization ability in few-shot tasks. Inspired by Chain-of-Thought...
Recently, numerous new benchmarks have been established to evaluate the performance of large language models (LLMs) via either computing a holistic score or employing another LLM as a judge. However, these approaches suffer from data leakage due to the open access of the benchmark and an inflexible evaluation process. To address this issue, we introduce TreeEval, a benchmark-free evaluation method for LLMs that lets a high-performance LLM host an irreproducible evaluation session and essentially avoids data leakage. Moreover, it performs as an examiner to raise...
Large language models (LLMs) have recently been shown to deliver impressive performance in various NLP tasks. To tackle multi-step reasoning tasks, few-shot chain-of-thought (CoT) prompting includes a few manually crafted step-by-step reasoning demonstrations which enable LLMs to explicitly generate reasoning steps and improve their reasoning task accuracy. To eliminate the manual effort, Zero-shot-CoT concatenates the target problem statement with "Let's think step by step" as an input prompt to LLMs. Despite the success of...
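The Zero-shot-CoT recipe mentioned above is purely a prompting construction and can be sketched in a few lines. The function names below are hypothetical; the trigger phrase and the "Therefore, the answer is" extraction cue follow the commonly reported two-stage format:

```python
# Illustrative sketch of the two prompting stages behind Zero-shot-CoT;
# not tied to any particular LLM API.
def zero_shot_cot_prompt(problem: str) -> str:
    """Stage 1: append the trigger phrase to elicit step-by-step reasoning."""
    return f"Q: {problem}\nA: Let's think step by step."

def answer_extraction_prompt(problem: str, reasoning: str) -> str:
    """Stage 2: feed the generated reasoning back and cue the final answer."""
    return (f"Q: {problem}\nA: Let's think step by step. {reasoning}\n"
            "Therefore, the answer is")
```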
Attracted by the impressive power of Multimodal Large Language Models (MLLMs), the public is increasingly utilizing them to improve the efficiency of daily work. Nonetheless, the vulnerabilities of MLLMs to unsafe instructions bring huge safety risks when these models are deployed in real-world scenarios. In this paper, we systematically survey current efforts on the evaluation, attack, and defense of MLLMs' safety on images and text. We begin with introducing an overview of MLLMs and the understanding of safety, which helps researchers know the detailed...
Developing automatic Math Word Problem (MWP) solvers has been an interest of NLP researchers since the 1960s. Over the last few years, there are a growing number of datasets and deep learning-based methods proposed for effectively solving MWPs. However, most existing methods are benchmarked solely on one or two datasets, varying in different configurations, which leads to a lack of unified, standardized, fair, and comprehensive comparison between methods. This paper presents MWPToolkit, the first open-source framework for solving MWPs. In MWPToolkit, we...
Yunshi Lan, Jing Jiang. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 2021.
Conditional question answering on long documents aims to find probable answers and identify the conditions that need to be satisfied to make the answers correct. Existing approaches solve this task by segmenting documents into multiple sections and attending to information at global and local tokens to predict the answers and corresponding conditions. However, the natural structure of the document and discourse relations between sentences in each section are ignored, which are crucial for condition retrieval across sections as well as the logical interaction between them. To...
The security concerns surrounding Large Language Models (LLMs) have been extensively explored, yet the safety of Multimodal Large Language Models (MLLMs) remains understudied. In this paper, we observe that MLLMs can be easily compromised by query-relevant images, as if the text query itself were malicious. To address this, we introduce MM-SafetyBench, a comprehensive framework designed for conducting safety-critical evaluations of MLLMs against such image-based manipulations. We compiled a dataset comprising 13 scenarios,...
Natural Language Processing (NLP) aims to analyze text or speech via techniques in the computer science field. It serves applications in the domains of healthcare, commerce, education and so on. Particularly, NLP has been widely applied to the education domain and its applications have enormous potential to help teaching and learning. In this survey, we review recent advances in NLP with a focus on solving problems relevant to the education domain. In detail, we begin with introducing the related background and the real-world scenarios in education where NLP could contribute. Then, we present a taxonomy...
Unsupervised Text Style Transfer (UTST) has emerged as a critical task within the domain of Natural Language Processing (NLP), aiming to transfer one stylistic aspect of a sentence into another style without changing its semantics, syntax, or other attributes. This task is especially challenging given the intrinsic lack of parallel text pairings. Among existing methods for UTST tasks, the attention masking approach and Large Language Models (LLMs) are deemed two pioneering methods. However, they have shortcomings in...