NFDI4DS | UHH-SEMS - Publication Details

Ying Wen

ORCID: 0000-0003-1247-2382

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5104546839

Research Areas

Reinforcement Learning in Robotics
Topic Modeling
Natural Language Processing Techniques
Adaptive Dynamic Programming Control
Speech Recognition and Synthesis
Semantic Web and Ontologies
Advanced Bandit Algorithms Research
Complex Systems and Time Series Analysis
Adversarial Robustness in Machine Learning
Multi-Agent Systems and Negotiation
Game Theory and Applications
Evolutionary Game Theory and Cooperation
Advanced Memory and Neural Computing
Research Data Management Practices
Advanced Neural Network Applications
Blockchain Technology Applications and Security
Ethics and Social Impacts of AI
Sparse and Compressive Sensing Techniques
Speech and dialogue systems
Multimodal Machine Learning Applications
Smart Grid Energy Management
Sports Analytics and Performance
Space Satellite Systems and Control
Human Pose and Action Recognition
Spacecraft Design and Technology

University of Manchester
2024

King's College London
2024

Shanghai Jiao Tong University
2021-2024

Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level Coordination in Learning to Play StarCraft Combat Games

OPENALEX - Publications

Peng Peng Ying Wen Yaodong Yang Quan Yuan Zhenkun Tang and 2 more

Many artificial intelligence (AI) applications often require multiple intelligent agents to work in a collaborative effort. Efficient learning for intra-agent communication and coordination is an indispensable step towards general AI. In this paper, we take StarCraft combat game as case study, where the task coordinate team defeat their enemies. To maintain scalable yet effective protocol, introduce Multiagent Bidirectionally-Coordinated Network (BiCNet ['bIknet]) with vectorised extension...

10.48550/arxiv.1703.10069 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning

OPENALEX - Publications

Jakub Grudzien Kuba Ruiqing Chen Muning Wen Ying Wen Fanglei Sun and 2 more

Trust region methods rigorously enabled reinforcement learning (RL) agents to learn monotonically improving policies, leading superior performance on a variety of tasks. Unfortunately, when it comes multi-agent (MARL), the property monotonic improvement may not simply apply; this is because agents, even in cooperative games, could have conflicting directions policy updates. As result, achieving guaranteed joint where each agent acts individually remains an open challenge. In paper, we extend...

10.48550/arxiv.2109.11251 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination

OPENALEX - Publications

Yang Li Zhang Shao Jichen Sun Wenhao Zhang Yali Du and 3 more

Securing coordination between AI agent and teammates (human players or agents) in contexts involving unfamiliar humans continues to pose a significant challenge Zero-Shot Coordination. The issue of cooperative incompatibility becomes particularly prominent when an is unsuccessful synchronizing with certain previously unknown partners. Traditional algorithms have aimed collaborate partners by optimizing fixed objectives within population, fostering diversity strategies behaviors. However,...

10.1613/jair.1.15884 article EN cc-by Journal of Artificial Intelligence Research 2024-07-23

Adaptive Teaming in Multi-Drone Pursuit: Simulation, Training, and Deployment

OPENALEX - Publications

Yang Li Junfan Chen Xue Feng Jianrong Qiu Wenbin Li and 3 more

Adaptive teaming, the ability to collaborate with unseen teammates without prior coordination, remains an underexplored challenge in multi-robot collaboration. This paper focuses on adaptive teaming multi-drone cooperative pursuit, a critical task real-world applications such as border surveillance, search-and-rescue, and counter-terrorism. We first define formalize \textbf{A}daptive Teaming \textbf{M}ulti-\textbf{D}rone \textbf{P}ursuit (AT-MDP) problem introduce AT-MDP framework,...

10.48550/arxiv.2502.09762 preprint EN arXiv (Cornell University) 2025-02-13

RHINO: Learning Real-Time Humanoid-Human-Object Interaction from Human Demonstrations

OPENALEX - Publications

Jingxiao Chen Xinyao Li Jiahang Cao Zhengbang Zhu Wentao Dong and 5 more

Humanoid robots have shown success in locomotion and manipulation. Despite these basic abilities, humanoids are still required to quickly understand human instructions react based on interaction signals become valuable assistants daily life. Unfortunately, most existing works only focus multi-stage interactions, treating each task separately, neglecting real-time feedback. In this work, we aim empower humanoid with reaction abilities achieve various tasks, allowing interrupt at any time,...

10.48550/arxiv.2502.13134 preprint EN arXiv (Cornell University) 2025-02-18

A Study of AI Population Dynamics with Million-agent Reinforcement Learning

OPENALEX - Publications

Yaodong Yang Lantao Yu Yiwei Bai Jun Wang Weinan Zhang and 2 more

We conduct an empirical study on discovering the ordered collective dynamics obtained by a population of intelligence agents, driven million-agent reinforcement learning. Our intention is to put intelligent agents into simulated natural context and verify if principles developed in real world could also be used understanding artificially-created population. To achieve this, we simulate large-scale predator-prey world, where laws are designed only findings or logical equivalence that have...

10.48550/arxiv.1709.04511 preprint EN other-oa arXiv (Cornell University) 2017-01-01

GeoDeepShovel: A platform for building scientific database from geoscience literature withAIassistance

OPENALEX - Publications

Zhang Shao Hui Xu Yuting Jia Ying Wen Dakuo Wang and 3 more

Abstract With the rapid development of big data science, research paradigm in field geosciences has also begun to shift data‐driven scientific discovery. Researchers need read a huge amount literature locate, extract and aggregate relevant results that are published stored PDF format for building database support In this paper, based on findings study about how geoscientists annotate data, we proposed GeoDeepShovel, publicly available AI‐assisted extraction system their needs. GeoDeepShovel...

10.1002/gdj3.186 article EN cc-by Geoscience Data Journal 2023-02-28

On Realization of Intelligent Decision Making in the Real World: A Foundation Decision Model Perspective

OPENALEX - Publications

Ying Wen Ziyu Wan Ming Zhou Shufang Hou Zhe Cao and 5 more

The pervasive uncertainty and dynamic nature of real-world environments present significant challenges for the widespread implementation machine-driven Intelligent Decision-Making (IDM) systems. Consequently, IDM should possess ability to continuously acquire new skills effectively generalize across a broad range applications. advancement Artificial General Intelligence (AGI) that transcends task application boundaries is critical enhancing IDM. Recent studies have extensively investigated...

10.26599/air.2023.9150026 article EN cc-by CAAI Artificial Intelligence Research 2023-12-01

TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision

OPENALEX - Publications

Ruiwen Zhou Yingxuan Yang Muning Wen Ying Wen Wenhao Wang and 4 more

10.1145/3626772.3657788 article EN Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval 2024-07-10

Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games

OPENALEX - Publications

Xiangyu Liu Hangtian Jia Ying Wen Yaodong Yang Yujing Hu and 3 more

Measuring and promoting policy diversity is critical for solving games with strong non-transitive dynamics where strategic cycles exist, there no consistent winner (e.g., Rock-Paper-Scissors). With that in mind, maintaining a pool of diverse policies via open-ended learning an attractive solution, which can generate auto-curricula to avoid being exploited. However, conventional algorithms, are widely accepted definitions diversity, making it hard construct evaluate the policies. In this...

10.48550/arxiv.2106.04958 preprint EN cc-by arXiv (Cornell University) 2021-01-01

A Game-Theoretic Approach to Multi-Agent Trust Region Optimization

OPENALEX - Publications

Ying Wen Hui Chen Yaodong Yang Tian Zheng Minne Li and 2 more

Trust region methods are widely applied in single-agent reinforcement learning problems due to their monotonic performance-improvement guarantee at every iteration. Nonetheless, when multi-agent settings, the of trust no longer holds because an agent's payoff is also affected by other agents' adaptive behaviors. To tackle this problem, we conduct a game-theoretical analysis policy space, and propose method (MATRL), which enables optimization for learning. Specifically, MATRL finds stable...

10.48550/arxiv.2106.06828 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Adaptive Control Strategy for Quadruped Robots in Actuator Degradation Scenarios

OPENALEX - Publications

Xinyuan Wu Wentao Dong Hang Lai Yong Yu Ying Wen

Quadruped robots have strong adaptability to extreme environments but may also experience faults. Once these faults occur, must be repaired before returning the task, reducing their practical feasibility. One prevalent concern among is actuator degradation, stemming from factors like device aging or unexpected operational events. Traditionally, addressing this problem has relied heavily on intricate fault-tolerant design, which demands deep domain expertise developers and lacks...

10.1145/3627676.3627686 preprint EN 2023-11-30

DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning

OPENALEX - Publications

Siyuan Guo Cheng Deng Ying Wen Hechang Chen Yi Chang and 1 more

In this work, we investigate the potential of large language models (LLMs) based agents to automate data science tasks, with goal comprehending task requirements, then building and training best-fit machine learning models. Despite their widespread success, existing LLM are hindered by generating unreasonable experiment plans within scenario. To end, present DS-Agent, a novel automatic framework that harnesses agent case-based reasoning (CBR). development stage, DS-Agent follows CBR...

10.48550/arxiv.2402.17453 preprint EN arXiv (Cornell University) 2024-02-27

Cross-Utterance Conditioned VAE for Speech Generation

OPENALEX - Publications

Yang Li Cheng Yu Guangzhi Sun Weiqin Zu Zheng Tian and 6 more

10.1109/taslp.2024.3453598 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2024-01-01

Mutual Theory of Mind in Human-AI Collaboration: An Empirical Study with LLM-driven AI Agents in a Real-time Shared Workspace Task

OPENALEX - Publications

Zhang Shao Xihuai Wang Wenhao Zhang Yongshan Chen Lei Gao and 4 more

Theory of Mind (ToM) significantly impacts human collaboration and communication as a crucial capability to understand others. When AI agents with ToM collaborate humans, Mutual (MToM) arises in such human-AI teams (HATs). The MToM process, which involves interactive ToM-based strategy adjustment, affects the team's performance process. To explore we conducted mixed-design experiment using large language model-driven agent modules real-time shared-workspace task. We find that agent's does...

10.48550/arxiv.2409.08811 preprint EN arXiv (Cornell University) 2024-09-13

HOLA-Drone: Hypergraphic Open-ended Learning for Zero-Shot Multi-Drone Cooperative Pursuit

OPENALEX - Publications

Yang Li Dengyu Zhang Junfan Chen Ying Wen Qingrui Zhang and 2 more

Zero-shot coordination (ZSC) is a significant challenge in multi-agent collaboration, aiming to develop agents that can coordinate with unseen partners they have not encountered before. Recent cutting-edge ZSC methods primarily focused on two-player video games such as OverCooked!2 and Hanabi. In this paper, we extend the scope of research multi-drone cooperative pursuit scenario, exploring how construct drone agent capable coordinating multiple capture evaders. We propose novel Hypergraphic...

10.48550/arxiv.2409.08767 preprint EN arXiv (Cornell University) 2024-09-13

Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech

OPENALEX - Publications

Yang Li Yu Cheng Guangzhi Sun Hua Jiang Fanglei Sun and 4 more

Yang Li, Cheng Yu, Guangzhi Sun, Hua Jiang, Fanglei Weiqin Zu, Ying Wen, Yang, Jun Wang. Proceedings of the 60th Annual Meeting Association for Computational Linguistics (Volume 1: Long Papers). 2022.

10.18653/v1/2022.acl-long.30 article EN cc-by Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2022-01-01

Self-Supervised MAFENN for Classifying Low-labeled Distorted Images over Mobile Fading Channels

OPENALEX - Publications

Yang Li Fanglei Sun Jingchen Hu Chang Liu Fan Wu and 8 more

Image distortion during wireless transmission presents a significant challenge for real-world artificial intelligence (AI) applications. Recent methods have attempted to address this issue by integrating neural networks into the system. However, these approaches often require large volume of labeled training data, which can be expensive and time-consuming collect. To issue, we propose novel approach, <bold xmlns:mml="http://www.w3.org/1998/Math/MathML"...

10.1109/tmc.2023.3343939 article EN IEEE Transactions on Mobile Computing 2023-12-19

DCAC: Reducing Unnecessary Conservatism in Offline-to-online Reinforcement Learning

OPENALEX - Publications

Dongxiang Chen Ying Wen

Recent advancements in offline reinforcement learning (RL) have facilitated the training of powerful agents using fixed datasets exclusively. Despite this, quality a dataset plays critical role determining an agent's performance, and high-quality are often scarce. This scarcity necessitates enhancement through subsequent environmental interactions. Particularly, state-action distribution shift may exert potentially detrimental effect on well-initialized policies, thus impeding...

10.1145/3627676.3627677 article EN 2023-11-30

Coming Soon ...