NFDI4DS | UHH-SEMS - Publication Details

Wenhao Li

ORCID: 0000-0003-2985-1098

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100362638

Research Areas

Reinforcement Learning in Robotics
Machine Learning and ELM
Domain Adaptation and Few-Shot Learning
Topic Modeling
Adaptive Dynamic Programming Control
Sparse and Compressive Sensing Techniques
Distributed Control Multi-Agent Systems
Medical Image Segmentation Techniques
Advanced Neural Network Applications
Cyclone Separators and Fluid Dynamics
Advanced Fluorescence Microscopy Techniques
Multimodal Machine Learning Applications
Metaheuristic Optimization Algorithms Research
Photoacoustic and Ultrasonic Imaging
Text and Document Classification Technologies
Aerodynamics and Acoustics in Jet Flows
Explainable Artificial Intelligence (XAI)
Aerosol Filtration and Electrostatic Precipitation
Advanced Algorithms and Applications
Natural Language Processing Techniques
Model Reduction and Neural Networks
Optimization and Search Problems
Grey System Theory Applications
Artificial Intelligence in Games
Neural dynamics and brain function

Chinese University of Hong Kong, Shenzhen
2024

East China Normal University
2019-2023

Anhui University
2023

Meizu (China)
2023

Tsinghua University
2018-2022

Center for Information Technology
2022

Beijing Academy of Artificial Intelligence
2022

Tongji University
2019

Pennsylvania State University
2019

Shanghai Key Laboratory of Trustworthy Computing
2019

Iteratively-Refined Interactive 3D Medical Image Segmentation With Multi-Agent Reinforcement Learning

OPENALEX - Publications

Xuan Liao Wenhao Li Qisen Xu Xiangfeng Wang Bo Jin and 3 more

Existing automatic 3D image segmentation methods usually fail to meet the clinic use. Many studies have explored an interactive strategy improve performance by iteratively incorporating user hints. However, dynamic process for successive interactions is largely ignored. We here propose model of iterative as a Markov decision (MDP) and solve it with reinforcement learning (RL). Unfortunately, intractable use single-agent RL voxel-wise prediction due large exploration space. To reduce space...

10.1109/cvpr42600.2020.00941 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Automatic Poetry Generation with Mutual Reinforcement Learning

OPENALEX - Publications

Xiaoyuan Yi Maosong Sun Ruoyu Li Wenhao Li

Poetry is one of the most beautiful forms human language art. As a crucial step towards computer creativity, automatic poetry generation has drawn researchers' attention for decades. In recent years, some neural models have made remarkable progress in this task. However, they are all based on maximum likelihood estimation, which only learns common patterns corpus and results loss-evaluation mismatch. Human experts evaluate terms specific criteria, instead word-level likelihood. To handle...

10.18653/v1/d18-1353 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2018-01-01

Learning structured communication for multi-agent reinforcement learning

OPENALEX - Publications

Junjie Sheng Xiangfeng Wang Bo Jin Junchi Yan Wenhao Li and 3 more

10.1007/s10458-022-09580-8 article EN Autonomous Agents and Multi-Agent Systems 2022-08-26

Pressure prediction for air cyclone centrifugal classifier based on CNN-LSTM enhanced by attention mechanism

OPENALEX - Publications

Wenhao Li Xinhao Li Jiale Yuan Runyu Liu Yuhan Liu and 3 more

10.1016/j.cherd.2024.04.045 article EN Process Safety and Environmental Protection 2024-04-26

GraphThought: Graph Combinatorial Optimization with Thought Generation

OPENALEX - Publications

Zixiao Huang Lifeng Guo Junjie Sheng Haosheng Chen Wenhao Li and 3 more

Large language models (LLMs) have demonstrated remarkable capabilities across various domains, especially in text processing and generative tasks. Recent advancements the reasoning of state-of-the-art LLMs, such as OpenAI-o1, significantly broadened their applicability, particularly complex problem-solving logical inference. However, most existing LLMs struggle with notable limitations handling graph combinatorial optimization (GCO) problems. To bridge this gap, we formally define Optimal...

10.48550/arxiv.2502.11607 preprint EN arXiv (Cornell University) 2025-02-17

SVTformer: Spatial-View-Temporal Transformer for Multi-View 3D Human Pose Estimation

OPENALEX - Publications

Wanruo Zhang Mengyuan Liu Hong Liu Wenhao Li

Recently, transformer-based methods have been introduced to estimate 3D human pose from multiple views by aggregating the spatial-temporal information of joints achieve lifting 2D 3D. However, previous approaches cannot model inter-frame correspondence each view's joint individually, nor can they directly consider all view interactions at time, leading insufficient learning multi-view associations. To address this issue, we propose a Spatial-View-Temporal transformer (SVTformer) decouple...

10.1609/aaai.v39i10.33101 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

HiNet: Novel Multi-Scenario & Multi-Task Learning with Hierarchical Information Extraction

OPENALEX - Publications

Jie Zhou Xianshuai Cao Wenhao Li Lin Bo Kun Zhang and 2 more

Multi-scenario & multi-task learning has been widely applied to many recommendation systems in industrial applications, wherein an effective and practical approach is carry out multi-scenario transfer on the basis of Mixture-of-Expert (MoE) architecture. However, MoE-based method, which aims project all information same feature space, cannot effectively deal with complex relationships inherent among various scenarios tasks, resulting unsatisfactory performance. To tackle problem, we propose...

10.1109/icde55515.2023.00227 article EN 2022 IEEE 38th International Conference on Data Engineering (ICDE) 2023-04-01

A hybrid EMD-GRU model for pressure prediction in air cyclone centrifugal classifiers

OPENALEX - Publications

Haishen Jiang Wenhao Li Yuhan Liu Runyu Liu Yadong Yang and 2 more

10.1016/j.apt.2024.104743 article EN Advanced Powder Technology 2024-12-05

Distributed and Parallel ADMM for Structured Nonconvex Optimization Problem

OPENALEX - Publications

Xiangfeng Wang Junchi Yan Bo Jin Wenhao Li

The nonconvex optimization problems have recently attracted significant attention. However, both efficient algorithm and solid theory are still very limited. difficulty is even pronounced for structured large-scale in many real-world applications. This article proposes an application-driven algorithmic framework with distributed parallel techniques, which jointly handles the high dimensionality of model parameters training data. theoretical convergence our established under moderate...

10.1109/tcyb.2019.2950337 article EN IEEE Transactions on Cybernetics 2019-12-30

A two-stage multi-hypothesis reconstruction scheme in compressed video sensing

OPENALEX - Publications

Weifeng Ou Chunling Yang Wenhao Li Lihong Ma

Existing multi-hypothesis (MH) prediction algorithms in compressed video sensing (CVS) are all deployed measurement domain, which restricts the flexibility of block partitioning reconstruction process and decreases accuracy. To address this issue, paper proposes a two-stage (2sMHR) scheme deploys MH domain pixel successively. Two implementation schemes, GOP-wise frame-wise scheme, developed for 2sMHR. Furthermore, new weighted metric combining Euclidean distance correlation coefficient is...

10.1109/icip.2016.7532808 article EN 2022 IEEE International Conference on Image Processing (ICIP) 2016-08-17

F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning

OPENALEX - Publications

Wenhao Li Bo Jin Xiangfeng Wang Junchi Yan Hongyuan Zha

Traditional centralized multi-agent reinforcement learning (MARL) algorithms are sometimes unpractical in complicated applications, due to non-interactivity between agents, curse of dimensionality and computation complexity. Hence, several decentralized MARL motivated. However, existing methods only handle the fully cooperative setting where massive information needs be transmitted training. The block coordinate gradient descent scheme they used for successive independent actor critic steps...

10.48550/arxiv.2004.11145 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Structured Cooperative Reinforcement Learning With Time-Varying Composite Action Space

OPENALEX - Publications

Wenhao Li Xiangfeng Wang Bo Jin Dijun Luo Hongyuan Zha

In recent years, reinforcement learning has achieved excellent results in low-dimensional static action spaces such as games and simple robotics. However, the space is usually composite, composed of multiple sub-action with different functions, time-varying for practical tasks. The existing sub-actions might be temporarily invalid due to external environment, while unseen can added current system. To solve robustness transferability problems composite spaces, we propose a structured...

10.1109/tpami.2021.3102140 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2021-08-04

Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning

OPENALEX - Publications

Wenhao Li Dan Qiao Baoxiang Wang Xiangfeng Wang Bo Jin and 1 more

The difficulty of appropriately assigning credit is particularly heightened in cooperative MARL with sparse reward, due to the concurrent time and structural scales involved. Automatic subgoal generation (ASG) has recently emerged as a viable approach inspired by utilizing subgoals intrinsically motivated reinforcement learning. However, end-to-end learning complex task planning from rewards without prior knowledge, undoubtedly requires massive training samples. Moreover, diversity-promoting...

10.48550/arxiv.2305.10865 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Market Segmentation Through Information

OPENALEX - Publications

Matthew Elliott Andrea Galeotti Andrew Koh Wenhao Li

An information designer has precise about consumers' preferences over products sold by oligopolists. The chooses what to reveal differentiated frms who, then, compete on price making personalized offers. We ask market outcomes the can achieve. is a metaphor for an internet platform who collects data users and sells it firms can, in turn, target discounts promotions towards different consumers. Our analysis provides new benchmarks demonstrating power that users' endow platforms with. These...

10.2139/ssrn.3432315 article EN SSRN Electronic Journal 2019-01-01

Learning Roles with Emergent Social Value Orientations

OPENALEX - Publications

Wenhao Li Xiangfeng Wang Bo Jin Jingyi Lu Hongyuan Zha

Social dilemmas can be considered situations where individual rationality leads to collective irrationality. The multi-agent reinforcement learning community has leveraged ideas from social science, such as value orientations (SVO), solve in complex cooperative tasks. In this paper, by first introducing the typical "division of labor or roles" mechanism human society, we provide a promising solution for intertemporal (ISD) with SVOs. A novel framework, called Learning Roles Emergent SVOs...

10.48550/arxiv.2301.13812 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Interactive medical image segmentation with self-adaptive confidence calibration

OPENALEX - Publications

Chuyun Shen Wenhao Li Qisen Xu Bin Hu Bo Jin and 4 more

10.1631/fitee.2200299 article EN Frontiers of Information Technology & Electronic Engineering 2023-09-01

Learning Structured Communication for Multi-agent Reinforcement Learning

OPENALEX - Publications

Junjie Sheng Xiangfeng Wang Bo Jin Junchi Yan Wenhao Li and 3 more

This work explores the large-scale multi-agent communication mechanism under a reinforcement learning (MARL) setting. We summarize general categories of topology for structures in MARL literature, which are often manually specified. Then we propose novel framework termed as Learning Structured Communication (LSC) by using more flexible and efficient topology. Our allows adaptive agent grouping to form different hierarchical formations over episodes, is generated an auxiliary task combined...

10.48550/arxiv.2002.04235 preprint EN other-oa arXiv (Cornell University) 2020-01-01

A multihypothesis-based residual reconstruction scheme in compressed video sensing

OPENALEX - Publications

Wenhao Li Chunling Yang Lihong Ma

A multihypothesis-based residual reconstruction scheme (MHRR) is presented in compressed video sensing (CVS). The first predicted by a novel multihypothesis (MH) prediction method and the second then reconstructed independently. In proposed MHRR, of generating hypothesis blocks domain offered, are obtained pixel-domain ME technique linear weights calculated measurement-domain, which can combine advantages measurement MH prediction. Simulation results show that MHRR achieve higher performance...

10.1109/icip.2017.8296786 article EN 2022 IEEE International Conference on Image Processing (ICIP) 2017-09-01

HMRL

OPENALEX - Publications

Yun Hua Xiangfeng Wang Bo Jin Wenhao Li Junchi Yan and 2 more

In spite of the success existing meta reinforcement learning methods, they still have difficulty in a policy effectively for RL problems with sparse reward. this respect, we develop novel framework called Hyper-Meta RL(HMRL), reward problems. It is consisted three modules including cross-environment state embedding module which constructs common space to adapt different environments; based environment-specific shaping extends original trajectory by cross-environmental knowledge...

10.1145/3447548.3467242 article EN 2021-08-12

Complementary information mutual learning for multimodality medical image segmentation

OPENALEX - Publications

Chuyun Shen Wenhao Li Haoqing Chen Xiaoling Wang Fengping Zhu and 3 more

10.1016/j.neunet.2024.106670 article EN Neural Networks 2024-09-06

Coming Soon ...