Yixing Lan

ORCID: 0000-0003-4503-643X
Research Areas
  • Reinforcement Learning in Robotics
  • Algebraic structures and combinatorial models
  • Efficiency Analysis Using DEA
  • Adaptive Dynamic Programming Control
  • Advanced Algebra and Geometry
  • Multi-Criteria Decision Making
  • Environmental Impact and Sustainability
  • Domain Adaptation and Few-Shot Learning
  • Evolutionary Algorithms and Applications
  • Nonlinear Waves and Solitons
  • Elevator Systems and Control
  • Energy, Environment, Economic Growth
  • Optimization and Mathematical Programming
  • Economic Growth and Productivity
  • Neural Networks and Reservoir Computing
  • Advanced Combinatorial Mathematics
  • Artificial Intelligence in Healthcare
  • Commutative Algebra and Its Applications
  • Advanced Bandit Algorithms Research
  • Fuel Cells and Related Materials
  • Autophagy in Disease and Therapy
  • Advanced Statistical Methods and Models
  • HIV Research and Treatment
  • Imbalanced Data Classification Techniques
  • Robot Manipulation and Learning

Fuzhou University
2011-2024

National University of Defense Technology
2021-2024

South China Agricultural University
2023

Ministry of Agriculture and Rural Affairs
2023

10.1016/j.mcm.2011.06.064 article EN publisher-specific-oa Mathematical and Computer Modelling 2011-07-08

In recent years, the multiple traveling salesmen problem (MTSP), a multi-salesman extension of the TSP, has received increasing research interest, and one of its main applications is coordinated multirobot mission planning, such as cooperative search and rescue tasks. However, it is still challenging to solve the MTSP with improved inference efficiency as well as solution quality in varying situations, e.g., different city positions, numbers of cities, and numbers of agents. In this article, we propose an attention-based multiagent reinforcement learning (AMARL)...
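
The entry above describes an attention-based policy for the MTSP. As a rough illustration of the general mechanism (not the paper's architecture), the sketch below scores unvisited cities for a single agent with dot-product attention; all dimensions, names, and the masking scheme are illustrative assumptions.

```python
# Minimal sketch (not the authors' code): attention-based scoring of candidate
# cities for one agent in an MTSP-style assignment policy.
import torch
import torch.nn as nn

class CityAttentionPolicy(nn.Module):
    def __init__(self, city_dim=2, agent_dim=4, embed_dim=64):
        super().__init__()
        self.city_enc = nn.Linear(city_dim, embed_dim)    # embed (x, y) city coordinates
        self.agent_enc = nn.Linear(agent_dim, embed_dim)  # embed agent state (position, load, ...)
        self.scale = embed_dim ** 0.5

    def forward(self, cities, agent_state, visited_mask):
        # cities: (N, city_dim), agent_state: (agent_dim,), visited_mask: (N,) bool
        keys = self.city_enc(cities)                               # (N, E)
        query = self.agent_enc(agent_state)                        # (E,)
        scores = keys @ query / self.scale                         # (N,) attention logits
        scores = scores.masked_fill(visited_mask, float("-inf"))   # forbid visited cities
        return torch.softmax(scores, dim=0)                        # probability of each city

policy = CityAttentionPolicy()
probs = policy(torch.rand(10, 2), torch.rand(4), torch.zeros(10, dtype=torch.bool))
next_city = torch.multinomial(probs, 1)  # sample the agent's next city
```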

10.1109/tnnls.2023.3236629 article EN IEEE Transactions on Neural Networks and Learning Systems 2023-02-08

Humans excel at reusing prior knowledge to address new challenges and at developing skills while solving problems. This paradigm is becoming increasingly popular in the development of autonomous agents, as it yields systems that can self-evolve in response to new challenges, much as human beings do. However, previous methods suffer from limited training efficiency when expanding new skills and fail to fully leverage prior knowledge to facilitate new task learning. In this paper, we propose Parametric Skill Expansion and Composition (PSEC), a framework designed...
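
As a loose illustration of parameter-level skill expansion and composition in general (not the PSEC method itself, whose details are truncated above), the sketch below composes low-rank skill adapters over a frozen base layer with learned weights; the module names and the composition rule are assumptions.

```python
# Minimal sketch (assumptions, not the paper's exact method): composing several
# low-rank "skill" adapters on top of a frozen base layer, weighted per skill.
import torch
import torch.nn as nn

class SkillAdapter(nn.Module):
    """Low-rank residual adapter representing one skill."""
    def __init__(self, dim, rank=8):
        super().__init__()
        self.down = nn.Linear(dim, rank, bias=False)
        self.up = nn.Linear(rank, dim, bias=False)

    def forward(self, x):
        return self.up(self.down(x))

class ComposedPolicyLayer(nn.Module):
    def __init__(self, dim, num_skills):
        super().__init__()
        self.base = nn.Linear(dim, dim)                  # frozen base layer
        self.base.requires_grad_(False)
        self.skills = nn.ModuleList([SkillAdapter(dim) for _ in range(num_skills)])
        self.logits = nn.Parameter(torch.zeros(num_skills))  # learned composition weights

    def forward(self, x):
        w = torch.softmax(self.logits, dim=0)            # convex combination of skills
        residual = sum(wi * skill(x) for wi, skill in zip(w, self.skills))
        return self.base(x) + residual                   # base behavior plus composed skills

layer = ComposedPolicyLayer(dim=32, num_skills=3)
out = layer(torch.randn(5, 32))  # (5, 32)
```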

10.48550/arxiv.2502.05932 preprint EN arXiv (Cornell University) 2025-02-09

In recent years, cooperative Multi-Agent Deep Reinforcement Learning (MADRL) has received increasing research interest and has been widely applied to computer games, coordinated multi-robot systems, etc. However, it is still challenging to realize high solution quality and learning efficiency for MADRL under the conditions of incomplete and noisy observations. To this end, this paper proposes a multi-agent deep reinforcement learning approach with Grouped Cognitive featurE representatioN (GCEN), following the paradigm...

10.1109/tcds.2023.3323987 article EN IEEE Transactions on Cognitive and Developmental Systems 2023-10-16

Reinforcement learning from demonstration (RLfD) is considered a promising approach to improve reinforcement learning (RL) by leveraging expert demonstrations as additional decision-making guidance. However, most existing RLfD methods regard demonstrations only as low-level knowledge instances under a certain task. Demonstrations are generally used either to provide rewards or to pretrain the neural network-based RL policy in a supervised manner, usually resulting in poor generalization capability and weak robustness...
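
The abstract names the two common uses of demonstrations that the paper criticizes: reward provision and supervised pretraining. The sketch below illustrates both in their plain textbook form, assuming toy networks and placeholder demonstration data; it is not the paper's proposed method.

```python
# Minimal sketch (illustrative): (1) behavior-cloning pretraining on demonstrations,
# (2) reward shaping by similarity to demonstrated actions.
import torch
import torch.nn as nn

policy = nn.Sequential(nn.Linear(4, 64), nn.ReLU(), nn.Linear(64, 2))  # toy policy
optimizer = torch.optim.Adam(policy.parameters(), lr=1e-3)

# (1) Supervised pretraining (behavior cloning) on demonstration pairs.
demo_states = torch.randn(256, 4)          # placeholder demonstration data
demo_actions = torch.randn(256, 2)
for _ in range(100):
    loss = nn.functional.mse_loss(policy(demo_states), demo_actions)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

# (2) Reward shaping: add a bonus when the agent's action is close to a demo action.
def shaped_reward(env_reward, action, demo_action, beta=0.1):
    return env_reward - beta * torch.norm(action - demo_action).item()
```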

10.1155/2021/7588221 article EN cc-by Computational Intelligence and Neuroscience 2021-01-01

Policy evaluation (PE) is a critical sub-problem in reinforcement learning; it estimates the value function for a given policy and can be used for policy improvement. However, there still exist some limitations in current PE methods, such as low sample efficiency and local convergence, especially on complex tasks. In this study, a novel algorithm called Least-Squares Truncated Temporal-Difference learning (LST²D) is proposed. In LST²D, an adaptive truncation mechanism is designed, which effectively takes advantage...
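
LST²D builds on least-squares temporal-difference learning. The sketch below is standard LSTD(0) with a ridge term, included as a baseline reference; the paper's adaptive truncation mechanism is not reproduced, and the feature map and data here are illustrative.

```python
# Minimal sketch of standard LSTD(0); the truncated variant above extends this idea.
import numpy as np

def lstd0(transitions, feature_fn, gamma=0.99, ridge=1e-3):
    """Estimate value-function weights w so that V(s) ~= w . phi(s).

    transitions: list of (state, reward, next_state, done)
    feature_fn:  maps a state to a feature vector phi(s)
    """
    d = len(feature_fn(transitions[0][0]))
    A = ridge * np.eye(d)            # ridge term keeps A invertible
    b = np.zeros(d)
    for s, r, s_next, done in transitions:
        phi = feature_fn(s)
        phi_next = np.zeros(d) if done else feature_fn(s_next)
        A += np.outer(phi, phi - gamma * phi_next)
        b += r * phi
    return np.linalg.solve(A, b)     # closed-form least-squares solution

# Example with a trivial 1-D feature map.
data = [(0.0, 1.0, 1.0, False), (1.0, 0.0, 2.0, True)]
w = lstd0(data, feature_fn=lambda s: np.array([1.0, s]), gamma=0.9)
```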

10.1049/cit2.12202 article EN cc-by-nc-nd CAAI Transactions on Intelligence Technology 2023-03-16

Japanese encephalitis virus (JEV) is a typical mosquito-borne flavivirus that can cause central nervous system diseases in humans and animals. Host factors attempt to limit replication when viruses invade the host, while viruses use various strategies for replication. It is essential to clarify which host factors affect the life cycle of JEV and to explore their underlying mechanisms. Here, we found that USP1-associated factor 1 (UAF1; also known as WD repeat-containing protein 48) modulated JEV replication. We observed that viral propagation was significantly increased in UAF1-depleted...

10.1128/spectrum.03186-22 article EN cc-by Microbiology Spectrum 2023-03-29

Purpose – The purpose of this study is to evaluate the potential risks of copyright infringement in the digital library based on extension theory. Design/methodology/approach – At first, the analytic hierarchy process (AHP) is used to determine the weights of an existing indicator system for early warning. Second, an early-warning model is built based on extension theory for the digital library. Finally, a real-world application is presented to show the effectiveness and usefulness of the approach. Findings – The main findings of the paper are as follows: the early-warning model is effective in distinguishing the risk degree of the digital library;...
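
The AHP weighting step mentioned above can be made concrete with a small worked example: the weights are the normalized principal eigenvector of a pairwise comparison matrix, checked with a consistency ratio. The matrix and the three criteria here are illustrative, not the paper's indicator system or judgments.

```python
# Minimal worked example of AHP weighting with an illustrative 3-criterion matrix.
import numpy as np

# Saaty-scale pairwise comparisons: A[i, j] = importance of criterion i over j.
A = np.array([
    [1.0, 3.0, 5.0],
    [1/3, 1.0, 2.0],
    [1/5, 1/2, 1.0],
])

eigvals, eigvecs = np.linalg.eig(A)
k = np.argmax(eigvals.real)                 # principal eigenvalue
weights = np.abs(eigvecs[:, k].real)
weights /= weights.sum()                    # normalized criterion weights

# Consistency check: CI = (lambda_max - n) / (n - 1), compared against the
# random index RI (0.58 for n = 3); CR < 0.1 is the usual acceptance threshold.
n = A.shape[0]
ci = (eigvals.real[k] - n) / (n - 1)
cr = ci / 0.58
print(weights, cr)
```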

10.1108/el-04-2014-0064 article EN The Electronic Library 2016-03-24

In offline reinforcement learning, the challenge of out-of-distribution (OOD) actions is pronounced. To address this, existing methods often constrain the learned policy through policy regularization. However, these methods suffer from the issue of unnecessary conservativeness, hampering policy improvement. This occurs due to the indiscriminate use of all actions from the behavior policy that generates the dataset as constraints. The problem becomes particularly noticeable when the quality of the dataset is suboptimal. Thus, we propose Adaptive Advantage-guided Policy...
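
As a rough sketch of advantage-guided policy regularization in general (not the exact objective of the paper above), the snippet below applies a behavior-cloning penalty only to dataset actions whose estimated advantage is positive. Networks, dimensions, and data are toy assumptions.

```python
# Minimal sketch: behavior-cloning regularization masked by positive advantage,
# so low-quality dataset actions do not over-constrain the learned policy.
import torch
import torch.nn as nn

state_dim, action_dim = 4, 2
policy = nn.Sequential(nn.Linear(state_dim, 64), nn.ReLU(), nn.Linear(64, action_dim))
q_net = nn.Sequential(nn.Linear(state_dim + action_dim, 64), nn.ReLU(), nn.Linear(64, 1))
value_net = nn.Sequential(nn.Linear(state_dim, 64), nn.ReLU(), nn.Linear(64, 1))

def advantage_guided_bc_loss(states, dataset_actions):
    """Behavior-cloning term applied only where the dataset action has positive advantage."""
    with torch.no_grad():
        q = q_net(torch.cat([states, dataset_actions], dim=-1))
        adv = q - value_net(states)
        keep = (adv > 0).float().squeeze(-1)        # mask out low-advantage dataset actions
    mse = ((policy(states) - dataset_actions) ** 2).sum(dim=-1)
    return (keep * mse).mean()

states = torch.randn(8, state_dim)
dataset_actions = torch.randn(8, action_dim)
loss = advantage_guided_bc_loss(states, dataset_actions)
# A full actor objective would add the usual value-maximization term, e.g.
#   actor_loss = -q_net(torch.cat([states, policy(states)], dim=-1)).mean() + alpha * loss
```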

10.48550/arxiv.2405.19909 preprint EN arXiv (Cornell University) 2024-05-30

In [8], Fang-Lan-Xiao proved a formula about Lusztig's induction and restriction functors, which can induce Green's formula for the path algebra of a quiver over a finite field via the trace map. In this paper, we generalize their formula to that of mixed semisimple perverse sheaves with an automorphism. By applying the trace map, we obtain Green's formula for any finite-dimensional hereditary algebra over a finite field.

10.48550/arxiv.2406.03238 preprint EN arXiv (Cornell University) 2024-06-05

The present paper continues the work of [2]. For any symmetrizable generalized Cartan matrix $C$ and the corresponding quantum group $\mathbf{U}$, we consider the associated quiver $Q$ with an admissible automorphism $a$. We construct the category $\widetilde{\mathcal{Q}/\mathcal{N}}$, a localization of the category of Lusztig sheaves for the automorphism. Its Grothendieck group gives a realization of the integrable highest weight $\mathbf{U}$-module $\Lambda_{\lambda}$, and Lusztig sheaves modulo traceless ones provide its (signed) canonical basis...

10.48550/arxiv.2411.09188 preprint EN arXiv (Cornell University) 2024-11-14

10.1109/tcds.2024.3504256 article EN IEEE Transactions on Cognitive and Developmental Systems 2024-01-01

Deep reinforcement learning (RL) typically requires a tremendous number of training samples, which is not practical in many applications. State abstraction and world models are two promising approaches for improving the sample efficiency of deep RL. However, both approaches may degrade the learning performance. In this article, we propose an abstracted model-based policy learning (AMPL) algorithm, which improves the sample efficiency of deep RL. In AMPL, a novel state abstraction method via multistep bisimulation is first developed to learn task-related latent state spaces. Hence, the original...
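
The state-abstraction idea above can be illustrated with a simplified one-step bisimulation-style objective: latent distances are trained to match reward differences plus discounted distances between predicted next latents. This is a generic sketch under toy assumptions, not the paper's multistep formulation or full AMPL algorithm.

```python
# Minimal sketch of a one-step bisimulation-style objective for learning a
# task-related latent space.
import torch
import torch.nn as nn

obs_dim, latent_dim = 8, 4
encoder = nn.Sequential(nn.Linear(obs_dim, 64), nn.ReLU(), nn.Linear(64, latent_dim))
dynamics = nn.Linear(latent_dim, latent_dim)   # toy latent transition model

def bisimulation_loss(obs_i, obs_j, r_i, r_j, gamma=0.99):
    """Match latent distances to reward differences plus discounted next-latent distances."""
    z_i, z_j = encoder(obs_i), encoder(obs_j)
    with torch.no_grad():
        next_dist = torch.norm(dynamics(z_i) - dynamics(z_j), dim=-1)
        target = (r_i - r_j).abs() + gamma * next_dist
    latent_dist = torch.norm(z_i - z_j, dim=-1)
    return ((latent_dist - target) ** 2).mean()

obs = torch.randn(16, obs_dim)
rewards = torch.randn(16)
perm = torch.randperm(16)                      # compare random pairs of transitions
loss = bisimulation_loss(obs, obs[perm], rewards, rewards[perm])
```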

10.1109/tnnls.2023.3296642 article EN IEEE Transactions on Neural Networks and Learning Systems 2023-08-15