Yu Liu

ORCID: 0000-0001-9107-6390
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Reinforcement Learning in Robotics
  • Silicone and Siloxane Chemistry
  • Muscle activation and electromyography studies
  • Adversarial Robustness in Machine Learning
  • Robot Manipulation and Learning
  • Hydraulic and Pneumatic Systems
  • Space Satellite Systems and Control
  • Robotic Mechanisms and Dynamics
  • Real-time simulation and control systems
  • Evolutionary Algorithms and Applications
  • Soft Robotics and Applications
  • Building Energy and Comfort Optimization
  • EEG and Brain-Computer Interfaces
  • Vehicle Dynamics and Control Systems
  • Autonomous Vehicle Technology and Safety
  • Teleoperation and Haptic Systems
  • Advanced Measurement and Metrology Techniques
  • Anaerobic Digestion and Biogas Production
  • Vehicle Noise and Vibration Control
  • Climate change and permafrost
  • Neuroscience and Neural Engineering
  • Membrane Separation and Gas Transport
  • Planetary Science and Exploration
  • Force Microscopy Techniques and Applications
  • Infection Control and Ventilation

Harbin Institute of Technology
2013-2024

Dalian University of Technology
2021-2024

Shanghai Artificial Intelligence Laboratory
2023-2024

Heilongjiang Institute of Technology
2021-2023

Group Sense (China)
2023

Ministry of Industry and Information Technology
2020-2022

East China Jiaotong University
2022

Shenyang University of Technology
2022

Shenyang Institute of Automation
2022

Chinese Academy of Sciences
2021

Multi-agent reinforcement learning (MARL) suffers from the non-stationarity problem, which is ever-changing targets at every iteration when multiple agents update their policies same time. Starting first principle, in this paper, we manage to solve problem by proposing bidirectional action-dependent Q-learning (ACE). Central development of ACE sequential decision making process wherein only one agent allowed take action Within process, each maximizes its value function given actions taken...

10.1609/aaai.v37i7.26028 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2023-06-26

Numerous tissue-engineered constructs have been investigated as bone scaffolds in regenerative medicine. However, it remains challenging to non-invasively monitor the biodegradation and remodeling of grafts after implantation. Herein, silk fibroin/hydroxyapatite incorporated with ultrasmall superparamagnetic iron oxide (USPIO) nanoparticles were successfully synthesized, characterized, implanted subcutaneously into back nude mice. The USPIO labeled showed good three-dimensional porous...

10.3389/fbioe.2020.00697 article EN cc-by Frontiers in Bioengineering and Biotechnology 2020-06-30

Current constrained reinforcement learning (RL) methods guarantee constraint satisfaction only in expectation, which is inadequate for safety-critical decision problems. Since a satisfied expectation remains high probability of exceeding the cost threshold, solving RL problems with probabilities critical safety. In this work, we consider safety criterion as on conditional value-at-risk (CVaR) cumulative costs, and propose CVaR-constrained policy optimization algorithm (CVaR-CPO) to maximize...

10.1109/tnnls.2023.3331304 article EN IEEE Transactions on Neural Networks and Learning Systems 2024-01-01

The Cartesian path planning of free-floating space robot is much more complex than that fixed-based manipulators, since the end-effector pose (position and orientation) dependent, position-level kinematic equations can not be used to determine joint angles. In this paper, a method based on particle swarm optimization (PSO) proposed solve problem. Firstly, we parameterize trajectory using polynomial functions, then normalize parameterized trajectory. Secondly, transformed an problem by...

10.5772/5605 article EN cc-by International Journal of Advanced Robotic Systems 2008-09-01

Efficient and effective exploration in continuous space is a central problem applying reinforcement learning (RL) to autonomous driving. Skills learned from expert demonstrations or designed for specific tasks can benefit the exploration, but they are usually costly-collected, unbalanced/suboptimal, failing transfer diverse tasks. However, human drivers adapt varied driving without by taking efficient structural explorations entire skill rather than limited with task-specific skills....

10.1109/iros55552.2023.10341449 article EN 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2023-10-01

In this paper, we focus on the integration of motion planning and control for high-performance automated vehicles. We propose a novel tube-based nonlinear model predictive (TNMPC) scheme that combines strengths MPC boundary layer sliding (BLSC) to drive controlled vehicle along real-time trajectory while simultaneously ensuring practical satisfaction constraints. Specifically, our approach uses an extended kinematic with slip angle parameters provide explicit expressions modeling error,...

10.1109/tiv.2023.3342306 article EN IEEE Transactions on Intelligent Vehicles 2023-12-13

Four-wheel, independently driven skid-steer mobile robots have been widely used in some fields, such as indoor product shipping and outdoor inspection exploration. Traditional robot controllers often use a kinematics controller to obtain the desired speed of each wheel, complete closed-loop control wheel achieve robot’s trajectory tracking control. However, based on may lead chattering spin from being directly by motor uneven grounds. To solve these problems, we developed four-wheel, with...

10.3390/electronics12030699 article EN Electronics 2023-01-31

Deep learning (DL) has made tremendous contributions to image processing. Recently, the DL also attracted attention in specialized field of neural decoding from raw myoelectric signals (electromyograms, EMGs). However, our knowledge, most existing methods require some measure preprocessing EMGs. Moreover, research date not accounted for variability signal during time sequences. In this paper, we propose a new convolutional network (CNN) structure that can directly process EMG hand gesture...

10.1142/s0219843619500130 article EN International Journal of Humanoid Robotics 2019-06-27

Cable-Constrained Synchronous Rotating Mechanism (CCSRM) has an important application prospect in the field of cable-driven robots, which can greatly reduce number driving motors while ensuring light and slender body. However, there are obvious cable friction effect elastic deformation CCSRM. These nonlinear characteristics have a significant impact on synchronous motion performance. In this letter, model CCSRM considering is proposed, integrates effects pretension, deformation, between...

10.1109/lra.2020.3007418 article EN IEEE Robotics and Automation Letters 2020-07-07

In this work we introduce reinforcement learning techniques for solving lexicographic multi-objective problems. These are problems that involve multiple reward signals, and where the goal is to learn a policy maximises first signal, subject constraint also second so on. We present family of both action-value gradient algorithms can be used solve such problems, prove they converge policies lexicographically optimal. evaluate scalability performance these empirically, demonstrating their...

10.24963/ijcai.2022/476 article EN Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence 2022-07-01

Benefiting from the powerful learning capacity of deep (DP), a general model for predicting 3-DOF wrist movements could achieve good prediction accuracy those subjects even not involved in training. This paper tends to verify this assumption. Since quantity training dataset DP largely influences model's performance, limited collected number should be extended first through some data-augmentation approaches. In paper, we summarized possible mistakes happen EMG data-collection procedures....

10.1109/robio.2018.8664790 article EN 2021 IEEE International Conference on Robotics and Biomimetics (ROBIO) 2018-12-01

The reaction wheel bicycle robot is a kind of unmanned mobile with great potential. However, the control such robots on curved pavement under inaccurate model parameters, uncertainties and disturbances challenging due to lateral instability udneractuated characteristic. Applying conventional methods this problem often results in brittle controllers. In paper, an online serial-parallel combination reinforcement learning designed achieve path tracking banlancing for pavements. parallel part...

10.1109/access.2023.3268524 article EN cc-by-nc-nd IEEE Access 2023-01-01

Abstract Recent studies have found that deep learning models are vulnerable to adversarial examples, demonstrating applying a certain imperceptible perturbation on clean examples can effectively deceive the well-trained and high-accuracy models. Moreover, achieve considerable level of certainty with attacked label. In contrast, human could barely discern difference between which raised tremendous concern about robust trustworthy techniques. this survey, we reviewed existence, generation,...

10.1007/s10462-024-10841-z article EN cc-by Artificial Intelligence Review 2024-07-08

Fast response heat flux gauges relying on the semi-infinite conduction principle have commonly been used to study in impulse test facilities such as expansion tubes and reflected shock tunnels. For studying very harsh environments experienced at stagnation point of entry vehicles, generally metallic thermocouples are required survive loads which may be above ten megawatts per metre squared peak heating flight can upwards a hundred heavily scaled facility testing, challenging for even...

10.2514/6.2019-3007 article EN AIAA Aviation 2019 Forum 2019-06-14

The application requirements of the tendon-sheath mechanism in field precision machinery are becoming increasingly extensive. However, contact friction between tendon and sheath seriously affects transmission accuracy. In case unavoidable friction, optimizing path to reduce tension loss elastic deformation has become an important research direction. this article, influence law on displacement is obtained using two parameters related curvature path: total bending angle equivalent length....

10.1109/tro.2023.3255545 article EN IEEE Transactions on Robotics 2023-03-29

The paper presents a new calibration method of the parallel robot using genetic algorithm (GA) which has powerful global adaptive probabilistic search ability. In process calibration, calculation measurement residuals through inverse kinematics is adopted, greatly reduces burden and saves operation time. At last, in form simulation experiment, considering influence noise steward respectively calibrated its pose (position orientation) accuracy significant improvement.

10.1109/icma.2007.4303948 article EN International Conference on Mechatronics and Automation 2007-08-01
Coming Soon ...