NFDI4DS | UHH-SEMS - Publication Details

Qiyuan Zhang

ORCID: 0000-0002-8519-4259

About

Contact & Profiles

A5100739520

Research Areas

Reinforcement Learning in Robotics
Topic Modeling
Natural Language Processing Techniques
Pharmacological Effects of Medicinal Plants
Bone Metabolism and Diseases
Software Engineering Research
Photosynthetic Processes and Mechanisms
Data Stream Mining Techniques
Network Security and Intrusion Detection
Pharmacological Effects of Natural Compounds
Explainable Artificial Intelligence (XAI)
Stock Market Forecasting Methods
Distributed Control Multi-Agent Systems
Real-time simulation and control systems
Spinal Dysraphism and Malformations
Advanced Malware Detection Techniques
Adaptive Dynamic Programming Control
Advanced Multi-Objective Optimization Algorithms
Robotic Path Planning Algorithms
Pregnancy-related medical research
Digital Media and Visual Art
Robotic Locomotion and Control
Traditional Chinese Medicine Analysis
Energy Efficient Wireless Sensor Networks
Autonomous Vehicle Technology and Safety

Jilin Agricultural University
2024

Northeast Agricultural University
2022-2024

Harbin Institute of Technology
2021-2024

Singapore Management University
2022

Lanzhou University
2022

Xidian University
2022

University of Electronic Science and Technology of China
2021

University of Shanghai for Science and Technology
2021

University of Science and Technology of China
2009-2010

Beijing National Laboratory for Molecular Sciences
1998

MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers

OPENALEX - Publications

Yihuai Lan Lei Wang Qiyuan Zhang Yunshi Lan Bing Tian Dai and 3 more

While Math Word Problem (MWP) solving has emerged as a popular field of study and made great progress in recent years, most existing methods are benchmarked solely on one or two datasets and implemented with different configurations. In this paper, we introduce the first open-source library for solving MWPs, called MWPToolkit, which provides a unified, comprehensive, and extensible framework for research purposes. Specifically, we deploy 17 deep learning-based MWP solvers and 6 MWP datasets in our toolkit. These are advanced models for MWP solving,...

10.1609/aaai.v36i11.21723 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2022-06-28

CVaR-Constrained Policy Optimization for Safe Reinforcement Learning

Qiyuan Zhang Shu Leng Xiaoteng Ma Qihan Liu Xueqian Wang and 3 more

Current constrained reinforcement learning (RL) methods guarantee constraint satisfaction only in expectation, which is inadequate for safety-critical decision problems. Since a constraint satisfied in expectation still leaves a high probability of exceeding the cost threshold, solving constrained RL problems with high probabilities is critical for RL safety. In this work, we consider the safety criterion as a constraint on the conditional value-at-risk (CVaR) of cumulative costs, and propose the CVaR-constrained policy optimization algorithm (CVaR-CPO) to maximize...

10.1109/tnnls.2023.3331304 article EN IEEE Transactions on Neural Networks and Learning Systems 2024-01-01

MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers

Yihuai Lan Lei Wang Qiyuan Zhang Yunshi Lan Bing Tian Dai and 3 more

Developing automatic Math Word Problem (MWP) solvers has been an interest of NLP researchers since the 1960s. Over the last few years, a growing number of datasets and deep learning-based methods have been proposed for effectively solving MWPs. However, most existing methods are benchmarked solely on one or two datasets, varying in different configurations, which leads to a lack of unified, standardized, fair, and comprehensive comparison between methods. This paper presents MWPToolkit, the first open-source framework for solving MWPs. In MWPToolkit, we...

10.48550/arxiv.2109.00799 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Acetyl‐11‐keto‐beta‐boswellic acid modulates macrophage polarization and Schwann cell migration to accelerate spinal cord injury repair in rats

Yao Wang Zongliang Xiong Yuncong Qiao Qiyuan Zhang Guanghu Zhou and 4 more

Background: Inhibiting secondary inflammatory damage caused by glial cells and creating a stable microenvironment is one of the main strategies to investigate drugs for the treatment of spinal cord injury. Acetyl‐11‐keto‐beta‐boswellic acid (AKBA) is the active component of the natural drug boswellia, which has anti‐inflammatory and antioxidant effects and offers a possible therapeutic option for spinal cord injury. Methods: In this study, a spinal cord injury model was established by crushing the spinal cord, respectively, to detect the M1 macrophage markers: iNOS, TNF‐α,...

10.1111/cns.14642 article EN cc-by CNS Neuroscience & Therapeutics 2024-03-01

NOAHQA: Numerical Reasoning with Interpretable Graph Question Answering Dataset

Qiyuan Zhang Lei Wang Sicheng Yu Shuohang Wang Yang Wang and 2 more

While diverse question answering (QA) datasets have been proposed and contributed significantly to the development of deep learning models for QA tasks, the existing datasets fall short in two aspects. First, we lack QA datasets covering complex questions that involve answers as well as the reasoning processes to get them. As a result, the state-of-the-art QA research on numerical reasoning still focuses on simple calculations and does not provide the mathematical expressions or evidence justifying the answers. Second, the QA community has contributed a lot of effort to improve...

10.18653/v1/2021.findings-emnlp.350 preprint EN cc-by 2021-01-01

Research on the Genetic Improvement Effects of Lodging Resistance-Related Traits in Maize Core Germplasm

Kaizhi Yang Jianjun Tan Qiyuan Zhang Tianhang Bai Sheng Zhou and 4 more

Lodging is a key factor affecting maize yield and harvestability. This study utilized Reid population baselines and their improved lines as female parents and No-Reid lines as male parents to form 48 incomplete diallel crosses. The genetic improvement effects, combining ability, and heterosis of three lodging resistance-related traits (stem tension, puncture strength, and crushing strength at the third internode) were analyzed. Regarding genetic improvement, the results indicated that all three traits were significantly improved in the improved lines compared to the baselines, with...

10.3390/agronomy15010017 article EN cc-by Agronomy 2024-12-26

Acetyl-11-Keto-β-Boswellic Acid Accelerates the Repair of Spinal Cord Injury in Rats by Resisting Neuronal Pyroptosis with Nrf2

Yao Wang Zongliang Xiong Qiyuan Zhang Mengmeng Liu Jingjing Zhang and 3 more

The primary aim of this study is to delve into the potential of Acetyl-11-keto-β-boswellic acid (AKBA) in ameliorating neuronal damage induced by acute spinal cord injury, as well as to unravel the intricate underlying mechanisms. A cohort of 40 Sprague-Dawley rats was meticulously categorized into four groups. Following a seven-day oral administration of AKBA, damaged spinal cord samples were procured for Nissl staining and electron microscopy to assess neuronal demise. Employing ELISA, immunofluorescence, Western blot (WB), and quantitative...

10.3390/ijms25010358 article EN International Journal of Molecular Sciences 2023-12-26

AKBA Promotes Axonal Regeneration via RhoA/Rictor to Repair Damaged Sciatic Nerve

Yao Wang Zongliang Xiong Chong Zhou Qiyuan Zhang Shuang Liu and 3 more

The existing studies by our team demonstrated the pro-recovery effect of 3-Acetyl-11-keto-beta-boswellic acid (AKBA) on sciatic nerve injury. To further investigate the role of AKBA in peripheral nerve injury repair, the TMT quantitative proteomics technique was used to obtain differentially significant proteins in the Sham group, the Model group, and the AKBA group. After that, three time points (5, 14, and 28 d) and four groups (Sham + AKBA, Sham, Model, and AKBA) were set up, and immunoblotting, immunofluorescence, and cellular assays were applied...

10.3390/ijms232415903 article EN International Journal of Molecular Sciences 2022-12-14

Distributed Node Authentication in Wireless Sensor Networks

Qiyuan Zhang Xuehai Zhou Feng Yang

Authentication is necessary not only to data exchange processes but also to network administrative tasks in wireless sensor networks. So far, authentication has drawn much attention from the research community. We propose an efficient distributed node authentication (DNA) scheme that utilizes the geographic location and the trust relationship among neighboring nodes to authenticate the identity of nodes. Analysis and simulation results demonstrate that DNA is robust against node failure/sleeping and resilient to attackers.

10.1109/wicom.2009.5301523 article EN 2009-09-01

Natural Residual Reinforcement Learning for Bicycle Robot Control

Xianjin Zhu Xudong Zheng Qiyuan Zhang Zhang Chen Yu Liu and 1 more

This work focuses on motion control of the bicycle robot using the proposed NRRL algorithm. Unlike the traditional RL algorithm, NRRL decomposes the main tasks into subtasks manually and introduces qualitative prior knowledge to the agent. Simulation results show that the algorithm achieves better performance and sample efficiency in terms of balance and path tracking of the bicycle robot. It is believed that the algorithm is applicable to the real physical robot, and deployment will be realized soon, as the robot has now been constructed.

10.1109/icma52036.2021.9512587 article EN 2021 IEEE International Conference on Mechatronics and Automation (ICMA) 2021-08-08

A Non-cooperative Game Approach for Intrusion Detection in Smartphone Systems

Feng Yang Xuehai Zhou Gangyong Jia Qiyuan Zhang

In this paper, we propose an intrusion detection framework for smartphone systems. We formulate the intrusion detection problem as a two-player, non-cooperative, complete-information, constant-sum game. The attacker and the security server are the players of the game. The attacker wants to maximize the value of the system, but the security server wants to minimize it. We present the Nash equilibrium that leads to a defense strategy for the server. We implement the framework, and the results show that the proposed framework is better than traditional ones.

10.1109/cnsr.2010.24 article EN 2010-01-01

Learning to Discover Task-Relevant Features for Interpretable Reinforcement Learning

Qiyuan Zhang Xiaoteng Ma Yiqin Yang Chenghao Li Jun Yang and 2 more

Reinforcement Learning (RL) agents are often fed with large-dimensional observations to achieve the ideal performance in complex environments. Unfortunately, the massive observation space usually contains useless or even adverse features, which leads to low sample efficiency. Existing methods rely on domain knowledge and cross-validation to discover efficient features informative for decision-making. To minimize the impact of prior knowledge, we propose a temporal-adaptive feature attention algorithm...

10.1109/lra.2021.3091885 article EN IEEE Robotics and Automation Letters 2021-07-21

APD: Learning Diverse Behaviors for Reinforcement Learning Through Unsupervised Active Pre-Training

Kailin Zeng Qiyuan Zhang Bin Chen Bin Liang Jun Yang

Unsupervised pre-training in reinforcement learning enables the agent to gain prior environmental knowledge, which is then fine-tuned in the supervised stage to quickly adapt to various downstream tasks. In the absence of task-related rewards, pre-training aims to acquire policies (i.e., behaviors) that generate different trajectories to explore and master the environment. Previous research categorizes states into their associated behaviors by a discriminator. However, an underlying problem persists: such a discriminator is trained...

10.1109/lra.2022.3214057 article EN IEEE Robotics and Automation Letters 2022-10-01

Collaborative Performance Prediction for Large Language Models

Qiyuan Zhang Fuyuan Lyu Xue Liu Chen Ma

10.18653/v1/2024.emnlp-main.150 article EN Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing 2024-01-01

Collaborative Performance Prediction for Large Language Models

Qiyuan Zhang Fuyuan Lyu Xue Liu Chen Ma

Comprehensively understanding and accurately predicting the performance of large language models across diverse downstream tasks has emerged as a pivotal challenge in NLP research. The pioneering scaling law on downstream works demonstrated intrinsic similarities within model families and utilized such similarities for performance prediction. However, they tend to overlook the similarities between model families and only consider the design factors listed in the original scaling law. To overcome these limitations, we introduce a novel framework, Collaborative Performance Prediction...

10.48550/arxiv.2407.01300 preprint EN arXiv (Cornell University) 2024-07-01

Boosting denoisers with reinforcement learning for image restoration

Jie Zhang Qiyuan Zhang Xixuan Zhao Jiangming Kan

10.1007/s00500-022-06840-3 article EN Soft Computing 2022-02-20

An Unknown Protocol Identification Method for Industrial Internet

Xinghui Zhu Ziheng Jiang Qiyuan Zhang Shuangrui Zhao Zhiwei Zhang

This paper focuses on the problem of protocol identification in the industrial internet and proposes an unknown protocol identification method. We first establish a detection model to classify known protocols, unknown protocols, and interference signals, and then store the unknown protocols for manual analysis. Based on the Eps-neighborhood idea, we further develop an Eps-neighborhood hit algorithm and propose a method to identify unknown protocols, where supervised learning of unknown protocol classification is realized. Finally, extensive experimental results are provided to illustrate our theoretical findings. It indicates that...

10.1155/2022/3792205 article EN Wireless Communications and Mobile Computing 2022-09-05

Energy transfer kinetics of phycoerythrocyanins (PECs) from the cyanobacterium Anabaena variabilis (I)

Jingmin Zhang Jianping Zhang Zixuan Yang Jingquan Zhao Lijin Jiang and 3 more

10.1007/bf02877730 article EN Science in China Series B Chemistry 1997-06-01

Sim-to-Real Transfer with Action Mapping and State Prediction for Robot Motion Control

Xianjin Zhu Xudong Zheng Qiyuan Zhang Zhang Chen Yu Liu and 1 more

Deep reinforcement learning (DRL) has been proved to be a very promising method for robot motion control. However, it usually needs a large number of samples in training, which restricts its application to real-world robots. Sim-to-Real means transferring training strategies from simulation to reality, and it has become one of the hottest research areas in recent years. Embodiment of DRL algorithms is a necessary step in Sim-to-Real, and it is faced with many challenges, such as poor sample efficiency, wear and tear of robots, safety, etc....

10.1109/acirs52449.2021.9519311 article EN 2021-07-16

Multi-Agent Reinforcement Learning for Distributed Cooperative Targets Search

Yang Sun Zhenning Wu Qiyuan Zhang Zongying Shi Yisheng Zhong

Many real-world applications require intelligent agents to coordinate with other agents in complex environments. As a stepping stone to this goal, the domain of multi-agent cooperative search has emerged as an important challenge for swarm intelligence research, owing to its endogenous complicated behavior mechanism and its interaction with an uncertain and dynamic environment. We choose to address this challenge by proposing a decentralized multi-agent reinforcement learning method, DELTAS, for a team of unmanned aerial vehicles to perform an efficient search of an area of interest....

10.1109/icus52573.2021.9641124 article EN 2021 IEEE International Conference on Unmanned Systems (ICUS) 2021-10-15

Energy transfer kinetics of phycoerythrocyanin trimer from cyanobacterium Anabaena variabilis (II)

Jingmin Zhang Jingquan Zhao Lijin Jiang Jian‐Xin Chen Tong Ye and 1 more

10.1007/bf02882717 article EN Science in China Series C Life Sciences 1998-04-01

Research Progress of Tongue Image Segmentation through Artificial Intelligence and Deep Learning

Rui-Yang Peng Yichi Li Yingying Rui Qiyuan Zhang Xinyue Song and 2 more

In recent years, the objectification of the tongue image in Traditional Chinese Medicine (TCM) has become a topic of extensive discussion. As an important reference index in TCM diagnosis, tongue image features play an important role in disease prevention and diagnosis. Among them, how to achieve the standardization and accurate segmentation of the tongue image and the extraction of quantitative feature information is a key issue that restricts the objective development of TCM diagnosis. This article mainly reviews two aspects: color processing and segmentation methods. Review of currently widely used methods,...

10.1109/aeeca52519.2021.9574273 article EN 2021 IEEE International Conference on Advances in Electrical Engineering and Computer Applications (AEECA) 2021-08-27

Acetyl-11-Keto-Beta-Boswellic Acid Activates the Nrf2/HO-1 Signaling Pathway in Schwann Cells to Reduce Oxidative Stress and Promote Sciatic Nerve Injury Repair

Chong Zhou Yao Wang Qiyuan Zhang Guanghu Zhou Xianglin Ma and 2 more

Boswellia is a traditional medicine for bruises and injuries. Its main active ingredient, acetyl-11-keto-beta-boswellic acid, has antioxidant and antiapoptotic effects. In this experiment, we used Sprague-Dawley rats to make a sciatic nerve injury model and to detect the transcription factor NF-E2-related factor 2/heme oxygenase 1 signaling pathway and apoptosis, combined with clinical indicators, testing whether acetyl-11-keto-beta-boswellic acid can reduce oxidative stress and promote sciatic nerve injury repair. Our results showed that administration of acetyl-11-keto-beta-boswellic acid promoted myelin...

10.1055/a-2148-7427 article EN Planta Medica 2023-08-04

Research on Anomaly Detection Method of Non-Standard Machinery Production Process Based on Semi-Supervised Learning

Longlong Lv Helong Qiu Huihui Wu Jun Dou Qiang Cao and 3 more

Non-standard machinery refers to customized machinery produced to meet specific customer demands. The mainstream research direction in data stream anomaly detection focuses on deep learning, which involves learning the data distribution through a large amount of training data. However, non-standard equipment has the characteristics of small production scale and sparse samples, making it difficult to obtain sufficient annotated data sets. This inadequacy results in the model not learning enough, thereby rendering it unable to effectively detect abnormal...

10.33969/j-nana.2023.030204 article EN Journal of Networking and Network Applications 2023-01-01
