Zhiyuan Xu

ORCID: 0000-0003-2879-3244
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Neural Network Applications
  • Cloud Computing and Resource Management
  • Software-Defined Networks and 5G
  • Reinforcement Learning in Robotics
  • Domain Adaptation and Few-Shot Learning
  • Data Stream Mining Techniques
  • Advanced Image and Video Retrieval Techniques
  • Human Pose and Action Recognition
  • Adversarial Robustness in Machine Learning
  • Geological and Geochemical Analysis
  • Ideological and Political Education
  • Geoscience and Mining Technology
  • Advanced Vision and Imaging
  • Geomechanics and Mining Engineering
  • Multimodal Machine Learning Applications
  • Advanced MIMO Systems Optimization
  • 3D Shape Modeling and Analysis
  • Optimization and Search Problems
  • Image Processing Techniques and Applications
  • Advanced Image Processing Techniques
  • Age of Information Optimization
  • IoT and Edge/Fog Computing
  • Indoor and Outdoor Localization Technologies
  • Adaptive Dynamic Programming Control
  • earthquake and tectonic studies

University of Chinese Academy of Sciences
2024

Midea Group (China)
2021-2024

Syracuse University
2017-2024

Beijing Institute of Technology
2024

SAIC-GM (China)
2024

Southern University of Science and Technology
2024

Beijing Advanced Sciences and Innovation Center
2024

University of Electronic Science and Technology of China
2013-2023

Hainan University
2020

Beijing University of Posts and Telecommunications
2020

Modern communication networks have become very complicated and highly dynamic, which makes them hard to model, predict control. In this paper, we develop a novel experience-driven approach that can learn well control network from its own experience rather than an accurate mathematical just as human learns new skill (such driving, swimming, etc). Specifically, we, for the first time, propose leverage emerging Deep Reinforcement Learning (DRL) enabling model-free in networks; present effective...

10.1109/infocom.2018.8485853 article EN IEEE INFOCOM 2022 - IEEE Conference on Computer Communications 2018-04-01

In this paper, we propose to leverage the emerging deep learning techniques for spatiotemporal modeling and prediction in cellular networks, based on big system data. First, perform a preliminary analysis dataset from China Mobile, use traffic load as an example show non-zero temporal autocorrelation spatial correlation among neighboring Base Stations (BSs), which motivate us discover both dependencies our study. Then present hybrid model prediction, includes novel autoencoder-based Long...

10.1109/infocom.2017.8057090 article EN IEEE INFOCOM 2022 - IEEE Conference on Computer Communications 2017-05-01

Cloud Radio Access Networks (RANs) have become a key enabling technique for the next generation (5G) wireless communications, which can meet requirements of massively growing data traffic. However, resource allocation in cloud RANs still needs to be further improved order reach objective minimizing power consumption and meeting demands users over long operational period. Inspired by success Deep Reinforcement Learning (DRL) on solving complicated control problems, we present novel DRL-based...

10.1109/icc.2017.7997286 article EN 2017-05-01

Automatic decision-making approaches, such as reinforcement learning (RL), have been applied to (partially) solve the resource allocation problem adaptively in cloud computing system. However, a complete framework exhibits high dimensions state and action spaces, which prohibit usefulness of traditional RL techniques. In addition, power consumption has become one critical concerns design control systems, degrades system reliability increases cooling cost. An effective dynamic management...

10.1109/icdcs.2017.123 preprint EN 2017-06-01

Structured weight pruning is a representative model compression technique of DNNs to reduce the storage and computation requirements accelerate inference. An automatic hyperparameter determination process necessary due large number flexible hyperparameters. This work proposes AutoCompress, an structured framework with following key performance improvements: (i) effectively incorporate combination schemes in process; (ii) adopt state-of-art ADMM-based as core algorithm, propose innovative...

10.1609/aaai.v34i04.5924 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

In this paper, we aim to study networking problems from a whole new perspective by leveraging emerging deep learning, develop an experience-driven approach, which enables network or protocol learn the best way control itself its own experience (e.g., runtime statistics data), just as human learns skill. We present design, implementation and evaluation of reinforcement learning (DRL)-based framework, DRL-CC (DRL for Congestion Control), realizes our design philosophy on multi-path TCP (MPTCP)...

10.1109/jsac.2019.2904358 article EN publisher-specific-oa IEEE Journal on Selected Areas in Communications 2019-03-11

10.1016/s0377-2217(99)00082-x article EN European Journal of Operational Research 2000-11-01

Mobile crowdsourcing (MCS) is now an important source of information for smart cities, especially with the help unmanned aerial vehicles (UAVs) and driverless cars. They are equipped different kinds high-precision sensors, can be scheduled/controlled completely during data collection, which will make MCS system more robust. However, they limited to energy constraint, long-term, long-distance sensing tasks, cities almost too crowded set stationary charging station. Towards this end, in paper...

10.1109/tii.2017.2783439 article EN IEEE Transactions on Industrial Informatics 2017-12-14

In this paper, we propose the first deep reinforcement learning framework to estimate optimal Dynamic Treatment Regimes from observational medical data. This is more flexible and adaptive for high dimensional action state spaces than existing methods model real life complexity in heterogeneous disease progression treatment choices, with goal provide doctor patients data-driven personalized decision recommendations. The proposed contains a supervised step predict most possible expert actions;...

10.1109/ichi.2017.45 article EN 2017-08-01

The raw depth image captured by the indoor sen-sor usually has an extensive range of missing values due to inherent limitations such as inability perceive transparent objects and limited distance range. incomplete map burdens many downstream vision tasks, a rising number completion methods have been proposed alleviate this issue. While most existing meth-ods can generate accurate dense maps from sparse uniformly sampled maps, they are not suitable for complementing large contiguous regions...

10.1109/cvpr52688.2022.00611 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Despite the prominent success of general object detection, performance and efficiency Small Object Detection (SOD) are still unsatisfactory. Unlike existing works that struggle to balance tradeoff between inference speed SOD performance, in this paper, we propose a novel Scale-aware Knowledge Distillation (ScaleKD), which transfers knowledge complex teacher model compact student model. We design two modules boost quality transfer distillation for SOD: 1) scale-decoupled feature module...

10.1109/cvpr52729.2023.01889 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Automatic decision-making approaches, such as reinforcement learning (RL), have been applied to (partially) solve the resource allocation problem adaptively in cloud computing system. However, a complete framework exhibits high dimensions state and action spaces, which prohibit usefulness of traditional RL techniques. In addition, power consumption has become one critical concerns design control systems, degrades system reliability increases cooling cost. An effective dynamic management...

10.48550/arxiv.1703.04221 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Fine-grained instance segmentation is considerably more complicated and challenging than semantic segmentation. Most existing methods only focus on accuracy without paying much attention to inference latency, which, critical real-time applications, such as autonomous driving. In this paper, we aim bridge the gap between by presenting a novel model for segmentation, Sem2Ins, which effectively generates boundaries according leveraging conditional generative adversarial networks (cGANs) coupled...

10.1109/tmm.2021.3114541 article EN IEEE Transactions on Multimedia 2021-09-22

In this paper, we focus on general-purpose Distributed Stream Data Processing Systems (DSDPSs) , which deal with processing of unbounded streams continuous data at scale distributedly in real or near-real time. A fundamental problem a DSDPS is the scheduling (i.e., assigning workload to workers/machines) objective minimizing average end-to-end tuple widely-used solution distribute evenly over machines cluster round-robin manner, obviously not efficient due lack consideration for...

10.14778/3199517.3199521 article EN Proceedings of the VLDB Endowment 2018-02-01

Vision-based autonomous urban driving in dense traffic is quite challenging due to the complicated environment and dynamics of behaviors. Widely-applied methods either heavily rely on hand-crafted rules or learn from limited human experience, which makes them hard generalize rare but critical scenarios. In this paper, we present a novel CAscade Deep REinforcement learning framework, CADRE, achieve model-free vision-based driving. derive representative latent features raw observations, first...

10.1609/aaai.v36i3.20259 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2022-06-28

In this paper, we focus on general-purpose Distributed Stream Data Processing Systems (DSDPSs), which deal with processing of unbounded streams continuous data at scale distributedly in real or near-real time. A fundamental problem a DSDPS is the scheduling (i.e., assigning workload to workers/machines) objective minimizing average end-to-end tuple widely-used solution distribute evenly over machines cluster round-robin manner, obviously not efficient due lack consideration for communication...

10.5555/3199517.3199521 article EN Very Large Data Bases 2018-02-01

In this paper, we focus on general-purpose Distributed Stream Data Processing Systems (DSDPSs), which deal with processing of unbounded streams continuous data at scale distributedly in real or near-real time. A fundamental problem a DSDPS is the scheduling objective minimizing average end-to-end tuple widely-used solution to distribute workload evenly over machines cluster round-robin manner, obviously not efficient due lack consideration for communication delay. Model-based approaches do...

10.14778/3184470.3184474 article EN Proceedings of the VLDB Endowment 2018-02-01

Meta-learning enables a model to learn from very limited data undertake new task. In this paper, we study the general meta-learning with adversarial samples. We present algorithm, ADML (ADversarial Meta-Learner), which leverages clean and samples optimize initialization of learning in an manner. leads following desirable properties: 1) it turns out be effective even cases only samples; 2) is robust samples, i.e., unlike other algorithms, minor performance degradation when there are 3) sheds...

10.48550/arxiv.1806.03316 preprint EN other-oa arXiv (Cornell University) 2018-01-01

This paper presents the deep reinforcement learning (DRL) framework to estimate optimal Dynamic Treatment Regimes from observational medical data. is more flexible and adaptive for high dimensional action state spaces than existing methods model real-life complexity in heterogeneous disease progression treatment choices, with goal of providing doctors patients data-driven personalized decision recommendations. The proposed DRL comprises (i) a supervised step predict expert actions, (ii)...

10.1038/s41598-018-37142-0 article EN cc-by Scientific Reports 2019-02-06

Channel pruning can effectively reduce both computational cost and memory footprint of the original network while keeping a comparable accuracy performance. Though great success has been achieved in channel for 2D image-based convolutional networks (CNNs), existing works seldom extend methods to 3D point-based neural (PNNs). Directly implementing CNN PNNs undermine performance because different representations images point clouds as well architecture disparity. In this paper, we proposed CP...

10.1109/cvpr52729.2023.00513 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Modern communication networks have become very complicated and highly dynamic, which makes them hard to model, predict control. In this paper, we develop a novel experience-driven approach that can learn well control network from its own experience rather than an accurate mathematical just as human learns new skill (such driving, swimming, etc). Specifically, we, for the first time, propose leverage emerging Deep Reinforcement Learning (DRL) enabling model-free in networks; present effective...

10.48550/arxiv.1801.05757 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Video question answering (VideoQA) is a very important but challenging multimedia task, which automatically analyzes questions and videos generates accurate answers. However, research on VideoQA still in its infancy. In this article, we propose novel memory augmented deep recurrent neural network (MA-DRNN) model for VideoQA, features new method encoding questions, augmentation using the emerging differentiable computer (DNC). Specifically, encode textual (questions) information before visual...

10.1109/tnnls.2019.2938015 article EN IEEE Transactions on Neural Networks and Learning Systems 2019-09-20

Experience-driven networking has emerged as a new and highly effective approach for resource allocation in complex communication networks. Deep Reinforcement Learning (DRL) been shown to be useful technique enabling experience-driven networking. In this paper, we focus on practical fundamental problem networking: when network configurations are changed, how train DRL agent effectively quickly adapt the environment. We present an Actor-Critic-based Transfer learning framework Traffic...

10.1109/tnet.2020.3037231 article EN publisher-specific-oa IEEE/ACM Transactions on Networking 2020-01-01
Coming Soon ...