NFDI4DS | UHH-SEMS - Publication Details

Experience-driven Networking: A Deep Reinforcement Learning based Approach

OPENALEX - Publications

Zhiyuan Xu Jian Tang Jingsong Meng Wei Zhang Yanzhi Wang and 2 more

Modern communication networks have become very complicated and highly dynamic, which makes them hard to model, predict control. In this paper, we develop a novel experience-driven approach that can learn well control network from its own experience rather than an accurate mathematical just as human learns new skill (such driving, swimming, etc). Specifically, we, for the first time, propose leverage emerging Deep Reinforcement Learning (DRL) enabling model-free in networks; present effective...

10.1109/infocom.2018.8485853 article EN IEEE INFOCOM 2022 - IEEE Conference on Computer Communications 2018-04-01

Spatiotemporal modeling and prediction in cellular networks: A big data enabled deep learning approach

OPENALEX - Publications

Jing Wang Jian Tang Zhiyuan Xu Yanzhi Wang Guoliang Xue and 2 more

In this paper, we propose to leverage the emerging deep learning techniques for spatiotemporal modeling and prediction in cellular networks, based on big system data. First, perform a preliminary analysis dataset from China Mobile, use traffic load as an example show non-zero temporal autocorrelation spatial correlation among neighboring Base Stations (BSs), which motivate us discover both dependencies our study. Then present hybrid model prediction, includes novel autoencoder-based Long...

10.1109/infocom.2017.8057090 article EN IEEE INFOCOM 2022 - IEEE Conference on Computer Communications 2017-05-01

A deep reinforcement learning based framework for power-efficient resource allocation in cloud RANs

OPENALEX - Publications

Zhiyuan Xu Yanzhi Wang Jian Tang Jing Wang M. Cenk Gursoy

Cloud Radio Access Networks (RANs) have become a key enabling technique for the next generation (5G) wireless communications, which can meet requirements of massively growing data traffic. However, resource allocation in cloud RANs still needs to be further improved order reach objective minimizing power consumption and meeting demands users over long operational period. Inspired by success Deep Reinforcement Learning (DRL) on solving complicated control problems, we present novel DRL-based...

10.1109/icc.2017.7997286 article EN 2017-05-01

A Hierarchical Framework of Cloud Resource Allocation and Power Management Using Deep Reinforcement Learning

OPENALEX - Publications

Ning Liu Zhe Li Jielong Xu Zhiyuan Xu Sheng Lin and 3 more

Automatic decision-making approaches, such as reinforcement learning (RL), have been applied to (partially) solve the resource allocation problem adaptively in cloud computing system. However, a complete framework exhibits high dimensions state and action spaces, which prohibit usefulness of traditional RL techniques. In addition, power consumption has become one critical concerns design control systems, degrades system reliability increases cooling cost. An effective dynamic management...

10.1109/icdcs.2017.123 preprint EN 2017-06-01

AutoCompress: An Automatic DNN Structured Pruning Framework for Ultra-High Compression Rates

OPENALEX - Publications

Ning Liu Xiaolong Ma Zhiyuan Xu Yanzhi Wang Jian Tang and 1 more

Structured weight pruning is a representative model compression technique of DNNs to reduce the storage and computation requirements accelerate inference. An automatic hyperparameter determination process necessary due large number flexible hyperparameters. This work proposes AutoCompress, an structured framework with following key performance improvements: (i) effectively incorporate combination schemes in process; (ii) adopt state-of-art ADMM-based as core algorithm, propose innovative...

10.1609/aaai.v34i04.5924 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Experience-Driven Congestion Control: When Multi-Path TCP Meets Deep Reinforcement Learning

OPENALEX - Publications

Zhiyuan Xu Jian Tang Chengxiang Yin Yanzhi Wang Guoliang Xue

In this paper, we aim to study networking problems from a whole new perspective by leveraging emerging deep learning, develop an experience-driven approach, which enables network or protocol learn the best way control itself its own experience (e.g., runtime statistics data), just as human learns skill. We present design, implementation and evaluation of reinforcement learning (DRL)-based framework, DRL-CC (DRL for Congestion Control), realizes our design philosophy on multi-path TCP (MPTCP)...

10.1109/jsac.2019.2904358 article EN publisher-specific-oa IEEE Journal on Selected Areas in Communications 2019-03-11

On consistency of the weighted geometric mean complex judgement matrix in AHP

OPENALEX - Publications

Zhiyuan Xu

10.1016/s0377-2217(99)00082-x article EN European Journal of Operational Research 2000-11-01

Two contrasting eclogite cooling histories, North Qaidam HP/UHP terrane, western China: Petrological and isotopic constraints

OPENALEX - Publications

J.X. Zhang Jianfeng Yang C. G. Mattinson Zhiyuan Xu Fancheng Meng and 1 more

10.1016/j.lithos.2005.02.002 article EN Lithos 2005-04-09

Learning-Based Energy-Efficient Data Collection by Unmanned Vehicles in Smart Cities

OPENALEX - Publications

Bo Zhang Chi Harold Liu Jian Tang Zhiyuan Xu Jian Ma and 1 more

Mobile crowdsourcing (MCS) is now an important source of information for smart cities, especially with the help unmanned aerial vehicles (UAVs) and driverless cars. They are equipped different kinds high-precision sensors, can be scheduled/controlled completely during data collection, which will make MCS system more robust. However, they limited to energy constraint, long-term, long-distance sensing tasks, cities almost too crowded set stationary charging station. Towards this end, in paper...

10.1109/tii.2017.2783439 article EN IEEE Transactions on Industrial Informatics 2017-12-14

Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data

OPENALEX - Publications

Ying Liu Brent R. Logan Ning Liu Zhiyuan Xu Jian Tang and 1 more

In this paper, we propose the first deep reinforcement learning framework to estimate optimal Dynamic Treatment Regimes from observational medical data. This is more flexible and adaptive for high dimensional action state spaces than existing methods model real life complexity in heterogeneous disease progression treatment choices, with goal provide doctor patients data-driven personalized decision recommendations. The proposed contains a supervised step predict most possible expert actions;...

10.1109/ichi.2017.45 article EN 2017-08-01

RGB-Depth Fusion GAN for Indoor Depth Completion

OPENALEX - Publications

Haowen Wang Mingyuan Wang Zhengping Che Zhiyuan Xu Xiuquan Qiao and 3 more

The raw depth image captured by the indoor sen-sor usually has an extensive range of missing values due to inherent limitations such as inability perceive transparent objects and limited distance range. incomplete map burdens many downstream vision tasks, a rising number completion methods have been proposed alleviate this issue. While most existing meth-ods can generate accurate dense maps from sparse uniformly sampled maps, they are not suitable for complementing large contiguous regions...

10.1109/cvpr52688.2022.00611 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

ScaleKD: Distilling Scale-Aware Knowledge in Small Object Detector

OPENALEX - Publications

Yichen Zhu Qiqi Zhou Ning Liu Zhiyuan Xu Zhicai Ou and 2 more

Despite the prominent success of general object detection, performance and efficiency Small Object Detection (SOD) are still unsatisfactory. Unlike existing works that struggle to balance tradeoff between inference speed SOD performance, in this paper, we propose a novel Scale-aware Knowledge Distillation (ScaleKD), which transfers knowledge complex teacher model compact student model. We design two modules boost quality transfer distillation for SOD: 1) scale-decoupled feature module...

10.1109/cvpr52729.2023.01889 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language Model

OPENALEX - Publications

Yichen Zhu Minjie Zhu Ning Liu Zhiyuan Xu Yaxin Peng

10.1145/3688863.3689575 article EN 2024-10-26

A Hierarchical Framework of Cloud Resource Allocation and Power Management Using Deep Reinforcement Learning

OPENALEX - Publications

Ning Liu Zhe Li Zhiyuan Xu Jielong Xu Sheng Lin and 3 more

Automatic decision-making approaches, such as reinforcement learning (RL), have been applied to (partially) solve the resource allocation problem adaptively in cloud computing system. However, a complete framework exhibits high dimensions state and action spaces, which prohibit usefulness of traditional RL techniques. In addition, power consumption has become one critical concerns design control systems, degrades system reliability increases cooling cost. An effective dynamic management...

10.48550/arxiv.1703.04221 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Bridging the Gap Between Semantic Segmentation and Instance Segmentation

OPENALEX - Publications

Chengxiang Yin Jian Tang Tongtong Yuan Zhiyuan Xu Yanzhi Wang

Fine-grained instance segmentation is considerably more complicated and challenging than semantic segmentation. Most existing methods only focus on accuracy without paying much attention to inference latency, which, critical real-time applications, such as autonomous driving. In this paper, we aim bridge the gap between by presenting a novel model for segmentation, Sem2Ins, which effectively generates boundaries according leveraging conditional generative adversarial networks (cGANs) coupled...

10.1109/tmm.2021.3114541 article EN IEEE Transactions on Multimedia 2021-09-22

Model-free control for distributed stream data processing using deep reinforcement learning

OPENALEX - Publications

Teng Li Zhiyuan Xu Jian Tang Yanzhi Wang

In this paper, we focus on general-purpose Distributed Stream Data Processing Systems (DSDPSs) , which deal with processing of unbounded streams continuous data at scale distributedly in real or near-real time. A fundamental problem a DSDPS is the scheduling (i.e., assigning workload to workers/machines) objective minimizing average end-to-end tuple widely-used solution distribute evenly over machines cluster round-robin manner, obviously not efficient due lack consideration for...

10.14778/3199517.3199521 article EN Proceedings of the VLDB Endowment 2018-02-01

CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-Based Autonomous Urban Driving

OPENALEX - Publications

Yinuo Zhao Kun Wu Zhiyuan Xu Zhengping Che Lu Qi and 2 more

Vision-based autonomous urban driving in dense traffic is quite challenging due to the complicated environment and dynamics of behaviors. Widely-applied methods either heavily rely on hand-crafted rules or learn from limited human experience, which makes them hard generalize rare but critical scenarios. In this paper, we present a novel CAscade Deep REinforcement learning framework, CADRE, achieve model-free vision-based driving. derive representative latent features raw observations, first...

10.1609/aaai.v36i3.20259 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2022-06-28

Model-free control for distributed stream data processing using deep reinforcement learning

OPENALEX - Publications

Teng Li Zhiyuan Xu Jian Tang Yanzhi Wang

In this paper, we focus on general-purpose Distributed Stream Data Processing Systems (DSDPSs), which deal with processing of unbounded streams continuous data at scale distributedly in real or near-real time. A fundamental problem a DSDPS is the scheduling (i.e., assigning workload to workers/machines) objective minimizing average end-to-end tuple widely-used solution distribute evenly over machines cluster round-robin manner, obviously not efficient due lack consideration for communication...

10.5555/3199517.3199521 article EN Very Large Data Bases 2018-02-01

Model-free control for distributed stream data processing using deep reinforcement learning

OPENALEX - Publications

Teng Li Zhiyuan Xu Jian Tang Yanzhi Wang

In this paper, we focus on general-purpose Distributed Stream Data Processing Systems (DSDPSs), which deal with processing of unbounded streams continuous data at scale distributedly in real or near-real time. A fundamental problem a DSDPS is the scheduling objective minimizing average end-to-end tuple widely-used solution to distribute workload evenly over machines cluster round-robin manner, obviously not efficient due lack consideration for communication delay. Model-based approaches do...

10.14778/3184470.3184474 article EN Proceedings of the VLDB Endowment 2018-02-01

Adversarial Meta-Learning

OPENALEX - Publications

Chengxiang Yin Jian Tang Zhiyuan Xu Yanzhi Wang

Meta-learning enables a model to learn from very limited data undertake new task. In this paper, we study the general meta-learning with adversarial samples. We present algorithm, ADML (ADversarial Meta-Learner), which leverages clean and samples optimize initialization of learning in an manner. leads following desirable properties: 1) it turns out be effective even cases only samples; 2) is robust samples, i.e., unlike other algorithms, minor performance degradation when there are 3) sheds...

10.48550/arxiv.1806.03316 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network

OPENALEX - Publications

Ning Liu Ying Liu Brent R. Logan Zhiyuan Xu Jian Tang and 1 more

This paper presents the deep reinforcement learning (DRL) framework to estimate optimal Dynamic Treatment Regimes from observational medical data. is more flexible and adaptive for high dimensional action state spaces than existing methods model real-life complexity in heterogeneous disease progression treatment choices, with goal of providing doctors patients data-driven personalized decision recommendations. The proposed DRL comprises (i) a supervised step predict expert actions, (ii)...

10.1038/s41598-018-37142-0 article EN cc-by Scientific Reports 2019-02-06

CP3: Channel Pruning Plug-in for Point-Based Networks

OPENALEX - Publications

Yaomin Huang Ning Liu Zhengping Che Zhiyuan Xu Chaomin Shen and 5 more

Channel pruning can effectively reduce both computational cost and memory footprint of the original network while keeping a comparable accuracy performance. Though great success has been achieved in channel for 2D image-based convolutional networks (CNNs), existing works seldom extend methods to 3D point-based neural (PNNs). Directly implementing CNN PNNs undermine performance because different representations images point clouds as well architecture disparity. In this paper, we proposed CP...

10.1109/cvpr52729.2023.00513 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Experience-driven Networking: A Deep Reinforcement Learning based Approach

OPENALEX - Publications

Zhiyuan Xu Jian Tang Jingsong Meng Wei Zhang Yanzhi Wang and 2 more

Modern communication networks have become very complicated and highly dynamic, which makes them hard to model, predict control. In this paper, we develop a novel experience-driven approach that can learn well control network from its own experience rather than an accurate mathematical just as human learns new skill (such driving, swimming, etc). Specifically, we, for the first time, propose leverage emerging Deep Reinforcement Learning (DRL) enabling model-free in networks; present effective...

10.48550/arxiv.1801.05757 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Memory Augmented Deep Recurrent Neural Network for Video Question Answering

OPENALEX - Publications

Chengxiang Yin Jian Tang Zhiyuan Xu Yanzhi Wang

Video question answering (VideoQA) is a very important but challenging multimedia task, which automatically analyzes questions and videos generates accurate answers. However, research on VideoQA still in its infancy. In this article, we propose novel memory augmented deep recurrent neural network (MA-DRNN) model for VideoQA, features new method encoding questions, augmentation using the emerging differentiable computer (DNC). Specifically, encode textual (questions) information before visual...

10.1109/tnnls.2019.2938015 article EN IEEE Transactions on Neural Networks and Learning Systems 2019-09-20

An Actor-Critic-Based Transfer Learning Framework for Experience-Driven Networking

OPENALEX - Publications

Zhiyuan Xu Dejun Yang Jian Tang Yinan Tang Tongtong Yuan and 2 more

Experience-driven networking has emerged as a new and highly effective approach for resource allocation in complex communication networks. Deep Reinforcement Learning (DRL) been shown to be useful technique enabling experience-driven networking. In this paper, we focus on practical fundamental problem networking: when network configurations are changed, how train DRL agent effectively quickly adapt the environment. We present an Actor-Critic-based Transfer learning framework Traffic...

10.1109/tnet.2020.3037231 article EN publisher-specific-oa IEEE/ACM Transactions on Networking 2020-01-01