NFDI4DS | UHH-SEMS - Publication Details

Zulin Wang

ORCID: 0000-0002-1328-7739

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5103046319

Research Areas

Image and Video Quality Assessment
Visual Attention and Saliency Detection
Error Correcting Code Techniques
Advanced Wireless Communication Techniques
Video Coding and Compression Technologies
Advanced Vision and Imaging
Radar Systems and Signal Processing
Cooperative Communication and Network Coding
Advanced Image Processing Techniques
Advanced Data Compression Techniques
PAPR reduction in OFDM
Target Tracking and Data Fusion in Sensor Networks
Advanced Image Fusion Techniques
Image and Signal Denoising Methods
Video Surveillance and Tracking Methods
DNA and Biological Computing
Advanced Image and Video Retrieval Techniques
Coding theory and cryptography
Advanced Data Storage Technologies
Wireless Communication Networks Research
Image Enhancement Techniques
Advanced SAR Imaging Techniques
Algorithms and Data Compression
Indoor and Outdoor Localization Technologies
Face Recognition and Perception

Beihang University
2016-2025

Anhui University of Science and Technology
2023

Taishan Medical University
2016-2019

Yuhuangding Hospital
2019

Geospatial Research (United Kingdom)
2018

Shanghai Eighth People Hospital
2009-2017

Wuhan University
2013-2017

University of California, Davis
2012

Jiangxi University of Science and Technology
2012

China Astronaut Research and Training Center
2012

A Large-Scale Database and a CNN Model for Attention-Based Glaucoma Detection

OPENALEX - Publications

Liu Li Mai Xu Hanruo Liu Yang Li Xiaofei Wang and 4 more

Glaucoma is one of the leading causes irreversible vision loss. Many approaches have recently been proposed for automatic glaucoma detection based on fundus images. However, none existing can efficiently remove high redundancy in images detection, which may reduce reliability and accuracy detection. To avoid this disadvantage, paper proposes an attention-based convolutional neural network (CNN) called AG-CNN. Specifically, we first establish a large-scale (LAG) database, includes 11 760...

10.1109/tmi.2019.2927226 article EN IEEE Transactions on Medical Imaging 2019-07-11

Development and Validation of a Deep Learning System to Detect Glaucomatous Optic Neuropathy Using Fundus Photographs

OPENALEX - Publications

Hanruo Liu Liu Li I. Michael Wormstone Chunyan Qiao Chun Zhang and 25 more

A deep learning system (DLS) that could automatically detect glaucomatous optic neuropathy (GON) with high sensitivity and specificity expedite screening for GON.To establish a DLS detection of GON using retinal fundus images glaucoma diagnosis convoluted neural networks (GD-CNN) has the ability to be generalized across populations.In this cross-sectional study, classification was developed automated obtained from Chinese Glaucoma Study Alliance, Handan Eye Study, online databases. The...

10.1001/jamaophthalmol.2019.3501 article EN JAMA Ophthalmology 2019-09-12

Multi-frame Quality Enhancement for Compressed Video

OPENALEX - Publications

Ren Yang Mai Xu Zulin Wang Tianyi Li

The past few years have witnessed great success in applying deep learning to enhance the quality of compressed image/video. existing approaches mainly focus on enhancing a single frame, ignoring similarity between consecutive frames. In this paper, we investigate that heavy fluctuation exists across video frames, and thus low frames can be enhanced using neighboring high seen as Multi-Frame Quality Enhancement (MFQE). Accordingly, paper proposes an MFQE approach for video, first attempt...

10.1109/cvpr.2018.00697 preprint EN 2018-06-01

MFQE 2.0: A New Approach for Multi-Frame Quality Enhancement on Compressed Video

OPENALEX - Publications

Zhenyu Guan Qunliang Xing Mai Xu Ren Yang Tie Liu and 1 more

The past few years have witnessed great success in applying deep learning to enhance the quality of compressed image/video. existing approaches mainly focus on enhancing a single frame, not considering similarity between consecutive frames. Since heavy fluctuation exists across video frames as investigated this paper, frame can be utilized for enhancement low-quality given their neighboring high-quality This task is Multi-Frame Quality Enhancement (MFQE). Accordingly, paper proposes an MFQE...

10.1109/tpami.2019.2944806 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2019-10-02

Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach

OPENALEX - Publications

Mai Xu Yuhang Song Jianyi Wang Minglang Qiao Liangyu Huo and 1 more

Panoramic video provides immersive and interactive experience by enabling humans to control the field of view (FoV) through head movement (HM). Thus, HM plays a key role in modeling human attention on panoramic video. This paper establishes database collecting subjects' sequences. From this database, we find that data are highly consistent across subjects. Furthermore, deep reinforcement learning (DRL) can be applied predict positions, via maximizing reward imitating scanpaths agent's...

10.1109/tpami.2018.2858783 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2018-07-25

Road Structure Refined CNN for Road Extraction in Aerial Image

OPENALEX - Publications

Yanan Wei Zulin Wang Mai Xu

In this letter, we propose a road structure refined convolutional neural network (RSRCNN) approach for extraction in aerial images. order to obtain structured output of extraction, both deconvolutional and fusion layers are designed the architecture RSRCNN. For training RSRCNN, new loss function is proposed incorporate geometric information cross-entropy loss, thus called road-structure-based function. Experimental results demonstrate that trained RSRCNN model able advance state-of-the-art...

10.1109/lgrs.2017.2672734 article EN IEEE Geoscience and Remote Sensing Letters 2017-03-14

Enhancing Quality for HEVC Compressed Videos

OPENALEX - Publications

Ren Yang Mai Xu Tie Liu Zulin Wang Zhenyu Guan

The latest High Efficiency Video Coding (HEVC) standard has been increasingly applied to generate video streams over the Internet. However, HEVC compressed videos may incur severe quality degradation, particularly at low bit-rates. Thus, it is necessary enhance visual of decoder side. To this end, paper proposes a Quality Enhancement Convolutional Neural Network (QE-CNN) method that does not require any modification encoder achieve enhancement for HEVC. In particular, our QE-CNN learns...

10.1109/tcsvt.2018.2867568 article EN IEEE Transactions on Circuits and Systems for Video Technology 2018-08-29

Reducing Complexity of HEVC: A Deep Learning Approach

OPENALEX - Publications

Mai Xu Tianyi Li Zulin Wang Xin Deng Ren Yang and 1 more

High Efficiency Video Coding (HEVC) significantly reduces bit-rates over the proceeding H.264 standard but at expense of extremely high encoding complexity. In HEVC, quad-tree partition coding unit (CU) consumes a large proportion HEVC complexity, due to bruteforce search for rate-distortion optimization (RDO). Therefore, this paper proposes deep learning approach predict CU reducing complexity both intra- and inter-modes, which is based on convolutional neural network (CNN) long- short-term...

10.1109/tip.2018.2847035 article EN IEEE Transactions on Image Processing 2018-06-13

A Deep Learning Approach for Multi-Frame In-Loop Filter of HEVC

OPENALEX - Publications

Tianyi Li Mai Xu Ce Zhu Ren Yang Zulin Wang and 1 more

An extensive study on the in-loop filter has been proposed for a high efficiency video coding (HEVC) standard to reduce compression artifacts, thus improving efficiency. However, in existing approaches, is always applied each single frame, without exploiting content correlation among multiple frames. In this paper, we propose multi-frame (MIF) HEVC, which enhances visual quality of encoded frame by leveraging its adjacent Specifically, first construct large-scale database containing frames...

10.1109/tip.2019.2921877 article EN IEEE Transactions on Image Processing 2019-06-14

Assessing Visual Quality of Omnidirectional Videos

OPENALEX - Publications

Mai Xu Chen Li Zhenzhong Chen Zulin Wang Zhenyu Guan

In contrast with traditional video, omnidirectional video enables spherical viewing direction support for head-mounted displays, providing an interactive and immersive experience. Unfortunately, to the best of our knowledge, there are few visual quality assessment (VQA) methods, either subjective or objective, coding. This paper proposes both objective methods assessing loss in encoding video. Specifically, we first present a new database, which includes data from several subjects watching...

10.1109/tcsvt.2018.2886277 article EN IEEE Transactions on Circuits and Systems for Video Technology 2018-12-11

Decoder-side HEVC quality enhancement with scalable convolutional neural network

OPENALEX - Publications

Ren Yang Mai Xu Zulin Wang

The latest High Efficiency Video Coding (HEVC) has been increasingly used to generate video streams over Internet. However, the decoded HEVC may incur severe quality degradation, especially at low bit-rates. Thus, it is necessary enhance visual of videos decoder side. To this end, we propose in paper a Decoder-side Scalable Convolutional Neural Network (DS-CNN) approach achieve enhancement for HEVC, which does not require any modification encoder. In particular, our DS-CNN learns model...

10.1109/icme.2017.8019299 article EN 2022 IEEE International Conference on Multimedia and Expo (ICME) 2017-07-01

DeepMTT: A deep learning maneuvering target-tracking algorithm based on bidirectional LSTM network

OPENALEX - Publications

Jingxian Liu Zulin Wang Mai Xu

10.1016/j.inffus.2019.06.012 article EN Information Fusion 2019-06-05

Optimal Bit Allocation for CTU Level Rate Control in HEVC

OPENALEX - Publications

Shengxi Li Mai Xu Zulin Wang Xiaoyan Sun

For High Efficiency Video Coding (HEVC), the R– <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$\lambda $ </tex-math></inline-formula> scheme is latest rate control (RC) scheme, which investigates relationships among allocated bits, slope of rate-distortion (R-D) curve , and quantization parameter. However, we argue that bit allocation in existing not optimal. In this paper, therefore propose an optimal...

10.1109/tcsvt.2016.2589878 article EN IEEE Transactions on Circuits and Systems for Video Technology 2016-07-11

Bridge the Gap Between VQA and Human Behavior on Omnidirectional Video

OPENALEX - Publications

Chen Li Mai Xu Xinzhe Du Zulin Wang

Omnidirectional video enables spherical stimuli with the $360 \times 180^ \circ$ viewing range. Meanwhile, only viewport region of omnidirectional can be seen by observer through head movement (HM), and an even smaller within clearly perceived eye (EM). Thus, subjective quality may correlated HM EM human behavior. To fill in gap between behavior, this paper proposes a large-scale visual assessment (VQA) dataset video, called VQA-OV, which collects 60 reference sequences 540 impaired...

10.1145/3240508.3240581 article EN Proceedings of the 30th ACM International Conference on Multimedia 2018-10-15

Joint Learning of 3D Lesion Segmentation and Classification for Explainable COVID-19 Diagnosis

OPENALEX - Publications

Xiaofei Wang Lai Jiang Liu Li Mai Xu Xin Deng and 6 more

Given the outbreak of COVID-19 pandemic and shortage medical resource, extensive deep learning models have been proposed for automatic diagnosis, based on 3D computed tomography (CT) scans. However, existing independently process lesion segmentation disease classification, ignoring inherent correlation between these two tasks. In this paper, we propose a joint model classification diagnosing COVID-19, called DeepSC-COVID, as first attempt in direction. Specifically, establish large-scale CT...

10.1109/tmi.2021.3079709 article EN IEEE Transactions on Medical Imaging 2021-05-13

Region-of-Interest Based Conversational HEVC Coding with Hierarchical Perception Model of Face

OPENALEX - Publications

Mai Xu Xin Deng Shengxi Li Zulin Wang

In this paper, we propose a region-of-interest (ROI) based HEVC coding approach for conversational videos, with novel hierarchical perception model of face (HP model), to improve the perceived visual quality state-of-the-art standard. contrast previous ROI-based video approaches, HP allows unequal importance facial features (e.g., eyes and mouth) within region, by generating pixel-wise weight map. Benefitting from such model, adaptive tree unit (CTU) partition structure is developed...

10.1109/jstsp.2014.2314864 article EN IEEE Journal of Selected Topics in Signal Processing 2014-04-02

Learning to Detect Video Saliency With HEVC Features

OPENALEX - Publications

Mai Xu Lai Jiang Xiaoyan Sun Zhaoting Ye Zulin Wang

Saliency detection has been widely studied to predict human fixations, with various applications in computer vision and image processing. For saliency detection, we argue this paper that the state-of-the-art High Efficiency Video Coding (HEVC) standard can be used generate useful features compressed domain. Therefore, proposes learn video model, regard HEVC features. First, establish an eye tracking database for which downloaded from https://github.com/remega/video_database. Through...

10.1109/tip.2016.2628583 article EN IEEE Transactions on Image Processing 2016-11-14

Closed-Form Optimization on Saliency-Guided Image Compression for HEVC-MSP

OPENALEX - Publications

Shengxi Li Mai Xu Yun Ren Zulin Wang

High efficiency video coding (HEVC) is the latest standard, and it has best performance among all existing standards. HEVC main still picture profile (HEVC-MSP) also achieves top in image compr-ession. In this paper, we propose a closed-form bit allocation approach to optimize saliency-guided PSNR (viewed as perceptual distortion) such that of HEVC-based compression can be significantly improved from subjective perspective. Specifically, formulation established minimize distortion with...

10.1109/tmm.2017.2721544 article EN IEEE Transactions on Multimedia 2017-06-29

Joint Learning of Multi-Level Tasks for Diabetic Retinopathy Grading on Low-Resolution Fundus Images

OPENALEX - Publications

Xiaofei Wang Mai Xu Jicong Zhang Lai Jiang Liu Li and 4 more

Diabetic retinopathy (DR) is a leading cause of permanent blindness among the working-age people. Automatic DR grading can help ophthalmologists make timely treatment for patients. However, existing methods are usually trained with high resolution (HR) fundus images, such that performance decreases lot given low (LR) which common in clinic. In this paper, we mainly focus on LR images. According to our analysis task, find that: 1) image super-resolution (ISR) boost both and lesion...

10.1109/jbhi.2021.3119519 article EN IEEE Journal of Biomedical and Health Informatics 2021-10-15

Weight-based R-λ rate control for perceptual HEVC coding on conversational videos

OPENALEX - Publications

Shengxi Li Mai Xu Xin Deng Zulin Wang

10.1016/j.image.2015.04.011 article EN Signal Processing Image Communication 2015-05-14

Cyclical NOMA Based UAV-Enabled Wireless Network

OPENALEX - Publications

Jinjing Sun Zulin Wang Qin Huang

In order to achieve high spectral efficiency and low access delay, this paper introduces cyclical non-orthogonal multiple (NOMA) into unmanned aerial vehicle (UAV)-enabled wireless network. It allows the UAV communicate with ground users in same time–frequency resources, cyclically. The minimum throughput over all is maximized by jointly optimizing multiuser communication scheduling NOMA trajectory. turns out that maximization of a mixed integer non-linear non-convex optimization problem....

10.1109/access.2018.2888855 article EN cc-by-nc-nd IEEE Access 2018-12-20

Viewport-Dependent Saliency Prediction in 360° Video

OPENALEX - Publications

Minglang Qiao Mai Xu Zulin Wang Ali Borji

Saliency prediction in traditional images and videos has drawn extensive research interests recent years. Few works have been proposed for saliency over 360° videos. They focus on directly predicting fixations the whole panorama. When viewing videos, a person can only observe content her viewport, which means that fraction of scene be seen at any given time. In this paper, we study human attention viewport propose novel visual model, dubbed saliency, to predict Two contributions are...

10.1109/tmm.2020.2987682 article EN IEEE Transactions on Multimedia 2020-04-20

Coming Soon ...