Zhong Zhou

ORCID: 0000-0002-5825-7517
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Vision and Imaging
  • Video Surveillance and Tracking Methods
  • Computer Graphics and Visualization Techniques
  • Advanced Image and Video Retrieval Techniques
  • Human Pose and Action Recognition
  • Robotics and Sensor-Based Localization
  • Advanced Neural Network Applications
  • Peer-to-Peer Network Technologies
  • 3D Shape Modeling and Analysis
  • Image Enhancement Techniques
  • Distributed and Parallel Computing Systems
  • Gait Recognition and Analysis
  • Simulation and Modeling Applications
  • Caching and Content Delivery
  • Multimodal Machine Learning Applications
  • Remote Sensing and LiDAR Applications
  • Anomaly Detection Techniques and Applications
  • Domain Adaptation and Few-Shot Learning
  • Simulation Techniques and Applications
  • Video Analysis and Summarization
  • Fluid Dynamics Simulations and Interactions
  • Fluid Dynamics and Heat Transfer
  • Face recognition and analysis
  • 3D Surveying and Cultural Heritage
  • Cloud Computing and Resource Management

Beihang University
2016-2025

South China Agricultural University
2025

PLA Information Engineering University
2020-2024

State Key Laboratory of Virtual Reality Technology and Systems
2008-2024

Virtual Reality Medical Center
2010-2024

University of Stuttgart
2024

Central South University
2024

State Grid Hunan Electric Power Company Limited
2023

Purple Mountain Laboratories
2022

Institute of Electrical and Electronics Engineers
2021

The multiscale defect detection for photovoltaic (PV) cell electroluminescence (EL) images is a challenging task, due to the feature vanishing as network deepens. To address this problem, an attention-based top-down and bottom-up architecture developed accomplish fusion. This architecture, called bidirectional attention pyramid (BAFPN), can make all layers of share similar semantic features. In BAFPN, cosine similarity employed measure importance each pixel in fused Furthermore, novel object...

10.1109/tie.2021.3070507 article EN IEEE Transactions on Industrial Electronics 2021-04-07

The anomaly detection in photovoltaic (PV) cell electroluminescence (EL) image is of great significance for the vision-based fault diagnosis. Many researchers are committed to solving this problem, but a large-scale open-world dataset required validate their novel ideas. We build PV EL Anomaly Detection (PVEL-AD <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1, 2, 3</sup> ) polycrystalline solar cell, which contains 36 543 near-infrared images...

10.1109/tii.2022.3162846 article EN IEEE Transactions on Industrial Informatics 2022-03-29

Image/video stitching is a technology for solving the field of view (FOV) limitation images/ videos. It stitches multiple overlapping images/videos to generate wide-FOV image/video, and has been used in various fields such as sports broadcasting, video surveillance, street view, entertainment. This survey reviews image/video algorithms, with particular focus on those developed recent years. Image first calculates corresponding relationships between images, deforms aligns matched then blends...

10.3724/sp.j.2096-5796.2018.0008 article EN cc-by-nc-nd Virtual Reality & Intelligent Hardware 2019-01-10

10.1007/s11432-017-9189-6 article EN Science China Information Sciences 2017-11-17

CSI-Based physical layer authentication is a promising candidate to achieve fast and lightweight for wireless communication. However, the current methods usually cannot initial are susceptible channel noise. Besides, learning-aided requires illegitimate state information (CSI) samples that difficult obtain. This letter proposes newly deep-CSI-based scheme solve above problems. We map CSI device's location further its authenticated identity via deep learning in static environment. Therefore,...

10.1109/lwc.2022.3180901 article EN IEEE Wireless Communications Letters 2022-06-08

For the problem of action detection, most existing methods require that relevant portions interest in training videos have been manually annotated with bounding boxes. Some recent works tried to avoid tedious manual annotation , and proposed automatically identify videos. However, these only concerned identification either spatial or temporal domain, may get irrelevant contents from another domain. These are usually undesirable phase, which will lead a degradation detection performance. This...

10.1109/tmm.2015.2404779 article EN IEEE Transactions on Multimedia 2015-02-18

Online inter-camera trajectory association is a promising topic in intelligent video surveillance, which concentrates on associating trajectories belong to the same individual across different cameras according time. It remains challenging due inconsistent appearance of person and lack spatio-temporal constraints between cameras. Besides, orientation variations partial occlusions significantly increase difficulty association. Targeting solve these problems, this work proposes an...

10.1145/3240508.3240663 article EN Proceedings of the 30th ACM International Conference on Multimedia 2018-10-15

The login functionality, being the gateway to app usage, plays a critical role in both user experience and application security. As Android apps increasingly incorporate functionalities, they support variety of authentication methods with complicated processes, catering personalized experiences. However, complexities managing different operations processes make it difficult for developers handle them correctly. In this paper, we present first empirical study issues apps. We analyze 361 from...

10.48550/arxiv.2502.04200 preprint EN arXiv (Cornell University) 2025-02-06

Existing localization methods commonly employ vision to perceive scene and achieve in GNSS-denied areas, yet they often struggle environments with complex lighting conditions, dynamic objects or privacy-preserving areas. Humans possess the ability describe various scenes using natural language help others infer location by recognizing recalling rich semantic information these descriptions. Harnessing presents a potential solution for robust localization. Thus, this study introduces new task,...

10.1109/tip.2025.3546853 article EN IEEE Transactions on Image Processing 2025-01-01

Long non-coding RNAs (lncRNAs) are critical in the regulation of various biological processes. In recent years, plethora lncRNAs have been identified mammalian genomes through different approaches, and researchers constantly reporting regulatory roles these lncRNAs, which leads to complexity literature about particular lncRNAs. Therefore, for convenience researchers, we collected relationships built a database called 'LncReg'. This is developed by collecting 1081 validated lncRNA-associated...

10.1093/database/bav083 article EN Database 2015-01-01

Semantic segmentation is a challenging task that needs to handle large scale variations, deformations and different viewpoints. In this paper, we develop novel network named Gated Path Selection Network (GPSNet), which aims learn adaptive receptive fields. GPSNet, first design two-dimensional multi-scale - SuperNet, densely incorporates features from growing To dynamically select desirable semantic context, gate prediction module further introduced. contrast previous works focus on...

10.1109/tip.2020.3046921 article EN IEEE Transactions on Image Processing 2021-01-01

Context modeling or multi-level feature fusion methods have been proved to be effective in improving semantic segmentation performance. However, they are not specialized deal with the problems of pixel-context mismatch and spatial misalignment, high computational complexity hinders their widespread application real-time scenarios. In this work, we propose a lightweight Spatial Feature Calibration Network (CSFCN) address above issues pooling-based sampling-based attention mechanisms. CSFCN...

10.1109/tip.2023.3318967 article EN IEEE Transactions on Image Processing 2023-01-01

Vehicle re-identification (re-id) is a promising topic, which focuses on retrieving the same vehicles across different cameras. It challenging due to variations of illumination and camera viewpoints. To solve these problems, we present multi -attribute driven vehicle re-id approach learn discriminative representations. The proposed consists multi-branch architecture re-ranking strategy. extracts color, model, appearance features, explicitly leverages attribute cues enhance generalization...

10.1109/icip.2018.8451776 article EN 2018-09-07

Unmanned aerial vehicles (UAVs) can capture high-quality photos and have been widely used for large-scale urban 3D reconstruction. However, even with the help of commercial flight control software, it is still a challenging task non-professional users to full-coverage in complex environments, which normally leads incomplete In this paper, we propose novel path planning method reconstruction scenes. The proposed approach first captures photos, following an initial generate coarse model as...

10.3390/rs13050989 article EN cc-by Remote Sensing 2021-03-05

Road damage detection (RDD) is indispensable for safe autonomous driving. Existing RDD models focus on designing feature representations following expert knowledge. However, collecting and labeling all types of samples time-consuming leads to insufficient training data. To alleviate the adverse effect few samples, a novel few-shot road detector (FSRDD) proposed in this paper detect rare damages. The FSRDD includes three stages. First, fully annotated abundant base classes are leveraged train...

10.1109/tits.2022.3208188 article EN IEEE Transactions on Intelligent Transportation Systems 2022-09-29

Open-set object detection (OSOD) aims to detect the known categories and reject unknown objects in a dynamic world, which has achieved significant attention. However, previous approaches only consider this problem data-abundant conditions, while neglecting few-shot scenes. In paper, we seek solution for generalized open-set (G-FOOD), avoid detecting classes as with high confidence score maintaining performance of detection. The main challenge task is that few training samples induce model...

10.1109/tip.2024.3364495 article EN IEEE Transactions on Image Processing 2024-01-01

Two-color pyrometric methods have been widely used in noncontact temperature measurement area. However, it is difficult to get synchronous monochromatic images for two-color formula. Some researches use beam splitter obtain two or more optical paths capture the different images, but complex will bring spatiotemporal matching errors. Another method uses color camera Red, Green, Blue (RGB) channel as RGB substituting Dirac delta function spectral response result inaccuracy of results. In fact,...

10.1109/tim.2015.2444251 article EN IEEE Transactions on Instrumentation and Measurement 2015-07-09

Crowd counting is a challenging problem due to the diverse crowd distribution and background interference. In this paper, we propose new approach for head size estimation reduce impact of different scale noise. Different from just using local information distance between human heads, global people in whole image also under consideration. We obey order far- near-region (small large) spread size, ensure that propagation uninterrupted by inserting dummy points. The estimated further exploited,...

10.1109/tip.2020.3009030 article EN IEEE Transactions on Image Processing 2020-01-01

In recent years, the amount of remote sensing imagery data has increased exponentially. The ability to quickly and effectively find required images from massive archives is key organization, management, sharing image information. This paper proposes a high-resolution retrieval method with Gabor-CA-ResNet split-based deep feature transform network. main contributions include two points. (1) For complex texture, diverse scales, special viewing angles images, A network taking ResNet as backbone...

10.3390/rs13050869 article EN cc-by Remote Sensing 2021-02-26

Due to occlusions and objects' non-rigid deformation in the scene, obtained motion trajectories from common trackers may contain a number of missing or mis-associated entries. To cluster such corrupted point based into multiple motions is still hard problem. In this paper, we present an approach that exploits temporal spatial characteristics tracked points facilitate segmentation incomplete trajectories, thereby obtain highly robust results against severe data noises. Our method first uses...

10.1109/iccv.2013.383 article EN 2013-12-01
Coming Soon ...