Hong Zhang

ORCID: 0000-0002-1282-3755
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Video Surveillance and Tracking Methods
  • Advanced Image and Video Retrieval Techniques
  • Advanced Neural Network Applications
  • Image Enhancement Techniques
  • Advanced Vision and Imaging
  • Visual Attention and Saliency Detection
  • Advanced Image Processing Techniques
  • Human Pose and Action Recognition
  • Advanced Image Fusion Techniques
  • Image and Signal Denoising Methods
  • Infrared Target Detection Methodologies
  • Image Processing Techniques and Applications
  • Anomaly Detection Techniques and Applications
  • Image Retrieval and Classification Techniques
  • Video Analysis and Summarization
  • Multimodal Machine Learning Applications
  • Fire Detection and Safety Systems
  • Image and Video Quality Assessment
  • Gait Recognition and Analysis
  • Robotics and Sensor-Based Localization
  • Software Engineering Research
  • Software Reliability and Analysis Research
  • Color Science and Applications
  • Remote-Sensing Image Classification
  • Domain Adaptation and Few-Shot Learning

Beihang University
2015-2024

Wuhan University of Science and Technology
2012-2024

North Minzu University
2024

Ocean University of China
2023-2024

Xi’an University of Posts and Telecommunications
2019-2023

Argonne National Laboratory
2023

China Electric Power Research Institute
2023

Tencent (China)
2023

Changchun Institute of Technology
2023

National Computer Network Emergency Response Technical Team/Coordination Center of Chinar
2023

Object detection in drone-captured images is a popular task recent years. As drones always navigate at different altitudes, the object scale varies considerably, which burdens optimization of models. Moreover, high-speed and low-altitude flight cause motion blur on densely packed objects, leads to great challenges. To solve two issues mentioned above, based YOLOv5, we add an additional prediction head detect tiny-scale objects replace CNN-based heads with transformer (TPH), constructing...

10.3390/rs15061687 article EN cc-by Remote Sensing 2023-03-21

Semantic segmentation is a critical module in robotics related applications, especially autonomous driving. Most of the research on semantic focused improving accuracy with less attention paid to computationally efficient solutions. Majority algorithms have customized optimizations without scalability and there no systematic way compare them. In this paper, we present real-time benchmarking framework study various for We implemented generic meta-architecture via decoupled design where...

10.1109/cvprw.2018.00101 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2018-06-01

Inspired by the recent advances of image super-resolution using convolutional neural network (CNN), we propose a CNN-based block up-sampling scheme for intra frame coding. A can be down-sampled before being compressed normal coding, and then up-sampled to its original resolution. Different from previous studies on down/up-sampling-based methods in our have been designed training CNN instead hand-crafted. We explore new structure up-sampling, which features deconvolution feature maps,...

10.1109/tcsvt.2017.2727682 article EN IEEE Transactions on Circuits and Systems for Video Technology 2017-07-17

Aiming at simultaneous detection and segmentation (SD-S), we propose a proposal-free framework, which detect segment object instances via mid-level patches. We design unified trainable network on patches, is followed by fast effective patch aggregation algorithm to infer instances. Our method benefits from end-to-end training. Without proposal generation, computation time can also be reduced. In experiments, our yields results 62.1% 61.8% in terms of mAPr VOC2012 val SDS val, are...

10.1109/cvpr.2016.342 article EN 2016-06-01

Semantic segmentation for high-resolution remote sensing images is one of the most significant tasks in field applications. Remote contain substantial detailed information ground objects, such as shape, location, and texture. Therefore, these objects make exhibit large intraclass variance small interclass variance, which makes it very difficult to be recognized. In this study, an end-to-end attention-based semantic network (SSAtNet) proposed. A pyramid attention pooling module proposed...

10.1109/tgrs.2021.3085889 article EN cc-by-nc-nd IEEE Transactions on Geoscience and Remote Sensing 2021-08-13

According to Darwinian evolutionary theory, numerous species in the wild have developed remarkable adaptive mechanisms, involving pattern rearrangement and environmental assimilation, evade predators. These obfuscation strategies pose significant challenges for both individuals algorithms when performing Camouflage Object Detection (COD) task complex intricate scenarios. Inspired by human COD task, which involve assigning uncertainties entire input then focusing on highly uncertain areas...

10.1109/tmm.2023.3295095 article EN IEEE Transactions on Multimedia 2023-07-13

Recently, by virtue of the high computational efficiency and accuracy, discriminative correlation filter (DCF)- based tracking methods have gained attraction in field unmanned aerial vehicle (UAV). However, conventional DCF-based merely rely on cyclic shift to produce training samples. As a result, trained these samples owns limited ability, ineffectively addressing various challenges stage. Here, promote filter's we develop feature block-aware (CF) method. Specifically, extracted is divided...

10.1109/lsp.2024.3373528 article EN IEEE Signal Processing Letters 2024-01-01

With the wide applications of vision based intelligent systems, image and video analysis technologies have attracted attention researchers in computer field. In analysis, human activity recognition is an important research direction. By interpreting understanding activity, we can recognize predict occurrence crimes help police or other agencies react immediately. past, a large number papers been published on sequences. this paper, provide comprehensive survey recent development techniques,...

10.3390/s130201635 article EN cc-by Sensors 2013-01-25

10.1007/s11277-018-5246-z article EN Wireless Personal Communications 2018-01-11

Timely and accurately detecting personal protective equipment (PPE) usage among workers is essential for substation safety management. However, traditional algorithms encounter difficulties in substations due to issues such as varying target scales, intricate backgrounds, many model parameters. Therefore, this paper proposes MEAG-YOLO, an enhanced PPE detection built upon YOLOv8n. First, the incorporates Multi-Scale Channel Attention (MSCA) module improve feature extraction. Second, it newly...

10.3390/app14114766 article EN cc-by Applied Sciences 2024-05-31

As the Internet-of-Things (IoT) and edge computing have been major paradigms for distributed data collection, communication, processing, smart city applications in real world tend to adopt IoT broadly. Today, more machine learning algorithms would be deployed into front-end sensors, devices, centres rather than centralised cloud centres. However, sensors devices are usually not so capable as those units huge centres, this sake, practice, engineers choose compromise limited capacity of...

10.3390/s19091987 article EN cc-by Sensors 2019-04-28

As a significant research direction in remote sensing fields, unmanned aerial vehicles (UAVs) tracking has achieved rapid development recent years. However, due to limited power and computation resources on platforms, the methods deployed UAVs usually require high computational efficiency performance. In addition, various challenges (i.e., similar object, background clutter occlusion) have inevitably occurred during UAV phase. Therefore, considering above issues comprehensively, this paper...

10.1109/jstars.2023.3306273 article EN cc-by-nc-nd IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 2023-01-01

This paper proposes a novel cross-media retrieval approach. First, an isomorphic subspace is constructed based on canonical correlation analysis (CCA) to learn multi-modal correlations of media objects; second, polar coordinates are used judge the general distance objects with different modalities in subspace. Since integrity semantic not likely learned from limited training samples, users' relevance feedback accurately refine similarities. We also propose methods map new into subspace, and...

10.1109/icip.2006.312707 article EN International Conference on Image Processing 2006-10-01

Abstract Objective Current approaches to food volume estimation require the person carry a fiducial marker (e.g. checkerboard card), be placed next before taking picture. This procedure is inconvenient and post-processing of picture time-consuming sometimes inaccurate. These problems keep people from using smartphone for self-administered dietary assessment. The current bioengineering study presents novel smartphone-based imaging approach table-side which overcomes limitations. Design We...

10.1017/s136898001800054x article EN Public Health Nutrition 2018-04-06

In recent years, the excellent image-based object detection algorithms are transferred to video directly. These frame-by-frame processing methods suboptimal owing degenerate appearance such as motion blur, defocus and rare poses. The existing works for mostly focus on feature aggregation at pixel level instance level, but blur impact in process has not been exploited well so far. this article, we propose an end-to-end blur-aid network (BFAN) detection. proposed BFAN focuses influenced by...

10.1109/access.2020.3038913 article EN cc-by IEEE Access 2020-01-01

The rapid development of pay-as-you-go cloud services motivates the increasing number resource demands. However, volatile demands bring new challenges for current techniques to minimize cost capacity planning and VM provisioning while satisfying customer service vendors will incur enormous revenue loss within long-term inappropriate planning, especially when fluctuate abruptly frequently. In this paper, we cast as a classification problem propose an integrated framework, which effectively...

10.1109/tsc.2018.2804916 article EN IEEE Transactions on Services Computing 2018-02-12

To address the issues of fuzzy scene details, reduced definition, and poor visibility in images captured under non-uniform lighting conditions, this paper presents an algorithm for effectively enhancing such images. Firstly, adaptive color balance method is employed to differences low-light images, ensuring a more uniform distribution yielding image with improved consistency. Subsequently, obtained transformed from RGB space HSV space, wherein multi-scale Gaussian function utilized...

10.3390/app13179535 article EN cc-by Applied Sciences 2023-08-23
Coming Soon ...