Qijun Zhao

ORCID: 0000-0003-4651-7163
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Biometric Identification and Security
  • Face recognition and analysis
  • Face and Expression Recognition
  • Video Surveillance and Tracking Methods
  • Advanced Image and Video Retrieval Techniques
  • Visual Attention and Saliency Detection
  • Forensic Fingerprint Detection Methods
  • Advanced Neural Network Applications
  • Advanced Vision and Imaging
  • Human Pose and Action Recognition
  • Infrared Target Detection Methodologies
  • Generative Adversarial Networks and Image Synthesis
  • Forensic and Genetic Research
  • 3D Shape Modeling and Analysis
  • Animal Vocal Communication and Behavior
  • Digital Media Forensic Detection
  • Gait Recognition and Analysis
  • Advanced Steganography and Watermarking Techniques
  • UAV Applications and Optimization
  • Image Processing and 3D Reconstruction
  • Fire Detection and Safety Systems
  • Emotion and Mood Recognition
  • Image Retrieval and Classification Techniques
  • Image and Video Quality Assessment
  • Computer Graphics and Visualization Techniques

Zhaotong University
2017-2025

Sichuan University
2016-2025

Chengdu University
2016-2025

Nanjing University of Aeronautics and Astronautics
2022-2025

Tibet University
2019-2023

Gansu Provincial Maternal and Child Health Hospital
2022

University of Benin
2022

Ningbo University
2020

Michigan State University
2010-2013

Hong Kong Polytechnic University
2006-2011

This paper proposes a novel joint learning and densely-cooperative fusion (JL-DCF) architecture for RGB-D salient object detection. Existing models usually treat RGB depth as independent information design separate networks feature extraction from each. Such schemes can easily be constrained by limited amount of training data or over-reliance on an elaborately-designed process. In contrast, our JL-DCF learns both inputs through Siamese network. To this end, we propose two effective...

10.1109/cvpr42600.2020.00312 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Modern crowd counting methods usually employ deep neural networks (DNN) to estimate counts via density regression. Despite their significant improvements, the regression-based are incapable of providing detection individuals in crowds. The detection-based methods, on other hand, have not been largely explored recent trends due needs for expensive bounding box annotations. In this work, we instead propose a new network with only point supervision required. It can simultaneously detect size...

10.1109/cvpr.2019.00663 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Existing RGB-D salient object detection (SOD) models usually treat RGB and depth as independent information design separate networks for feature extraction from each. Such schemes can easily be constrained by a limited amount of training data or over-reliance on an elaborately designed process. Inspired the observation that modalities actually present certain commonality in distinguishing objects, novel joint learning densely cooperative fusion (JL-DCF) architecture is to learn both inputs...

10.1109/tpami.2021.3073689 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2021-01-01

RGB-D salient object detection (SOD) recently has attracted increasing research interest and many deep learning methods based on encoder-decoder architectures have emerged. However, most existing SOD models conduct feature fusion either in the single encoder or decoder stage, which hardly guarantees sufficient cross-modal ability. In this paper, we make first attempt addressing through 3D convolutional neural networks. The proposed model, named RD3D, aims at pre-fusion stage in-depth to...

10.1609/aaai.v35i2.16191 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

This paper proposes an encoder-decoder network to disentangle shape features during 3D face reconstruction from single 2D images, such that the tasks of reconstructing accurate shapes and learning discriminative for recognition can be accomplished simultaneously. Unlike existing methods, our proposed method directly regresses dense tackles identity residual (i.e., non-identity) components in explicitly separately based on a composite model with latent representations. We devise training...

10.1109/cvpr.2018.00547 preprint EN 2018-06-01

Face alignment and 3D face reconstruction are traditionally accomplished as separated tasks. By exploring the strong correlation between 2D landmarks shapes, in contrast, we propose a joint method to simultaneously solve these two problems for images of arbitrary poses expressions. This method, based on summation model faces cascaded regression shape spaces, iteratively alternately applies regressors, one updating other shape. The correlated via 3D-to-2D mapping matrix, which is updated each...

10.1109/tpami.2018.2885995 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2018-12-10

RGB-D salient object detection (SOD) recently has attracted increasing research interest by benefiting conventional RGB SOD with extra depth information. However, existing models often fail to perform well in terms of both efficiency and accuracy, which hinders their potential applications on mobile devices real-world problems. An underlying challenge is that the model accuracy usually degrades when simplified have few parameters. To tackle this dilemma also inspired fact quality a key...

10.1145/3474085.3475240 article EN Proceedings of the 30th ACM International Conference on Multimedia 2021-10-17

Depth information has been proved beneficial in RGB-D salient object detection (SOD). However, depth maps obtained often suffer from low quality and inaccuracy. Most existing SOD models have no cross-modal interactions or only unidirectional to RGB their encoder stages, which may lead inaccurate features when facing depth. To address this limitation, we propose conduct progressive bidirectional as early the stage, yielding a novel bi-directional transfer-and-selection network named BTS-Net,...

10.1109/icme51207.2021.9428263 article EN 2022 IEEE International Conference on Multimedia and Expo (ICME) 2021-06-09

With the increasing demand for search and rescue, it is highly demanded to detect objects of interest in large-scale images captured by Unmanned Aerial Vehicles (UAVs), which quite challenging due extremely small scales objects. Most existing methods employed Feature Pyramid Network (FPN) enrich shallow layers' features combing deep contextual features. However, under limitation inconsistency gradient computation across different layers, layers FPN are not fully exploited tiny In this paper,...

10.1109/lgrs.2021.3103069 article EN IEEE Geoscience and Remote Sensing Letters 2021-08-17

RGB-depth (RGB-D) salient object detection (SOD) recently has attracted increasing research interest, and many deep learning methods based on encoder–decoder architectures have emerged. However, most existing RGB-D SOD models conduct explicit controllable cross-modal feature fusion either in the single encoder or decoder stage, which hardly guarantees sufficient ability. To this end, we make first attempt addressing through 3-D convolutional neural networks. The proposed model, named RD3D,...

10.1109/tnnls.2022.3202241 article EN IEEE Transactions on Neural Networks and Learning Systems 2022-09-13

High-resolution automated fingerprint recognition systems (AFRSs) offer higher security because they are able to make use of level-3 features, such as pores, that not available in lower resolution ( <; 500-dpi) images. One the main parameters affecting quality a digital image and issues cost, interoperability, performance an AFRS is choice resolution. In this paper, we identify optimal for using two most representative features: minutiae pores. We first designed multiresolution acquisition...

10.1109/tim.2010.2062610 article EN IEEE Transactions on Instrumentation and Measurement 2010-08-24

Compared to conventional saliency detection by handcrafted features, deep convolutional neural networks (CNNs) recently have been successfully applied field with superior performance on locating salient objects. However, due repeated sub-sampling operations inside CNNs such as pooling and convolution, many CNN-based models fail maintain fine-grained spatial details boundary structures of To remedy this issue, paper proposes a novel end-to-end learning-based refinement model named Refinet,...

10.1109/tmm.2018.2859746 article EN publisher-specific-oa IEEE Transactions on Multimedia 2018-07-25

10.1016/j.compag.2021.106523 article EN Computers and Electronics in Agriculture 2021-11-10

Estimating homography from an image pair is a fundamental problem in alignment. Unsupervised learning methods have received increasing attention this field due to their promising performance and label-free training. However, existing do not explicitly consider the of plane-induced parallax, which will make predicted compromised on multiple planes. In work, we propose novel method HomoGAN guide unsupervised estimation focus dominant plane. First, multi-scale transformer network designed...

10.1109/cvpr52688.2022.01714 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01
Coming Soon ...