Qijie Zhao

ORCID: 0000-0002-5695-0043
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Gaze Tracking and Assistive Technology
  • Advanced Neural Network Applications
  • Advanced Image and Video Retrieval Techniques
  • Industrial Vision Systems and Defect Detection
  • Robotics and Automated Systems
  • Domain Adaptation and Few-Shot Learning
  • Video Surveillance and Tracking Methods
  • Image and Object Detection Techniques
  • 3D Surveying and Cultural Heritage
  • Advanced Computing and Algorithms
  • Image Processing Techniques and Applications
  • Robotics and Sensor-Based Localization
  • Handwritten Text Recognition Techniques
  • Ergonomics and Musculoskeletal Disorders
  • Human Pose and Action Recognition
  • Hand Gesture Recognition Systems
  • Vehicle License Plate Recognition
  • Image Enhancement Techniques
  • Glaucoma and retinal disorders
  • Sleep and Work-Related Fatigue
  • Anomaly Detection Techniques and Applications
  • Color perception and design
  • Advanced Measurement and Detection Methods
  • Dental Radiography and Imaging
  • Face recognition and analysis

Shanghai University
2014-2024

Beijing Institute of Radio Metrology and Measurement
2021

Peking University
2018-2020

State Key Laboratory of Industrial Control Technology
2011

University of Science and Technology Liaoning
2003

We present MMDetection, an object detection toolbox that contains a rich set of and instance segmentation methods as well related components modules. The started from codebase MMDet team who won the track COCO Challenge 2018. It gradually evolves into unified platform covers many popular contemporary not only includes training inference codes, but also provides weights for more than 200 network models. believe this is by far most complete toolbox. In paper, we introduce various features...

10.48550/arxiv.1906.07155 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Feature pyramids are widely exploited by both the state-of-the-art one-stage object detectors (e.g., DSSD, RetinaNet, RefineDet) and two-stage Mask RCNN, DetNet) to alleviate problem arising from scale variation across instances. Although these with feature achieve encouraging results, they have some limitations due that only simply construct pyramid according inherent multiscale, pyramidal architecture of backbones which originally designed for classification task. Newly, in this work, we...

10.1609/aaai.v33i01.33019259 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2019-07-17

In existing CNN based detectors, the backbone network is a very important component for basic feature1 extraction, and performance of detectors highly depends on it. this paper, we aim to achieve better detection by building more powerful from ones like ResNet ResNeXt. Specifically, propose novel strategy assembling multiple identical backbones composite connections between adjacent backbones, form named Composite Backbone Network (CBNet). way, CBNet iteratively feeds output features...

10.1609/aaai.v34i07.6834 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

The convolution operation suffers from a limited receptive filed, while global modeling is fundamental to dense prediction tasks, such as semantic segmentation. In this paper, we apply graph into the segmentation task and propose an improved Laplacian. reasoning directly performed in original feature space organized spatial pyramid. Different existing methods, our Laplacian data-dependent introduce attention diagonal matrix learn better distance metric. It gets rid of projecting...

10.1109/cvpr42600.2020.00897 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

The panoptic segmentation task requires a unified result from semantic and instance outputs that may contain overlaps. However, current studies widely ignore modeling In this study, we aim to model overlap relations among instances resolve them for segmentation. Inspired by scene graph representation, formulate the overlapping problem as simplified case, named graph. We leverage each object's category, geometry appearance features perform relational embedding, output relation matrix encodes...

10.1609/aaai.v34i07.6955 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

The captured underwater images suffer from color cast and haze effect caused by absorption scattering. These interdependent phenomena jointly degrade images, resulting in failure of autonomous machines to recognize image contents. Most existing learning-based methods for enhancement (UIE) treat the degraded process as a whole ignore interaction between correction dehazing. Thus, they often obtain unnatural results. To this end, we propose novel joint network optimize results dehazing...

10.1109/lra.2021.3070253 article EN IEEE Robotics and Automation Letters 2021-03-31

Since underwater images are seriously degraded due to the attenuation of light, artificial light (AL) is often used assist photography in underwater. However, normal imaging process changed by AL. It observed that AL source typically alters condition a large extent, resulting non-uniform illumination images. In addition, color distortion area affected little because close object suffers attenuation. most existing image enhancement algorithms ignore this phenomenon. their results, areas tend...

10.1109/tcsvt.2023.3237993 article EN IEEE Transactions on Circuits and Systems for Video Technology 2023-01-18

The ability to detect small objects and the speed of object detector are very important for application autonomous driving, in this paper, we propose an effective yet efficient one-stage detector, which gained second place Road Object Detection competition CVPR2018 workshop - Workshop Autonomous Driving(WAD). proposed inherits architecture SSD introduces a novel Comprehensive Feature Enhancement(CFE) module into it. Experimental results on dataset as well MSCOCO demonstrate that (named...

10.48550/arxiv.1806.09790 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Interior wall painting is widely used in construction, installation repair, decoration, and other construction industry. By replacing human manual operation, robotic developed to improve efficiency reduce the cost. This paper proposes a parameters estimation scheme for process. The parameter system built on single camera, laser level, four distance sensors. camera's intrinsic are obtained by calibration method. relative position between robot through single-image measurement. plane equation...

10.1109/tim.2018.2878427 article EN IEEE Transactions on Instrumentation and Measurement 2018-11-15

Abstract Slow detection of redundant objects and low accuracy in assembly lines, particularly the setting civil aircraft assembly, are tough challenging problems. To address these issues, a object method based on computer vision augmented reality (AR) smart glasses is proposed this paper. The uses AR as image collection hardware takes live collected by camera input deep learning machine model. model, Feature Pyramid Networks-CenterNet, inspired CenterNet combined with multi-scale feature...

10.1088/1361-6501/ac7cbd article EN Measurement Science and Technology 2022-06-28

Many outdoor activities, such as bicycling, driving, and riding segways can be modeled rigid bodies on a moving platform. In this study, an attitude estimation scheme is established for the body-moving platform system by using two gyroscopes relative measurements between body The proposed estimates drift-free attitudes of partial absolute without any global information or reference. kinematic model plays important role to obtain angles. To illustrate this, compared another design with...

10.1109/tmech.2018.2811730 article EN IEEE/ASME Transactions on Mechatronics 2018-03-08

In this report, our approach to tackling the task of ActivityNet 2018 Kinetics-600 challenge is described in detail. Though spatial-temporal modelling methods, which adopt either such end-to-end framework as I3D \cite{i3d} or two-stage frameworks (i.e., CNN+RNN), have been proposed existing state-of-the-arts for task, video far from being well solved. challenge, we propose network (StNet) better joint and comprehensively understanding. Besides, given that multi-modal information contained...

10.48550/arxiv.1806.10319 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Wood defect detection has an important influence on the automation of wood industry. In view complexity segmentation, this paper proposes hybrid algorithm genetic and particle swarm optimization algorithm. Firstly, contrast image is enhanced by linear transformation function. Then, applying swarm-genetic respectively for finally morphological processing performed. The result shows that a better more stable effect detecting defects compared with

10.1109/icalip.2016.7846635 article EN 2016-07-01

10.1007/s12193-014-0171-2 article EN Journal on Multimodal User Interfaces 2014-11-25

In existing CNN based detectors, the backbone network is a very important component for basic feature extraction, and performance of detectors highly depends on it. this paper, we aim to achieve better detection by building more powerful from backbones like ResNet ResNeXt. Specifically, propose novel strategy assembling multiple identical composite connections between adjacent backbones, form named Composite Backbone Network (CBNet). way, CBNet iteratively feeds output features previous...

10.48550/arxiv.1909.03625 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Slip is one of the major causes for falls during human walking. Providing external knee assistive torque can potentially help people perform successful slip recovery and avoid serious injuries. In this paper, we present a design characterization wearable robotic device slip-induced fall prevention. The consists set compliant components with impedance feedback control. bench performance evaluations show that capable tracking angle profile walking gait therefore be worn regular without...

10.1016/j.ifacol.2017.08.887 article EN IFAC-PapersOnLine 2017-07-01

Natural human-robot interaction plays an important role in effective nursing services system provided by service robots for the elderly and disabled people. This paper proposed a multimodal "human-robot integration" collaboration system, set up shared interface between human robot. Consequently, Users can naturally communicate retrieve information from collaborative with multimodality(e.g. head gesture, eye gaze) interactive dialogue approach. By this way, making fully understand human's...

10.1109/bibm.2014.6999239 article EN 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2014-11-01

Scene text recognition is very challenging due to the complex background, low resolution, perspective distortion and curved placement, etc. Most of state-of-the-art methods adopt attention-based encoder-decoder framework, usually get suboptimal performance for images misalignment between attention region target character region. In this paper, a novel module, named Gated Cascade Attention Module (GCAM), proposed increase alignment precision in cascade way. Moreover, channel spatial module...

10.1109/icme.2019.00179 article EN 2022 IEEE International Conference on Multimedia and Expo (ICME) 2019-07-01

Feature pyramids are widely exploited by both the state-of-the-art one-stage object detectors (e.g., DSSD, RetinaNet, RefineDet) and two-stage Mask R-CNN, DetNet) to alleviate problem arising from scale variation across instances. Although these with feature achieve encouraging results, they have some limitations due that only simply construct pyramid according inherent multi-scale, pyramidal architecture of backbones which actually designed for classification task. Newly, in this work, we...

10.48550/arxiv.1811.04533 preprint EN other-oa arXiv (Cornell University) 2018-01-01
Coming Soon ...