Yonggang Qi

ORCID: 0000-0001-8280-3541
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Image and Video Retrieval Techniques
  • Advanced Vision and Imaging
  • Computer Graphics and Visualization Techniques
  • Multimodal Machine Learning Applications
  • Image Retrieval and Classification Techniques
  • Robotics and Sensor-Based Localization
  • Domain Adaptation and Few-Shot Learning
  • 3D Shape Modeling and Analysis
  • Human Pose and Action Recognition
  • Advanced Neural Network Applications
  • Generative Adversarial Networks and Image Synthesis
  • Visual Attention and Saliency Detection
  • Hand Gesture Recognition Systems
  • Video Analysis and Summarization
  • Human Motion and Animation
  • Video Surveillance and Tracking Methods
  • Interactive and Immersive Displays
  • Emotion and Mood Recognition
  • Synthesis and properties of polymers
  • Augmented Reality Applications
  • Organic Electronics and Photovoltaics
  • Handwritten Text Recognition Techniques
  • Vehicle License Plate Recognition
  • Conducting polymers and applications
  • Text and Document Classification Technologies

Beijing University of Posts and Telecommunications
2015-2025

State Key Laboratory of Polymer Physics and Chemistry
2023-2024

Chinese Academy of Sciences
2023-2024

Changchun Institute of Applied Chemistry
2023-2024

China Special Equipment Inspection and Research Institute
2022

MIT Lincoln Laboratory
2004

Sketch-based image retrieval (SBIR) is a challenging task due to the ambiguity inherent in sketches when compared with photos. In this paper, we propose novel convolutional neural network based on Siamese for SBIR. The main idea pull output feature vectors closer input sketch-image pairs that are labeled as similar, and push them away if irrelevant. This achieved by jointly tuning two networks which linked one loss function. Experimental results Flickr15K demonstrate proposed method offers...

10.1109/icip.2016.7532801 article EN 2022 IEEE International Conference on Image Processing (ICIP) 2016-08-17

We propose a perceptual grouping framework that organizes image edges into meaningful structures and demonstrate its usefulness on various computer vision tasks. Our grouper formulates edge as graph partition problem, where learning to rank method is developed encode probabilities of candidate pairs. In particular, RankSVM employed for the first time combine multiple Gestalt principles cue grouping. Afterwards, an based object proposal measure introduced yields proposals comparable...

10.1109/cvpr.2015.7298795 article EN 2015-06-01

This paper studies the problem of zero-short sketch-based image retrieval (ZS-SBIR), however with two significant differentiators to prior art (i) we tackle all variants (inter-category, intracategory, and cross datasets) ZS-SBIR just one network ("everything"), (ii) would really like understand how this sketch-photo matching operates ("explainable"). Our key innovation lies realization that such a cross-modal could be reduced comparisons groups local patches - akin seasoned "bag-of-words"...

10.1109/cvpr52729.2023.02236 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

We provide a new fully automatic framework to analyze facial action units, the fundamental building blocks of expression enumerated in Paul Ekman's coding system (FACS). The units examined here include upper muscle movements such as inner eyebrow raise, eye widening, and so forth, which combine form expressions. Although prior methods have obtained high recognition rates for recognizing these either use manually preprocessed image sequences or require human specification features; thus, they...

10.1109/amfg.2003.1240843 article EN 2004-04-23

In this paper, for the first time, we investigate problem of generating 3D shapes from professional 2D sketches via deep learning. We target done by artists, as these are likely to contain more details than ones produced novices, and thus reconstruction such poses a higher demand on level detail in reconstructed models. This is importantly different previous work, where training testing was conducted either synthetic or novices. Novices often depict that physically unrealistic, while models...

10.1109/tcsvt.2020.3040900 article EN IEEE Transactions on Circuits and Systems for Video Technology 2020-11-26

10.1109/icassp49660.2025.10889388 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025-03-12

Despite the rapid advancements in text-to-image (T2I) synthesis, enabling precise visual control remains a significant challenge. Existing works attempted to incorporate multi-facet controls (text and sketch), aiming enhance creative over generated images. However, our pilot study reveals that expressive power of humans far surpasses capabilities current methods. Users desire more versatile approach can accommodate their diverse intents, ranging from controlling individual subjects...

10.1609/aaai.v39i3.32240 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

We investigate the problem of stroke-level sketch segmentation, which is to train machines assign strokes with semantic part labels given a input sketch. Solving segmentation opens door for fine-grained interpretation, can benefit many novel sketch-based applications, including recognition and image retrieval. In this paper, we treat as seqence-to-sequence generation problem, reccurent nueral networks (RNN)-based model SketchSegNet presented translate sequence into thier labels. addition,...

10.1109/mlsp.2018.8516988 article EN 2018-09-01

In this paper we study, for the first time, problem of fine-grained sketch-based 3D shape retrieval. We advocate use sketches as a input modality to retrieve shapes at instance-level - e.g., given sketch chair, set out specific chair from gallery all chairs. Fine-grained retrieval (FG-SBSR) has not been possible till now due lack datasets that exhibit one-to-one sketch-3D correspondences. The key contribution is two new datasets, consisting total 4,680 pairings object categories. Even with...

10.1109/tip.2021.3118975 article EN IEEE Transactions on Image Processing 2021-01-01

We investigate the problem of stroke-level sketch segmentation, which is to automatically assign strokes a given with semantic labels. Solving segmentation opens door for fine-grained interpretation, can benefit many novel sketch-based applications, including recognition and image retrieval. In this paper, we propose an approach multi-class by considering it as sequence-to-sequence generation problem. Specifically, end-to-end learned network <italic...

10.1109/access.2019.2929804 article EN cc-by IEEE Access 2019-01-01

Sketch is used for rendering the visual world since prehistoric times, and has become ubiquitous nowadays with increasing availability of touchscreens on portable devices. However, how to automatically map images sketches, a problem that profound implications applications such as sketch-based image retrieval, still remains open. In this paper, we propose novel method draws sketch from single natural image. extraction posed within an unified contour grouping framework, where perceptual first...

10.1109/icip.2013.6738056 article EN 2013-09-01

The key challenge in designing a sketch representation lies with handling the abstract and iconic nature of sketches. Existing work predominantly utilizes either, (i) pixelative format that treats sketches as natural images employing off-the-shelf CNN-based networks, or (ii) an elaborately designed vector leverages structural information drawing orders using sequential RNN-based methods. While lacks intuitive exploitation cues, are absent most cases limiting their practical usage. Hence,...

10.1109/iccv48922.2021.00099 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

Polymer materials with multiple stimuli-responsive properties have demonstrated many potential and practical applications. By covalently introducing spiropyran (SP1) spirothiopyran (STP) into the polyurethane backbone, photochromic, mechanochromic, thermally discolored polymer been prepared. In this work, we report for first time that white light (violet, blue, green light) above a certain intensity can activate STP to color. Based on discovery, SP1 exhibit reversible three-color changes...

10.1021/acsami.4c10488 article EN ACS Applied Materials & Interfaces 2024-08-02

Humans tend to understand image scene by recognizing visual elements, then conjecturing and inferring based on them, hence are able search relevant images. In this paper, we concern about the problem of complex retrieval reasoning dense captions, which is similar way human perception for searching Specifically, transform into a captioning graph matching issue using structured language descriptions retrieval. Experimental results novel proposed large-scale content-based dataset demonstrate...

10.1109/vcip.2017.8305157 article EN 2017-12-01

User intention understanding from text is an important task in NLP. In this paper, we study the problem of phone-changing prediction. And propose a novel feature extraction method, which selects most representative feature, to represent user's scratch. Then adopt supervised learning approach, that train SVM classifier, for addition, dataset scratches and their corresponding labels are collected real network environment. The experimental results validate effectiveness our proposed approach.

10.1109/splim.2016.7528398 article EN 2016-07-01

Sketches are distinctly different to photos. They highly abstract and exhibit a severe lack of visual cues. Prior works have therefore explored additional traits unique sketches help recognition such as stroke ordering. In this paper, we pioneer in studying the role structure sketches, for task sketch recognition. particular, propose novel graph representation specifically designed which follows inherent hierarchical relationship (segment-stroke-sketch") sketching elements. By conforming...

10.1109/icme46284.2020.9102957 article EN 2022 IEEE International Conference on Multimedia and Expo (ICME) 2020-06-09

10.1007/s11263-022-01623-7 article EN International Journal of Computer Vision 2022-06-07

Traffic sign recognition (TSR) is an indispensable component for vision-based system of self-driving car. Promising results have been achieved which especially benefit from the rapid development deep neural networks recently. However, there are few works focusing on algorithms' performances towards different complex conditions, such as weather and viewpoint variations. In this paper, we propose a new real-world TSR dataset, dataset with several fine-grained conditions fine labeled involving...

10.1109/vcip.2018.8698666 article EN 2018-12-01

Image-to-sketch translation is to learn the mapping between an image and a corresponding human drawn sketch. Machine can be trained mimic drawing process using training set of aligned image-sketch pairs. However, collect such paired data quite expensive or even unavailable for many cases since sketches exhibit various level abstractness preferences. Hence we present approach learning image-to-sketch network via unpaired examples. A network, which translate representation in latent space...

10.1109/vcip47243.2019.8965725 article EN 2019-12-01
Coming Soon ...