Qingnan Fan

ORCID: 0000-0003-1249-2826
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Image Enhancement Techniques
  • Advanced Image Processing Techniques
  • Advanced Vision and Imaging
  • Multimodal Machine Learning Applications
  • Image and Signal Denoising Methods
  • Advanced Neural Network Applications
  • Robotics and Sensor-Based Localization
  • Robot Manipulation and Learning
  • Human Pose and Action Recognition
  • Advanced Image Fusion Techniques
  • Domain Adaptation and Few-Shot Learning
  • Computer Graphics and Visualization Techniques
  • Image Processing Techniques and Applications
  • 3D Surveying and Cultural Heritage
  • 3D Shape Modeling and Analysis
  • Advanced Image and Video Retrieval Techniques
  • Generative Adversarial Networks and Image Synthesis
  • COVID-19 diagnosis using AI
  • Medical Image Segmentation Techniques
  • Robotic Path Planning Algorithms
  • Human Motion and Animation
  • AI in cancer detection
  • Sparse and Compressive Sensing Techniques
  • Music and Audio Processing
  • Lung Cancer Diagnosis and Treatment

Tencent (China)
2021-2023

Beijing University of Civil Engineering and Architecture
2022

Bellevue Hospital Center
2022

Soochow University
2021

Stanford University
2019-2021

Shandong University
2014-2019

Shandong University of Science and Technology
2019

Image dehazing aims to recover the uncorrupted content from a hazy image. Instead of leveraging traditional low-level or handcrafted image priors as restoration constraints, e.g., dark channels and increased contrast, we propose an end-to-end gated context aggregation network directly restore final haze-free In this network, adopt latest smoothed dilation technique help remove gridding artifacts caused by widely-used dilated convolution with negligible extra parameters, leverage sub-network...

10.1109/wacv.2019.00151 article EN 2019-01-01

This paper proposes a deep neural network structure that exploits edge information in addressing representative low-level vision tasks such as layer separation and image filtering. Unlike most other learning strategies applied this context, our approach tackles these challenging problems by estimating edges reconstructing images using only cascaded convolutional layers arranged no handcrafted or application-specific image-processing components are required. We apply the resulting...

10.1109/iccv.2017.351 article EN 2017-10-01

The emergence of low-cost 3D printers steers the investigation new geometric problems that control quality fabricated object. In this paper, we present a method to reduce material cost and weight given object while providing durable printed model is resistant impact external forces. We introduce hollowing optimization algorithm based on concept honeycomb-cells structure. Honeycombs structures are known be minimal strength in tension. utilize Voronoi diagram compute irregular honeycomb-like...

10.1145/2601097.2601168 article EN ACM Transactions on Graphics 2014-07-22

We introduce JumpCut, a new mask transfer and interpolation method for interactive video cutout. Given source frame which foreground is already available, we compute an estimate of the at another, typically non-successive, target frame. Observing that background regions exhibit different motions, leverage these differences by computing two separate nearest-neighbor fields (split-NNF) from to These NNFs are then used jointly predict coherent labeling pixels in The same split-NNF also aid...

10.1145/2816795.2818105 article EN ACM Transactions on Graphics 2015-10-27

Image smoothing represents a fundamental component of many disparate computer vision and graphics applications. In this paper, we present unified unsupervised (label-free) learning framework that facilitates generating flexible high-quality effects by directly from data using deep convolutional neural networks (CNNs). The heart the design is training signal as novel energy function includes an edge-preserving regularizer which helps maintain important yet potentially vulnerable image...

10.1145/3272127.3275081 article EN ACM Transactions on Graphics 2018-11-28

While invaluable for many computer vision applications, decomposing a natural image into intrinsic reflectance and shading layers represents challenging, underdetermined inverse problem. As opposed to strict reliance on conventional optimization or filtering solutions with strong prior assumptions, deep learning based approaches have also been proposed compute decompositions when granted access sufficient labeled training data. The downside is that current data sources are quite limited,...

10.1109/cvpr.2018.00932 preprint EN 2018-06-01

In this work, we tackle the problem of category-level online pose tracking objects from point cloud sequences. For first time, propose a unified framework that can handle 9DoF for novel rigid object instances as well per-part articulated known categories. Here pose, comprising 6D and 3D size, is equivalent to amodal bounding box representation with free pose. Given depth at current frame estimated last frame, our end-to-end pipeline learns accurately update Our composed three modules: 1)...

10.1109/iccv48922.2021.01296 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

Many different deep networks have been used to approximate, accelerate or improve traditional image operators. Among these operators, many contain parameters which need be tweaked obtain the satisfactory results, we refer as "parameterized operators". However, most existing trained for operators are only designed one specific parameter configuration, does not meet needs of real scenarios that usually require flexible settings. To overcome this limitation, propose a new decoupled learning...

10.1109/tpami.2019.2925793 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2019-06-28

This paper proposes a method for representation learning of multimodal data using contrastive losses. A traditional approach is to contrast different modalities learn the information shared among them. However, that could fail complementary synergies between might be useful downstream tasks. Another concatenate all into tuple and then positive negative correspondences. consider only stronger while ignoring weaker ones. To address these issues, we propose novel objective, TupleInfoNCE. It...

10.1109/iccv48922.2021.00079 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

We present C · ASE, an efficient and effective framework that learns Conditional Adversarial Skill Embeddings for physics-based characters. ASE enables the physically simulated character to learn a diverse repertoire of skills while providing controllability in form direct manipulation be performed. This is achieved by dividing heterogeneous skill motions into distinct subsets containing homogeneous samples training low-level conditional model behavior distribution. The skill-conditioned...

10.1145/3610548.3618205 preprint EN 2023-12-10

Object goal navigation (ObjectNav) in unseen environments is a fundamental task for Embodied AI. Agents existing works learn ObjectNav policies based on 2D maps, scene graphs, or image sequences. Considering this happens 3D space, 3D-aware agent can advance its capability via learning from fine-grained spatial information. However, leveraging representation be prohibitively unpractical policy floor-level task, due to low sample efficiency and expensive computational cost. In work, we propose...

10.1109/cvpr52729.2023.00645 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Visual prompt, a pair of before-and-after edited images, can convey indescribable imagery transformations and prosper in image editing. However, current visual prompt methods rely on pretrained text-guided image-to-image generative model that requires triplet text, before, after images for retraining over text-to-image model. Such crafting triplets processes limit the scalability generalization In this paper, we present framework based any single without reliance explicit thus enhancing...

10.48550/arxiv.2501.03495 preprint EN arXiv (Cornell University) 2025-01-06

Self-supervised representation learning is a critical problem in computer vision, as it provides way to pretrain feature extractors on large unlabeled datasets that can be used an initialization for more efficient and effective training downstream tasks. A promising approach use contrastive learn latent space where features are close similar data samples far apart dissimilar ones. This has demonstrated tremendous success pretraining both image point cloud extractors, but been barely...

10.48550/arxiv.2012.13089 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Optical flow in heavy rainy scenes is challenging due to the presence of both rain steaks and veiling effect, which break existing optical constraints. Concerning this, we propose a deep-learning based method designed handle rain. We introduce feature multiplier our network that transforms features an image affected by effect into are less it, call veiling-invariant features. establish new mapping operation space produce streak-invariant The on pyramid structure input images, basic idea...

10.1109/iccv.2019.00740 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Autonomous part assembly is a challenging yet crucial task in 3D computer vision and robotics. Analogous to buying an IKEA furniture, given set of parts that can assemble single shape, intelligent agent needs perceive the geometry, reason propose pose estimations for input parts, finally call robotic planning control routines actuation. In this paper, we focus on estimation subproblem from side involving geometric relational reasoning over geometry. Essentially, generative predict 6-DoF...

10.48550/arxiv.2006.07793 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Point cloud is an important 3D data representation widely used in many essential applications. Leveraging deep neural networks, recent works have shown great success processing point clouds. However, those networks are vulnerable to various adversarial attacks, which can be summarized as two primary types: perturbation that affects local distribution, and surface distortion causes dramatic changes geometry. In this paper, we simultaneously address both the aforementioned attacks by learning...

10.48550/arxiv.2010.05272 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Localizing the camera in a known indoor environment is key building block for scene mapping, robot navigation, AR, etc. Recent advances estimate pose via optimization over 2D/3D-3D correspondences established between coordinates 2D/3D space and 3D world space. Such mapping estimated with either convolution neural network or decision tree using only static input image sequence, which makes these approaches vulnerable to dynamic environments that are quite common yet challenging real world. To...

10.1109/cvpr46437.2021.00844 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

We study the problem of multi-robot active mapping, which aims for complete scene map construction in minimum time steps. The key to this lies goal position estimation enable more efficient robot movements. Previous approaches either choose frontier as via a myopic solution that hinders efficiency, or maximize long-term value reinforcement learning directly regress position, but does not guarantee construction. In paper, we propose novel algorithm, namely NeuralCoMapping, takes advantage...

10.1109/cvpr52688.2022.01442 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Perceiving and manipulating 3D articulated objects (e.g., cabinets, doors) in human environments is an important yet challenging task for future home-assistant robots. The space of exceptionally rich their myriad semantic categories, diverse shape geometry, complicated part functionality. Previous works mostly abstract kinematic structure with estimated joint parameters poses as the visual representations objects. In this paper, we propose object-centric actionable priors a novel...

10.48550/arxiv.2106.14440 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Traditional image processing operators often provide some control parameters to tweak the final results. Recently, different convolutional neural networks have been used approximate or improve these operators. However, in those methods, one single model can only handle operator of a specific parameter value and does not support tuning. In this paper, we propose new plugin module, "Adaptive Filterbank Pyramid", which be inserted into backbone network multiple continuous Our module explicitly...

10.1109/tip.2020.3009844 article EN IEEE Transactions on Image Processing 2020-01-01
Coming Soon ...