Jiaming Sun

ORCID: 0000-0003-4053-8510
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Vision and Imaging
  • Robotics and Sensor-Based Localization
  • Advanced Neural Network Applications
  • Computer Graphics and Visualization Techniques
  • Human Pose and Action Recognition
  • Video Surveillance and Tracking Methods
  • Advanced Image and Video Retrieval Techniques
  • 3D Shape Modeling and Analysis
  • Advanced Image Processing Techniques
  • Image Enhancement Techniques
  • Markov Chains and Monte Carlo Methods
  • Traffic Prediction and Management Techniques
  • Transportation Planning and Optimization
  • Hydraulic and Pneumatic Systems
  • Optical measurement and interference techniques
  • Multimodal Machine Learning Applications
  • Remote Sensing and LiDAR Applications
  • Stochastic processes and statistical mechanics
  • Robotic Path Planning Algorithms
  • Hand Gesture Recognition Systems
  • Traffic control and management
  • Natural Language Processing Techniques
  • Non-Destructive Testing Techniques
  • Generative Adversarial Networks and Image Synthesis
  • Speech Recognition and Synthesis

China Telecom (China)
2023-2025

China Telecom
2023-2025

Zhejiang University
2020-2024

Dalian Maritime University
2023-2024

Jilin University
2022-2024

Changchun University of Chinese Medicine
2023

University of Electronic Science and Technology of China
2023

Changchun University of Science and Technology
2022

Zhejiang Lab
2022

Hebei University of Technology
2008-2022

We present a novel method for local image feature matching. Instead of performing detection, description, and matching sequentially, we propose to first establish pixel-wise dense matches at coarse level later refine the good fine level. In contrast methods that use cost volume search correspondences, self cross attention layers in Transformer obtain descriptors are conditioned on both images. The global receptive field provided by enables our produce low-texture areas, where detectors...

10.1109/cvpr46437.2021.00881 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

We present a novel framework named NeuralRecon for real-time 3D scene reconstruction from monocular video. Unlike previous methods that estimate single-view depth maps separately on each key-frame and fuse them later, we propose to directly reconstruct local surfaces represented as sparse TSDF volumes video fragment sequentially by neural network. A learning-based fusion module based gated recurrent units is used guide the network features fragments. This de-sign allows capture smoothness...

10.1109/cvpr46437.2021.01534 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

10.1016/j.engappai.2023.105946 article EN Engineering Applications of Artificial Intelligence 2023-02-16

Recent advances in implicit neural representations and differentiable rendering make it possible to simultaneously recover the geometry materials of an object from multi-view RGB images captured under unknown static illumination. Despite promising results achieved, indirect illumination is rarely modeled previous methods, as requires expensive recursive path tracing which makes inverse computationally intractable. In this paper, we propose a novel approach efficiently recovering...

10.1109/cvpr52688.2022.01809 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

We propose a new method named OnePose for object pose estimation. Unlike existing instance-level or category-level methods, does not rely on CAD models and can handle objects in arbitrary categories without instance-or category-specific network training. draws the idea from visual localization only requires simple RGB video scan of to build sparse SfM model object. Then, this is registered query images with generic feature matching network. To mitigate slow runtime we graph attention that...

10.1109/cvpr52688.2022.00670 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

We are witnessing an explosion of neural implicit representations in computer vision and graphics. Their applicability has recently expanded beyond tasks such as shape generation image-based rendering to the fundamental problem 3D reconstruction. However, existing methods typically assume constrained environments with constant illumination captured by a small set roughly uniformly distributed cameras. introduce new method that enables efficient accurate surface reconstruction from Internet...

10.1145/3528233.3530718 preprint EN 2022-07-20

Underwater image enhancement presents a significant challenge due to the complex and diverse underwater environments that result in severe degradation phenomena such as light absorption, scattering, color distortion. More importantly, obtaining paired training data for these scenarios is challenging task, which further hinders generalization performance of models. To address issues, we propose novel approach, Hybrid Contrastive Learning Regularization (HCLR-Net). Our method built upon...

10.1007/s11263-024-01987-y article EN cc-by International Journal of Computer Vision 2024-02-04

In this paper, we propose a novel system named Disp R-CNN for 3D object detection from stereo images. Many recent works solve problem by first recovering point cloud with disparity estimation and then apply detector. The map is computed the entire image, which costly fails to leverage category-specific prior. contrast, design an instance network (iDispNet) that predicts only pixels on objects of interest learns shape prior more accurate estimation. To address challenge scarcity annotation in...

10.1109/cvpr42600.2020.01056 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

We propose a new method for object pose estimation without CAD models. The previous feature-matching-based OnePose has shown promising results under one-shot setting which eliminates the need models or object-specific training. However, relies on detecting repeatable image keypoints and is thus prone to failure low-textured objects. keypoint-free pipeline remove keypoint detection. Built upon detector-free feature matching LoFTR, we devise SfM reconstruct semi-dense point-cloud model object....

10.48550/arxiv.2301.07673 preprint EN cc-by arXiv (Cornell University) 2023-01-01

10.1109/cvpr52733.2024.01893 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

10.1109/cvpr52733.2024.02040 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

We present a novel framework named NeuralRecon for real-time 3D scene reconstruction from monocular video. Unlike previous methods that estimate single-view depth maps separately on each key-frame and fuse them later, we propose to directly reconstruct local surfaces represented as sparse TSDF volumes video fragment sequentially by neural network. A learning-based fusion module based gated recurrent units is used guide the network features fragments. This design allows capture smoothness...

10.1109/tpami.2024.3393141 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2024-04-24

10.1109/cvpr52733.2024.00100 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

In this paper, we propose a novel system named Disp R-CNN for 3D object detection from stereo images. Many recent works solve problem by first recovering point clouds with disparity estimation and then apply detector. The map is computed the entire image, which costly fails to leverage category-specific prior. contrast, design an instance network (iDispNet) that predicts only pixels on objects of interest learns shape prior more accurate estimation. To address challenge scarcity annotation...

10.1109/tpami.2021.3076678 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2021-01-01

We present a novel method for local image feature matching. Instead of performing detection, description, and matching sequentially, we propose to first establish pixel-wise dense matches at coarse level later refine the good fine level. In contrast methods that use cost volume search correspondences, self cross attention layers in Transformer obtain descriptors are conditioned on both images. The global receptive field provided by enables our produce low-texture areas, where detectors...

10.48550/arxiv.2104.00680 preprint EN cc-by arXiv (Cornell University) 2021-01-01

This paper presents an approach that reconstructs a hand-held object from monocular video. In contrast to many recent methods directly predict geometry by trained network, the proposed does not require any learned prior about and is able recover more accurate detailed geometry. The key idea hand motion naturally provides multiple views of can be reliably estimated pose tracker. Then, recovered solving multi-view reconstruction problem. We devise implicit neural representation-based method...

10.1145/3550469.3555401 article EN 2022-11-29

We present a novel method for local image feature matching. Instead of performing detection, description, and matching sequentially, we propose to first establish pixel-wise dense matches at coarse level later refine the good fine level. In contrast methods that use cost volume search correspondences, self cross attention layers in Transformer obtain descriptors are conditioned on both images. The global receptive field provided by enables our produce low-texture areas, where detectors...

10.1109/tpami.2022.3223530 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2022-11-21

This paper targets high-fidelity and real-time view synthesis of dynamic 3D scenes at 4K resolution. Recently, some methods on have shown impressive rendering quality. However, their speed is still limited when high-resolution images. To overcome this problem, we propose 4K4D, a 4D point cloud representation that supports hardware rasterization enables unprecedented speed. Our built feature grid so the points are naturally regularized can be robustly optimized. In addition, design novel...

10.48550/arxiv.2310.11448 preprint EN other-oa arXiv (Cornell University) 2023-01-01
Coming Soon ...