NFDI4DS | UHH-SEMS - Publication Details

LoFTR: Detector-Free Local Feature Matching with Transformers

OPENALEX - Publications

Jiaming Sun Zehong Shen Yuang Wang Hujun Bao Xiaowei Zhou

We present a novel method for local image feature matching. Instead of performing image feature detection, description, and matching sequentially, we propose to first establish pixel-wise dense matches at a coarse level and later refine the good matches at a fine level. In contrast to dense methods that use a cost volume to search correspondences, we use self and cross attention layers in Transformer to obtain feature descriptors that are conditioned on both images. The global receptive field provided by Transformer enables our method to produce dense matches in low-texture areas, where feature detectors...

10.1109/cvpr46437.2021.00881 article EN 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video

Jiaming Sun Yiming Xie Linghao Chen Xiaowei Zhou Hujun Bao

We present a novel framework named NeuralRecon for real-time 3D scene reconstruction from a monocular video. Unlike previous methods that estimate single-view depth maps separately on each key-frame and fuse them later, we propose to directly reconstruct local surfaces represented as sparse TSDF volumes for each video fragment sequentially by a neural network. A learning-based TSDF fusion module based on gated recurrent units is used to guide the network to fuse features from previous fragments. This design allows the network to capture smoothness...

10.1109/cvpr46437.2021.01534 article EN 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

Multi-view underwater image enhancement method via embedded fusion mechanism

Jingchun Zhou Jiaming Sun Weishi Zhang Zifan Lin

10.1016/j.engappai.2023.105946 article EN Engineering Applications of Artificial Intelligence 2023-02-16

Modeling Indirect Illumination for Inverse Rendering

Yuanqing Zhang Jiaming Sun Xingyi He Huan Fu Rongfei Jia and 1 more

Recent advances in implicit neural representations and differentiable rendering make it possible to simultaneously recover the geometry and materials of an object from multi-view RGB images captured under unknown static illumination. Despite the promising results achieved, indirect illumination is rarely modeled in previous methods, as it requires expensive recursive path tracing which makes the inverse rendering computationally intractable. In this paper, we propose a novel approach to efficiently recovering...

10.1109/cvpr52688.2022.01809 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

OnePose: One-Shot Object Pose Estimation without CAD Models

Jiaming Sun Zihao Wang Siyu Zhang Xingyi He Hongcheng Zhao and 2 more

We propose a new method named OnePose for object pose estimation. Unlike existing instance-level or category-level methods, OnePose does not rely on CAD models and can handle objects in arbitrary categories without instance- or category-specific network training. OnePose draws the idea from visual localization and only requires a simple RGB video scan of the object to build a sparse SfM model of the object. Then, this model is registered to new query images with a generic feature matching network. To mitigate the slow runtime of existing visual localization methods, we propose a new graph attention network that...

10.1109/cvpr52688.2022.00670 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Neural 3D Reconstruction in the Wild

Jiaming Sun Xi Chen Qianqian Wang Zhengqi Li Hadar Averbuch‐Elor and 2 more

We are witnessing an explosion of neural implicit representations in computer vision and graphics. Their applicability has recently expanded beyond tasks such as shape generation and image-based rendering to the fundamental problem of image-based 3D reconstruction. However, existing methods typically assume constrained 3D environments with constant illumination captured by a small set of roughly uniformly distributed cameras. We introduce a new method that enables efficient and accurate surface reconstruction from Internet...

10.1145/3528233.3530718 preprint EN 2022-07-20

HCLR-Net: Hybrid Contrastive Learning Regularization with Locally Randomized Perturbation for Underwater Image Enhancement

Jingchun Zhou Jiaming Sun Chongyi Li Qiuping Jiang Man Zhou and 3 more

Underwater image enhancement presents a significant challenge due to the complex and diverse underwater environments that result in severe degradation phenomena such as light absorption, scattering, and color distortion. More importantly, obtaining paired training data for these scenarios is a challenging task, which further hinders the generalization performance of enhancement models. To address these issues, we propose a novel approach, Hybrid Contrastive Learning Regularization (HCLR-Net). Our method is built upon...

10.1007/s11263-024-01987-y article EN cc-by International Journal of Computer Vision 2024-02-04

Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation

Jiaming Sun Linghao Chen Yiming Xie Siyu Zhang Qinhong Jiang and 2 more

In this paper, we propose a novel system named Disp R-CNN for 3D object detection from stereo images. Many recent works solve this problem by first recovering a point cloud with disparity estimation and then apply a 3D detector. The disparity map is computed for the entire image, which is costly and fails to leverage category-specific prior. In contrast, we design an instance disparity estimation network (iDispNet) that predicts disparity only for pixels on objects of interest and learns a category-specific shape prior for more accurate disparity estimation. To address the challenge from scarcity of disparity annotation in...

10.1109/cvpr42600.2020.01056 article EN 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models

Xingyi He Jiaming Sun Yuang Wang Di Huang Hujun Bao and 1 more

We propose a new method for object pose estimation without CAD models. The previous feature-matching-based method OnePose has shown promising results under a one-shot setting which eliminates the need for CAD models or object-specific training. However, OnePose relies on detecting repeatable image keypoints and is thus prone to failure on low-textured objects. We propose a keypoint-free pose estimation pipeline to remove the need for repeatable keypoint detection. Built upon the detector-free feature matching method LoFTR, we devise a new keypoint-free SfM method to reconstruct a semi-dense point-cloud model for the object....

10.48550/arxiv.2301.07673 preprint EN cc-by arXiv (Cornell University) 2023-01-01

4K4D: Real-Time 4D View Synthesis at 4K Resolution

Zhen Xu Sida Peng Haotong Lin Guangzhao He Jiaming Sun and 3 more

10.1109/cvpr52733.2024.01893 article EN 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Detector-Free Structure from Motion

Xingyi He Jiaming Sun Yifan Wang Sida Peng Qixing Huang and 2 more

10.1109/cvpr52733.2024.02040 article EN 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

X-ray weld defect detection based on AF-RCNN

Weipeng Liu Shengqi Shan Haiyong Chen Rui Wang Jiaming Sun and 1 more

10.1007/s40194-022-01281-w article EN Welding in the World 2022-03-16

NeuralRecon: Real-Time Coherent 3D Scene Reconstruction from Monocular Video

Xi Chen Jiaming Sun Yiming Xie Hujun Bao Xiaowei Zhou

We present a novel framework named NeuralRecon for real-time 3D scene reconstruction from a monocular video. Unlike previous methods that estimate single-view depth maps separately on each key-frame and fuse them later, we propose to directly reconstruct local surfaces represented as sparse TSDF volumes for each video fragment sequentially by a neural network. A learning-based TSDF fusion module based on gated recurrent units is used to guide the network to fuse features from previous fragments. This design allows the network to capture smoothness...

10.1109/tpami.2024.3393141 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2024-04-24

TrafficWise: Leveraging World Models for Generalized and Interpretable Traffic Control

Junjun Hu Xingyuan Dai Xiaojun Li Chenglong Ye Yan Zhang and 4 more

10.1109/mits.2025.3543446 article EN IEEE Intelligent Transportation Systems Magazine 2025-01-01

Scaffold Internal Network Bioprinting for Vascularized Tissue Regeneration

Lai Suo Yaqi Guo Shan Mou Yichao Jin Dandan Zou and 2 more

10.1016/j.compositesb.2025.112401 article EN Composites Part B Engineering 2025-03-01

Correction: HCLR-Net: Hybrid Contrastive Learning Regularization with Locally Randomized Perturbation for Underwater Image Enhancement

Jingchun Zhou Jiaming Sun Chongyi Li Qiuping Jiang Man Zhou and 3 more

10.1007/s11263-024-02131-6 article EN cc-by International Journal of Computer Vision 2024-06-04

Relightable and Animatable Neural Avatar from Sparse-View Video

Zhen Xu Sida Peng Geng Chen Linzhan Mou Zihan Yan and 3 more

10.1109/cvpr52733.2024.00100 article EN 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Shape Prior Guided Instance Disparity Estimation for 3D Object Detection

Ling-Hao Chen Jiaming Sun Yiming Xie Siyu Zhang Qing Shuai and 4 more

In this paper, we propose a novel system named Disp R-CNN for 3D object detection from stereo images. Many recent works solve this problem by first recovering point clouds with disparity estimation and then apply a 3D detector. The disparity map is computed for the entire image, which is costly and fails to leverage category-specific prior. In contrast, we design an instance disparity estimation network (iDispNet) that predicts disparity only for pixels on objects of interest and learns a category-specific shape prior for more accurate disparity estimation. To address the challenge from scarcity of disparity annotation...

10.1109/tpami.2021.3076678 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2021-01-01

LoFTR: Detector-Free Local Feature Matching with Transformers

Jiaming Sun Zehong Shen Yuang Wang Hujun Bao Xiaowei Zhou

We present a novel method for local image feature matching. Instead of performing image feature detection, description, and matching sequentially, we propose to first establish pixel-wise dense matches at a coarse level and later refine the good matches at a fine level. In contrast to dense methods that use a cost volume to search correspondences, we use self and cross attention layers in Transformer to obtain feature descriptors that are conditioned on both images. The global receptive field provided by Transformer enables our method to produce dense matches in low-texture areas, where feature detectors...

10.48550/arxiv.2104.00680 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Reconstructing Hand-Held Objects from Monocular Video

Di Huang X. Ji Xingyi He Jiaming Sun Tong He and 3 more

This paper presents an approach that reconstructs a hand-held object from a monocular video. In contrast to many recent methods that directly predict object geometry by a trained network, the proposed approach does not require any learned prior about the object and is able to recover more accurate and detailed object geometry. The key idea is that the hand motion naturally provides multiple views of the object and the motion can be reliably estimated by a hand pose tracker. Then, the object geometry can be recovered by solving a multi-view reconstruction problem. We devise an implicit neural representation-based method...

10.1145/3550469.3555401 article EN 2022-11-29

Semi-Dense Feature Matching With Transformers and its Applications in Multiple-View Geometry

Zehong Shen Jiaming Sun Yuang Wang Xingyi He Hujun Bao and 1 more

We present a novel method for local image feature matching. Instead of performing image feature detection, description, and matching sequentially, we propose to first establish pixel-wise dense matches at a coarse level and later refine the good matches at a fine level. In contrast to dense methods that use a cost volume to search correspondences, we use self and cross attention layers in Transformer to obtain feature descriptors that are conditioned on both images. The global receptive field provided by Transformer enables our method to produce dense matches in low-texture areas, where feature detectors...

10.1109/tpami.2022.3223530 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2022-11-21

4K4D: Real-Time 4D View Synthesis at 4K Resolution

Zhen Xu Sida Peng Haotong Lin Guangzhao He Jiaming Sun and 3 more

This paper targets high-fidelity and real-time view synthesis of dynamic 3D scenes at 4K resolution. Recently, some methods on dynamic view synthesis have shown impressive rendering quality. However, their speed is still limited when rendering high-resolution images. To overcome this problem, we propose 4K4D, a 4D point cloud representation that supports hardware rasterization and enables unprecedented rendering speed. Our representation is built on a 4D feature grid so that the points are naturally regularized and can be robustly optimized. In addition, we design a novel...

10.48550/arxiv.2310.11448 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Fuzzy surfacelet neural network evaluation model optimized by adaptive dragonfly algorithm for pipeline network integrity management

Jiaming Sun Bin Zhao Diankui Gao Lizhi Xu

10.1016/j.asoc.2021.107862 article EN Applied Soft Computing 2021-09-03