Jingbo Wang

ORCID: 0000-0001-9700-6262
Research Areas
  • Advanced Neural Network Applications
  • Human Pose and Action Recognition
  • Advanced Vision and Imaging
  • Advanced Image and Video Retrieval Techniques
  • Generative Adversarial Networks and Image Synthesis
  • Human Motion and Animation
  • 3D Shape Modeling and Analysis
  • Computer Graphics and Visualization Techniques
  • Multimodal Machine Learning Applications
  • Domain Adaptation and Few-Shot Learning
  • Video Surveillance and Tracking Methods
  • Visual Attention and Saliency Detection
  • Face Recognition and Analysis
  • Robotics and Sensor-Based Localization
  • Hand Gesture Recognition Systems
  • Industrial Vision Systems and Defect Detection
  • Statistical Methods and Inference
  • Automated Road and Building Extraction
  • Statistical Methods and Bayesian Inference
  • Cloud Computing and Remote Desktop Technologies
  • Banking Systems and Strategies
  • Technology and Security Systems
  • Geophysical Methods and Applications
  • Ear Surgery and Otitis Media
  • CCD and CMOS Imaging Sensors

Shanxi Eye Hospital
2025

Shanxi Medical University
2025

Beijing Academy of Quantum Information Sciences
2025

Tibet University
2024

Tongji University
2024

Xi’an University of Posts and Telecommunications
2024

Hainan University
2024

Shanghai Artificial Intelligence Laboratory
2024

Chinese University of Hong Kong
2018-2023

University of Hong Kong
2023

Most existing methods of semantic segmentation still suffer from two aspects of challenges: intra-class inconsistency and inter-class indistinction. To tackle these two problems, we propose a Discriminative Feature Network (DFN), which contains two sub-networks: Smooth Network and Border Network. Specifically, to handle the intra-class inconsistency problem, we specially design a Smooth Network with Channel Attention Block and global average pooling to select the more discriminative features. Furthermore, we propose a Border Network to make the bilateral features of boundary distinguishable with deep...

10.1109/cvpr.2018.00199 article EN 2018-06-01
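
The abstract above mentions a Channel Attention Block that uses global average pooling to select more discriminative features. The following is a minimal sketch of that general idea only, not the authors' implementation: each channel is pooled to a single number, passed through a sigmoid gate, and the channel is rescaled by that gate. The function name `channel_attention` and the per-channel `weights` and `biases` are hypothetical stand-ins for the learned layers a real block would contain.

```python
import math


def channel_attention(features, weights, biases):
    """Rescale each channel by a gate derived from its global average.

    features: list of C channels, each an H x W list of lists of floats.
    weights, biases: hypothetical per-channel scalars (assumption) that
    stand in for the learned layers of a real attention block.
    Returns (gated_features, gates).
    """
    gated, gates = [], []
    for channel, w, b in zip(features, weights, biases):
        values = [v for row in channel for v in row]
        pooled = sum(values) / len(values)  # global average pooling
        gate = 1.0 / (1.0 + math.exp(-(w * pooled + b)))  # sigmoid in (0, 1)
        gates.append(gate)
        gated.append([[v * gate for v in row] for row in channel])
    return gated, gates


if __name__ == "__main__":
    feats = [
        [[1.0, 1.0], [1.0, 1.0]],      # channel with a high mean response
        [[-1.0, -1.0], [-1.0, -1.0]],  # channel with a low mean response
    ]
    _, gates = channel_attention(feats, weights=[1.0, 1.0], biases=[0.0, 0.0])
    print(gates)
```

With these toy inputs the first channel receives the larger gate, so its features are emphasized relative to the second channel.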

Recent works have widely explored the contextual dependencies to achieve more accurate segmentation results. However, most approaches rarely distinguish different types of contextual dependencies, which may pollute the scene understanding. In this work, we directly supervise the feature aggregation to distinguish the intra-class and inter-class context clearly. Specifically, we develop a Context Prior with the supervision of the Affinity Loss. Given an input image and corresponding ground truth, Affinity Loss constructs an ideal affinity map to supervise the learning of Context Prior....

10.1109/cvpr42600.2020.01243 article EN 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01
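
The abstract above says an ideal affinity map is constructed from the ground truth. One plausible reading, sketched here as an assumption rather than the paper's exact procedure, is a binary matrix whose entry (i, j) is 1 when pixels i and j carry the same class label and 0 otherwise. The function name `ideal_affinity_map` is hypothetical, and the label map is assumed to be already at the resolution at which affinities are compared.

```python
def ideal_affinity_map(labels):
    """Build a binary pixel-to-pixel affinity matrix from a label map.

    labels: H x W list of lists of integer class ids (assumed to be at
    the resolution where affinities are compared).
    Returns an (H*W) x (H*W) list of lists: 1 if two pixels share a
    class (intra-class), 0 otherwise (inter-class).
    """
    flat = [c for row in labels for c in row]  # row-major flattening
    return [[1 if a == b else 0 for b in flat] for a in flat]


if __name__ == "__main__":
    ground_truth = [
        [0, 0],
        [1, 0],
    ]
    for row in ideal_affinity_map(ground_truth):
        print(row)
```

Each row of the result marks which pixels belong to the same class as the row's pixel; the matrix is symmetric with ones on the diagonal.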

Panoptic segmentation, which needs to assign a category label to each pixel and segment each object instance simultaneously, is a challenging topic. Traditionally, the existing approaches utilize two independent models without sharing features, which makes the pipeline inefficient to implement. In addition, a heuristic method is usually employed to merge the results. However, the overlapping relationship between object instances is difficult to determine without sufficient context information during the merging process. To address the problems, we propose...

10.1109/cvpr.2019.00633 article EN 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Semantic segmentation requires both rich spatial information and sizeable receptive field. However, modern approaches usually compromise spatial resolution to achieve real-time inference speed, which leads to poor performance. In this paper, we address this dilemma with a novel Bilateral Segmentation Network (BiSeNet). We first design a Spatial Path with a small stride to preserve the spatial information and generate high-resolution features. Meanwhile, a Context Path with a fast downsampling strategy is employed to obtain sufficient receptive field. On top of the two paths,...

10.48550/arxiv.1808.00897 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Most existing methods of semantic segmentation still suffer from two aspects of challenges: intra-class inconsistency and inter-class indistinction. To tackle these two problems, we propose a Discriminative Feature Network (DFN), which contains two sub-networks: Smooth Network and Border Network. Specifically, to handle the intra-class inconsistency problem, we specially design a Smooth Network with Channel Attention Block and global average pooling to select the more discriminative features. Furthermore, we propose a Border Network to make the bilateral features of boundary distinguishable with deep...

10.48550/arxiv.1804.09337 preprint EN other-oa arXiv (Cornell University) 2018-01-01

The ability to synthesize long-term human motion sequences in real-world scenes can facilitate numerous applications. Previous approaches for scene-aware motion synthesis are constrained by pre-defined target objects or positions and thus limit the diversity of human-scene interactions for synthesized motions. In this paper, we focus on the problem of synthesizing diverse scene-aware human motions under the guidance of target action sequences. To achieve this, we first decompose the diversity into three aspects, namely interaction (e.g. sitting on different...

10.1109/cvpr52688.2022.01981 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

We revisit human motion synthesis, a task useful in various real-world applications, in this paper. Whereas a number of methods have been developed previously for this task, they are often limited in two aspects: 1) they focus on the poses while leaving the location movement behind, and 2) they ignore the impact of the environment on the human motion. In this paper, we propose a new framework, with the interaction between the scene and the human motion taken into account. Considering the uncertainty of human motion, we formulate this task as a generative task, whose objective is to generate plausible...

10.1109/cvpr46437.2021.01203 article EN 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

Neural Radiance Field (NeRF) has emerged as a compelling method to represent 3D objects and scenes for photo-realistic rendering. However, its implicit representation causes difficulty in manipulating the models like the explicit mesh representation. Several recent advances in NeRF manipulation are usually restricted by a shared renderer network, or suffer from large model size. To circumvent the hurdle, in this paper, we present an explicit neural field representation that enables efficient and convenient manipulation of models. To achieve this goal,...

10.48550/arxiv.2205.14870 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Realistic human-centric rendering plays a key role in both computer vision and computer graphics. Rapid progress has been made in the algorithm aspect over the years, yet existing human-centric rendering datasets and benchmarks are rather impoverished in terms of diversity (e.g., outfit's fabric/material, body's interaction with objects, and motion sequences), which is crucial for rendering effect. Researchers are usually constrained to explore and evaluate a small set of rendering problems on current datasets, while real-world applications require methods to be robust across...

10.1109/iccv51070.2023.01829 article EN 2023 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

Depth information has proven to be a useful cue in the semantic segmentation of RGB-D images for providing a geometric counterpart to the RGB representation. Most existing works simply assume that depth measurements are accurate and well-aligned with the RGB pixels and model the problem as a cross-modal feature fusion to obtain better feature representations to achieve more accurate segmentation. This, however, may not lead to satisfactory results as actual depth data are generally noisy, which might worsen the accuracy as the networks go deeper. In this paper, we...

10.48550/arxiv.2007.09183 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Advanced by transformer architecture, vision foundation models (VFMs) achieve remarkable progress in performance and generalization ability. Segment Anything Model (SAM) is one model that can achieve generalized segmentation. However, most VFMs cannot run in real time, which makes it difficult to transfer them into several products. On the other hand, current real-time segmentation mainly has one purpose, such as semantic segmentation on the driving scene. We argue that diverse outputs are needed for real applications. Thus,...

10.48550/arxiv.2401.10228 preprint EN other-oa arXiv (Cornell University) 2024-01-01

Convolutional neural networks (CNN) have achieved great success in RGB semantic segmentation. RGB-D images provide additional depth information, which can improve segmentation performance. To take full advantage of the 3D geometry relations provided by RGB-D images, in this paper, we propose 2.5D convolution, which mimics one 3D convolution kernel by several masked 2D convolution kernels. Our 2.5D convolution can effectively process spatial relations between pixels in a manner similar to 3D convolution while still sampling pixels on the 2D plane, and thus saves computational cost. And...

10.1109/icip.2019.8803757 article EN 2019 IEEE International Conference on Image Processing (ICIP) 2019-08-26

3D interacting hand reconstruction is essential to facilitate human-machine interaction and human behaviors understanding. Previous works in this field either rely on auxiliary inputs such as depth images or they can only handle a single hand if monocular single RGB images are used. Single-hand methods tend to generate collided hand meshes, when applied to closely interacting hands, since they cannot model the interactions between two hands explicitly. In this paper, we make the first attempt to reconstruct 3D interacting hands from monocular single RGB images. Our method can generate 3D hand meshes with both...

10.1109/3dv53792.2021.00053 article EN 2021 International Conference on 3D Vision (3DV) 2021-12-01

This paper investigates the potential of enhancing Neural Radiance Fields (NeRF) with semantics to expand their applications. Although NeRF has been proven useful in real-world applications like VR and digital creation, the lack of semantics hinders interaction with objects in complex scenes. We propose to imitate the backbone feature of off-the-shelf perception models to achieve zero-shot semantic segmentation with NeRF. Our framework reformulates the segmentation process by directly rendering semantic features and only applying the decoder from perception models. This eliminates...

10.48550/arxiv.2305.16233 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Underwater autonomous path planning is a critical component of intelligent underwater vehicle system design, especially for maritime conservation and monitoring missions. Effective path planning for these robots necessitates considering various constraints related to robot kinematics, optimization objectives, and other pertinent factors. Sample-based strategies have successfully tackled this problem, particularly the rapidly exploring random tree star (RRT*) algorithm. However, conventional path-searching...

10.3390/app14020947 article EN cc-by Applied Sciences 2024-01-22

10.1109/cvpr52733.2024.00642 article EN 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16