Shuhan Shen

ORCID: 0000-0002-8704-7914
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Vision and Imaging
  • Robotics and Sensor-Based Localization
  • 3D Surveying and Cultural Heritage
  • Advanced Image and Video Retrieval Techniques
  • Remote Sensing and LiDAR Applications
  • Optical measurement and interference techniques
  • 3D Shape Modeling and Analysis
  • Video Surveillance and Tracking Methods
  • Image and Object Detection Techniques
  • Computer Graphics and Visualization Techniques
  • Automated Road and Building Extraction
  • Human Pose and Action Recognition
  • Image Retrieval and Classification Techniques
  • Advanced Neural Network Applications
  • Face and Expression Recognition
  • Advanced Numerical Analysis Techniques
  • Indoor and Outdoor Localization Technologies
  • Image Processing Techniques and Applications
  • Remote-Sensing Image Classification
  • Industrial Vision Systems and Defect Detection
  • Gaze Tracking and Assistive Technology
  • Topic Modeling
  • Face recognition and analysis
  • Image Processing and 3D Reconstruction
  • Medical Image Segmentation Techniques

Chinese Academy of Sciences
2015-2024

Institute of Automation
2015-2024

University of Chinese Academy of Sciences
2016-2024

Shandong Institute of Automation
2010-2024

Luoyang Institute of Science and Technology
2024

Mayo Clinic in Arizona
2024

Jinzhou Medical University
2022-2023

Beijing Academy of Artificial Intelligence
2019-2023

Imperial College London
2023

University of California, Davis
2021

In this paper, we propose a depth-map merging based multiple view stereo method for large-scale scenes which takes both accuracy and efficiency into account. the proposed method, an efficient patch-based matching process is used to generate at each image with acceptable errors, followed by refinement enforce consistency over neighboring views. Compared state-of-the-art methods, can reconstruct quite accurate dense point clouds high computational efficiency. Besides, could be easily...

10.1109/tip.2013.2237921 article EN IEEE Transactions on Image Processing 2013-03-14

Batteries have the potential to significantly reduce greenhouse gas emissions from on-road transportation. However, environmental and social impacts of producing lithium-ion batteries, particularly cathode materials, concerns over material criticality are frequently highlighted as barriers widespread electric vehicle adoption. Circular economy strategies, like reuse recycling, can secure regional supplies. To understand for circularity, we undertake a dynamic global flow analysis pack-level...

10.1021/acs.est.0c07030 article EN Environmental Science & Technology 2021-03-25

Structure-from-Motion (SfM) methods can be broadly categorized as incremental or global according to their ways estimate initial camera poses. While system has advanced in robustness and accuracy, the efficiency remains its key challenge. To solve this problem, reconstruction simultaneously estimates all poses from epipolar geometry graph, but it is usually sensitive outliers. In work, we propose a new hybrid SfM method tackle issues of efficiency, accuracy unified framework. More...

10.1109/cvpr.2017.257 article EN 2017-07-01

Intelligent transportation systems (ITS) and navigation usually demand good timeliness on their used vector road network maps. Unfortunately, existing methods for map generation, such as the surveying method satellite image digitizing method, are difficult to generate up-to-date This paper presents a automatic generation of from massive GPS vehicle trajectories, which can trajectory data that be widely available. Given set data, input first is processed construct bitmap depicts network. Then...

10.1109/itsc.2009.5309871 article EN 2009-10-01

Purpose The purpose of this paper is to propose a screw axis identification (SAI) method based on the product exponentials (POE) model, which concerned with calibrating serial robot m joints equipped stereo‐camera vision system. Design/methodology/approach Different from conventional approaches, like circle point analysis (CPA) or system theoretic must collect great deal data, joint parameters for proposed only needs measure m+1 times n (n≥3) target points mounted manipulator end‐effector....

10.1108/01439911211201609 article EN Industrial Robot the international journal of robotics research and application 2012-03-02

One of the potentially effective means for large-scale 3D scene reconstruction is to reconstruct in a global manner, rather than incrementally, by fully exploiting available auxiliary information on imaging condition, such as camera location Global Positioning System (GPS), orientation inertial measurement unit (or compass), focal length from EXIF, and so on. However, information, though informative valuable, usually too noisy be directly usable. In this paper, we present an approach taking...

10.1109/tip.2015.2449557 article EN IEEE Transactions on Image Processing 2015-07-14

As an intermediate step between raw remote sensing data and digital maps, classification has been a challenging long-standing problem in the research community. In this work, automated effective supervised framework is presented for classifying high-resolution data. Specifically, method proceeds three main stages: feature extraction, classification, classified result refinement. extraction stage, both multispectral images 3D geometry are used, which utilizes complementary information from...

10.3390/ijgi6080245 article EN cc-by ISPRS International Journal of Geo-Information 2017-08-10

Depth completion aims to recover pixelwise depth from incomplete and noisy measurements with or without the guidance of a reference RGB image. This task attracted considerable research interest due its importance in various computer vision-based applications, such as scene understanding, autonomous driving, 3-D reconstruction, object detection, pose estimation, trajectory prediction, so on. As system input, an map is usually generated by projecting points collected ranging sensors, LiDAR...

10.1109/tnnls.2022.3201534 article EN IEEE Transactions on Neural Networks and Learning Systems 2022-09-02

Image based modeling and laser scanning are two commonly used approaches in large-scale architectural scene reconstruction nowadays. In order to generate a complete reconstruction, an effective way is completely cover the using ground aerial images, supplemented by on certain regions with low textures complicated structures. Thus, key issue accurately calibrate cameras register scans unified framework. To this end, we proposed three-step pipeline for merging images scans. First, captured...

10.1109/tcsvt.2019.2943892 article EN IEEE Transactions on Circuits and Systems for Video Technology 2019-09-27

Semantic segmentation of 3D scenes is one the most important tasks in field computer vision and has attracted much attention. In this paper, we propose a novel framework for semantic aerial photogrammetry models, which uses orthographic projection to improve efficiency while still ensuring high precision, can also be applied multiple types models (i.e., textured mesh or colored point cloud). our pipeline, first obtain RGB images elevation from scene through projection, then use image network...

10.1109/tcsvt.2023.3273224 article EN IEEE Transactions on Circuits and Systems for Video Technology 2023-05-05

In this paper, we put forward a new method for surface reconstruction from image-based point clouds. particular, introduce visibility model each line of sight to preserve scene details without decreasing the noise filtering ability. To make proposed suitable clouds with heavy noise, likelihood energy term total binary labeling problem Delaunay tetrahedra, and give its s-t graph implementation. Besides, further improve performance dense technique, which helps keep object edge sharp. The...

10.3390/s19061278 article EN cc-by Sensors 2019-03-13

Urban scene modeling is a challenging task for the photogrammetry and computer vision community due to its large scale, structural complexity, topological delicacy. This paper presents an efficient multistep framework large-scale urban scenes from aerial images. It takes images textured 3D mesh model generated by image-based system as input outputs compact polygon models with semantics at different levels of detail (LODs). Based on key observation that buildings usually have piecewise planar...

10.1109/tip.2021.3106811 article EN IEEE Transactions on Image Processing 2021-01-01

With the popularization of smartphones, larger collection videos with high quality is available, which makes scale scene reconstruction increase dramatically. However, high-resolution video produces more match outliers, and frame rate brings redundant images. To solve these problems, a tailor-made framework proposed to realize an accurate robust structure-from-motion based on monocular videos. The key ideas include two points: one use spatial temporal continuity sequences improve accuracy...

10.1109/tip.2022.3156375 article EN IEEE Transactions on Image Processing 2022-01-01

10.1016/j.isprsjprs.2023.01.020 article EN ISPRS Journal of Photogrammetry and Remote Sensing 2023-01-30

Visual localization, also known as camera pose estimation, is a crucial component of many applications, such robotics, autonomous driving, and augmented reality. Traditional visual localization algorithms typically run on point cloud maps generated by Structure-from-Motion (SfM) or Simultaneous Localization Mapping (SLAM). However, features are sensitive to weak textures illumination changes. In addition, the 3D often contain millions points, which puts higher demands device storage...

10.1109/lra.2024.3387137 article EN IEEE Robotics and Automation Letters 2024-04-10

We present a method for recovering the 3-D shape of an inextensible deformable surface from monocular image sequence. State-of-the-art methods on this problem, utilize <i xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">L</i> <sub xmlns:xlink="http://www.w3.org/1999/xlink">¿</sub> -norm reprojection residual vectors and formulate tracking problem as Second-Order Cone Programming (SOCP) problem. Instead using which is sensitive to outliers, we use...

10.1109/tip.2009.2038115 article EN IEEE Transactions on Image Processing 2009-12-10

10.1016/j.patcog.2017.11.003 article EN publisher-specific-oa Pattern Recognition 2017-11-06

The incremental Structure-from-Motion (SfM) technique has advanced in both robustness and accuracy, but the efficiency scalability remain its key challenges. In this paper, we propose a novel batched SfM to tackle these problems unified framework, where two iteration loops are contained. inner loop is tracks triangulation loop, selection method proposed find compact subset of for bundle adjustment (BA). outer camera registration batch cameras simultaneously added alleviate drifting risk...

10.1109/3dv.2017.00032 article EN 2021 International Conference on 3D Vision (3DV) 2017-10-01
Coming Soon ...