Jinglei Shi

ORCID: 0000-0003-2926-0415
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Vision and Imaging
  • Advanced Image Processing Techniques
  • Image Enhancement Techniques
  • Image Processing Techniques and Applications
  • Image and Signal Denoising Methods
  • Advanced Neural Network Applications
  • COVID-19 diagnosis using AI
  • Optical measurement and interference techniques
  • Optical Coherence Tomography Applications
  • Computer Graphics and Visualization Techniques
  • Advanced Image and Video Retrieval Techniques
  • Video Coding and Compression Technologies
  • Industrial Vision Systems and Defect Detection
  • Advanced Fluorescence Microscopy Techniques
  • Advanced Text Analysis Techniques
  • Domain Adaptation and Few-Shot Learning
  • Robotics and Sensor-Based Localization
  • Topic Modeling
  • Power Systems and Renewable Energy
  • Solar Thermal and Photovoltaic Systems
  • Video Surveillance and Tracking Methods
  • Digital Media Forensic Detection
  • Sentiment Analysis and Opinion Mining
  • Photoacoustic and Ultrasonic Imaging
  • Photonic and Optical Devices

Nankai University
2023-2024

Institut national de recherche en informatique et en automatique
2020-2022

Inria Rennes - Bretagne Atlantique Research Centre
2020-2022

Centre de Recherche en Informatique
2021

Kunming University of Science and Technology
2021

Huazhong University of Science and Technology
2018

Shandong Academy of Building Research
2016

In this paper, we propose a learning-based depth estimation framework suitable for both densely and sparsely sampled light fields. The proposed consists of three processing steps: initial estimation, fusion with occlusion handling, refinement. can be performed from flexible subset input views. disparity estimates, relying on two warping error measures, allows us to have an accurate in occluded regions along the contours. contrast methods computation cost volumes, approach does not need any...

10.1109/tip.2019.2923323 article EN IEEE Transactions on Image Processing 2019-06-21

In this paper, we present a learning-based framework for light field view synthesis from subset of input views. Building upon light-weight optical flow estimation network to obtain depth maps, our method employs two reconstruction modules in pixel and feature domains respectively. For the pixel-wise reconstruction, occlusions are explicitly handled by disparity-dependent interpolation filter, whereas inpainting on disoccluded areas is learned convolutional layers. Due disparity...

10.1109/cvpr42600.2020.00263 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

10.1109/cvpr52733.2024.00285 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

This paper proposes a learning based solution to disparity (depth) estimation for either densely or sparsely sampled light fields. Disparity between stereo pairs among sparse subset of anchor views is first estimated by fine-tuned FlowNet 2.0 network adapted prediction task. These coarse estimates are fused exploiting the photo-consistency warping error, and refined Multi-view Stereo Refinement Network (MSRNet). The propagation from viewpoints towards other performed an occlusion-aware soft...

10.1109/icassp.2019.8683773 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019-04-17

Light fields capture 3D scene information by recording light rays emitted from a at various orientations. They offer more immersive perception, compared with classic 2D images, but the cost of huge data volumes. In this paper, we design compact neural network representation for field compression task. same vein as deep image prior, takes randomly initialized noise input and is trained in supervised manner order to best reconstruct target Sub-Aperture Images (SAIs). The composed two types...

10.1109/tip.2024.3418670 article EN IEEE Transactions on Image Processing 2024-01-01

This paper presents JAWS, an optimization-driven approach that achieves the robust transfer of visual cinematic features from a reference in-the-wild video clip to newly generated clip. To this end, we rely on implicit-neural-representation (INR) in way compute shares same as We propose general formulation camera optimization problem INR computes extrinsic and intrinsic parameters well timing. By leveraging differentiability neural representations, can back-propagate our designed losses...

10.1109/cvpr52729.2023.01624 preprint EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

In this paper, we propose a novel light field compression method based on low rank-constrained neural scene representation. While most existing methods directly compress the views, our first learns Multi-Layer Perceptron (MLP)-based Neural Radiance Field (NeRF) from input views. To be able to efficiently NeRF representation, weights of MLP are optimized under low-rank constraint using Alternating Direction Method Multipliers (ADMM) optimization method. The then decomposed into Tensor Train...

10.1109/icassp49357.2023.10095668 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023-05-05

Deep generative models have proven to be effective priors for solving a variety of image processing problems. However, the learning realistic priors, based on large number parameters, requires amount training data. It has been shown recently, with so-called deep prior (DIP), that randomly initialized neural networks can act as good without learning. In this paper, we propose model light fields, which is compact and does not require any data other than field itself. To show potential proposed...

10.1109/tip.2022.3217374 article EN IEEE Transactions on Image Processing 2022-01-01

We propose in this paper a Quantized Distilled Low-Rank Neural Radiance Field (QDLR-NeRF) representation for the task of light field compression. While existing compression methods encode set sub-aperture images, our proposed method learns an implicit scene form (NeRF), which also enables view synthesis. To reduce its size, model is first learned under (LR) constraint using Tensor Train (TT) decomposition within Alternating Direction Method Multipliers (ADMM) optimization framework. further...

10.48550/arxiv.2208.00164 preprint EN other-oa arXiv (Cornell University) 2022-01-01

In this paper, we propose a deep residual architecture that can be used both for synthesizing high quality angular views in light fields and temporal frames classical videos. The proposed framework consists of an optical flow estimator optimized view synthesis, trainable feature extractor convolutional network pixel feature-based reconstruction. Among these modules, the fine-tuning specifically synthesis task yields scene depth or motion information is well targeted problem. cooperation with...

10.1109/tci.2022.3160671 article EN IEEE Transactions on Computational Imaging 2022-01-01

10.1109/icme57554.2024.10687732 article EN 2022 IEEE International Conference on Multimedia and Expo (ICME) 2024-07-15

How to explore useful features from images as prompts guide the deep image restoration models is an effective way solve restoration. In contrast mining spatial relations within prompt, which leads characteristics of different frequencies being neglected and further remaining subtle or undetectable artifacts in restored image, we develop a Frequency Prompting method, dubbed FPro, can effectively provide prompt components frequency perspective guild model address these differences....

10.48550/arxiv.2404.00288 preprint EN arXiv (Cornell University) 2024-03-30

Exploring motion information is important for the deblurring task. Recent window-based transformer approaches have achieved decent performance in image deblurring. Note that causing blurry results usually composed of translation and rotation movements window-shift operation Cartesian coordinate system by only directly explores orthogonal directions. Thus, these methods limitation modeling part. To alleviate this problem, we introduce polar coordinate-based transformer, which has angles...

10.48550/arxiv.2404.00358 preprint EN arXiv (Cornell University) 2024-03-30

In this paper, we leverage image complexity as a prior for refining segmentation features to achieve accurate real-time semantic segmentation. The design philosophy is based on the observation that different pixel regions within an exhibit varying levels of complexity, with higher complexities posing greater challenge We thus introduce guidance and propose Image Complexity prior-guided Feature Refinement Network (ICFRNet). This network aggregates both produce attention map Guided Attention...

10.48550/arxiv.2408.13771 preprint EN arXiv (Cornell University) 2024-08-25

10.1016/j.image.2022.116721 article EN Signal Processing Image Communication 2022-05-06

The application of deep learning in traditional industries has not gained much attention. However, a great potential to be transplanted other fields. And we managed apply two techniques learning, object detection and tracking, dynamic counting. We test it on one the basic problem steel industry, rebar To cope with this, used an infrared camera collect video spot so that can distinguished from background apparently. Then use complete counting work. divided process into parts: tracking....

10.1117/12.2503014 article EN 2018-08-09

Unsupervised aspect identification is a challenging task in aspect-based sentiment analysis. Traditional topic models are usually used for this task, but they not appropriate short texts such as product reviews. In work, we propose an model based on vector reconstruction. A key of our that make connections between sentence vectors and multi-grained using fuzzy k-means membership function. Furthermore, to full use different representations space, reconstruct coarse-grained fine-grained...

10.3233/jifs-210175 article EN Journal of Intelligent & Fuzzy Systems 2021-04-27

The paper selects distributed solar water heating system of certain high-rise house in Jinan City as research object, its field performance test and energy efficiency assessment are made, obtains annual guarantee rate conventional substitution quantity the building, makeseconomic benefit assessment, analyzes saving effect economic system, gives out index index.Research results show building has good benefit. ForewordDesign, construction, management, etc. become increasingly normalized, but...

10.2991/nceece-15.2016.221 article EN cc-by-nc 2016-01-01

Light fields are a type of image data that capture both spatial and angular scene information by recording light rays emitted from different orientations. In this context, is defined as features remain static regardless perspectives, while refers to vary between viewpoints. We propose novel neural network that, design, can separate field. The represents using kernels shared among all Sub-Aperture Images (SAIs), sets for each SAI. To further improve the representation capability without...

10.48550/arxiv.2304.06322 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Light field is a type of image data that captures the 3D scene information by recording light rays emitted from at various orientations. It offers more immersive perception than classic 2D images but cost huge volume. In this paper, we draw inspiration visual characteristics Sub-Aperture Images (SAIs) and design compact neural network representation for compression task. The backbone takes randomly initialized noise as input supervised on SAIs target field. composed two types complementary...

10.48550/arxiv.2307.06143 preprint EN other-oa arXiv (Cornell University) 2023-01-01

10.5220/0010793900003124 article EN cc-by-nc-nd Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications 2022-01-01

Axial light field resolution refers to the ability distinguish features at different depths by refocusing. The axial refocusing precision corresponds minimum distance in direction between two distinguishable planes. High can be essential for some applications like microscopy. In this paper, we propose a learning-based method extrapolate novel views from volumes of sheared epipolar plane images (EPIs). As extended numerical aperture (NA) classical imaging, extrapolated gives re-focused with...

10.48550/arxiv.2103.06510 preprint EN other-oa arXiv (Cornell University) 2021-01-01
Coming Soon ...