NFDI4DS | UHH-SEMS - Publication Details

Ling Shao

ORCID: 0000-0002-8264-6117

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5082634513

Research Areas

Advanced Image and Video Retrieval Techniques
Domain Adaptation and Few-Shot Learning
Advanced Neural Network Applications
Video Surveillance and Tracking Methods
Human Pose and Action Recognition
Multimodal Machine Learning Applications
Visual Attention and Saliency Detection
Advanced Image Processing Techniques
Image Retrieval and Classification Techniques
Image and Signal Denoising Methods
Anomaly Detection Techniques and Applications
Advanced Vision and Imaging
Image Enhancement Techniques
Video Analysis and Summarization
Medical Imaging Techniques and Applications
COVID-19 diagnosis using AI
Gait Recognition and Analysis
Advanced Image Fusion Techniques
Face recognition and analysis
Face and Expression Recognition
Generative Adversarial Networks and Image Synthesis
Image and Video Quality Assessment
Image Processing Techniques and Applications
Medical Image Segmentation Techniques
Robotics and Sensor-Based Localization

Inception Institute of Artificial Intelligence
2017-2025

University of Chinese Academy of Sciences
2019-2025

China University of Geosciences (Beijing)
2023-2025

Hubei University of Chinese Medicine
2024

RefleXion Medical (United States)
2021-2024

Tianjin University
2019-2023

University College of Applied Science
2023

Smile Train
2023

Institute of Economics
2023

Hefei University of Technology
2023

Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions

OPENALEX - Publications

Wenhai Wang Enze Xie Xiang Li Deng-Ping Fan Kaitao Song and 4 more

Although convolutional neural networks (CNNs) have achieved great success in computer vision, this work investigates a simpler, convolution-free backbone network use-fid for many dense prediction tasks. Unlike the recently-proposed Vision Transformer (ViT) that was designed image classification specifically, we introduce Pyramid (PVT), which overcomes difficulties of porting to various PVT has several merits compared current state arts. (1) Different from ViT typically yields low-resolution...

10.1109/iccv48922.2021.00061 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

A Fast Single Image Haze Removal Algorithm Using Color Attenuation Prior

OPENALEX - Publications

Qingsong Zhu Jiaming Mai Ling Shao

Single image haze removal has been a challenging problem due to its ill-posed nature. In this paper, we propose simple but powerful color attenuation prior for from single input hazy image. By creating linear model modeling the scene depth of under novel and learning parameters with supervised method, information can be well recovered. With map image, easily estimate transmission restore radiance via atmospheric scattering model, thus effectively remove Experimental results show that...

10.1109/tip.2015.2446191 article EN IEEE Transactions on Image Processing 2015-06-18

Multi-Stage Progressive Image Restoration

OPENALEX - Publications

Syed Waqas Zamir Aditya Arora Salman Khan Munawar Hayat Fahad Shahbaz Khan and 2 more

Image restoration tasks demand a complex balance between spatial details and high-level contextualized information while recovering images. In this paper, we propose novel synergistic design that can optimally these competing goals. Our main proposal is multi-stage architecture, progressively learns functions for the degraded inputs, thereby breaking down overall recovery process into more manageable steps. Specifically, our model first features using encoder-decoder architectures later...

10.1109/cvpr46437.2021.01458 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

PVT v2: Improved baselines with Pyramid Vision Transformer

OPENALEX - Publications

Wenhai Wang Enze Xie Xiang Li Deng-Ping Fan Kaitao Song and 4 more

Transformer recently has presented encouraging progress in computer vision. In this work, we present new baselines by improving the original Pyramid Vision (PVT v1) adding three designs, including (1) linear complexity attention layer, (2) overlapping patch embedding, and (3) convolutional feed-forward network. With these modifications, PVT v2 reduces computational of v1 to achieves significant improvements on fundamental vision tasks such as classification, detection, segmentation. Notably,...

10.1007/s41095-022-0274-8 article EN cc-by Computational Visual Media 2022-03-16

Inf-Net: Automatic COVID-19 Lung Infection Segmentation From CT Images

OPENALEX - Publications

Deng-Ping Fan Tao Zhou Ge-Peng Ji Yi Zhou Geng Chen and 3 more

Coronavirus Disease 2019 (COVID-19) spread globally in early 2020, causing the world to face an existential health crisis. Automated detection of lung infections from computed tomography (CT) images offers a great potential augment traditional healthcare strategy for tackling COVID-19. However, segmenting infected regions CT slices faces several challenges, including high variation infection characteristics, and low intensity contrast between normal tissues. Further, collecting large amount...

10.1109/tmi.2020.2996645 article EN IEEE Transactions on Medical Imaging 2020-05-22

Transfer Learning for Visual Categorization: A Survey

OPENALEX - Publications

Ling Shao Fan Zhu Xuelong Li

Regular machine learning and data mining techniques study the training for future inferences under a major assumption that are within same feature space or have distribution as data. However, due to limited availability of human labeled data, stay in cannot be guaranteed sufficient enough avoid over-fitting problem. In real-world applications, apart from target domain, related different domain can also included expand our prior knowledge about Transfer addresses such cross-domain problems by...

10.1109/tnnls.2014.2330900 article EN IEEE Transactions on Neural Networks and Learning Systems 2014-07-01

A survey on fall detection: Principles and approaches

OPENALEX - Publications

Muhammad Mubashir Ling Shao N.L. Seed

10.1016/j.neucom.2011.09.037 article EN Neurocomputing 2012-05-07

HRank: Filter Pruning Using High-Rank Feature Map

OPENALEX - Publications

Mingbao Lin Rongrong Ji Yan Wang Yichen Zhang Baochang Zhang and 2 more

Neural network pruning offers a promising prospect to facilitate deploying deep neural networks on resource-limited devices. However, existing methods are still challenged by the training inefficiency and labor cost in designs, due missing theoretical guidance of non-salient components. In this paper, we propose novel filter method exploring High Rank feature maps (HRank). Our HRank is inspired discovery that average rank multiple generated single always same, regardless number image batches...

10.1109/cvpr42600.2020.00160 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Video Salient Object Detection via Fully Convolutional Networks

OPENALEX - Publications

Wenguan Wang Jianbing Shen Ling Shao

This paper proposes a deep learning model to efficiently detect salient regions in videos. It addresses two important issues: 1) video saliency training with the absence of sufficiently large and pixel-wise annotated data 2) fast detection. The proposed network consists modules, for capturing spatial temporal information, respectively. dynamic model, explicitly incorporating estimates from static directly produces spatiotemporal inference without time-consuming optical flow computation. We...

10.1109/tip.2017.2754941 article EN IEEE Transactions on Image Processing 2017-09-20

Camouflaged Object Detection

OPENALEX - Publications

Deng-Ping Fan Ge-Peng Ji Guolei Sun Ming–Ming Cheng Jianbing Shen and 1 more

We present a comprehensive study on new task named camouflaged object detection (COD), which aims to identify objects that are "seamlessly" embedded in their surroundings. The high intrinsic similarities between the target and background make COD far more challenging than traditional task. To address this issue, we elaborately collect novel dataset, called COD10K, comprises 10,000 images covering various natural scenes, over 78 categories. All densely annotated with category, bounding-box,...

10.1109/cvpr42600.2020.00285 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

See More, Know More: Unsupervised Video Object Segmentation With Co-Attention Siamese Networks

OPENALEX - Publications

Xiankai Lu Wenguan Wang Chao Ma Jianbing Shen Ling Shao and 1 more

We introduce a novel network, called as CO-attention Siamese Network (COSNet), to address the unsupervised video object segmentation task from holistic view. emphasize importance of inherent correlation among frames and incorporate global co-attention mechanism improve further state-of-the-art deep learning based solutions that primarily focus on discriminative foreground representations over appearance motion in short-term temporal segments. The layers our network provide efficient...

10.1109/cvpr.2019.00374 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Binary Multi-View Clustering

OPENALEX - Publications

Zheng Zhang Li Liu Fumin Shen Heng Tao Shen Ling Shao

Clustering is a long-standing important research problem, however, remains challenging when handling large-scale image data from diverse sources. In this paper, we present novel Binary Multi-View (BMVC) framework, which can dexterously manipulate multi-view and easily scale to large data. To achieve goal, formulate BMVC by two key components: compact collaborative discrete representation learning binary clustering structure learning, in joint framework. Specifically, collaboratively encodes...

10.1109/tpami.2018.2847335 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2018-06-18

A rapid learning algorithm for vehicle classification

OPENALEX - Publications

Xuezhi Wen Ling Shao Yu Xue Wei Fang

10.1016/j.ins.2014.10.040 article EN Information Sciences 2014-10-24

Deep Dynamic Neural Networks for Multimodal Gesture Segmentation and Recognition

OPENALEX - Publications

Di Wu Lionel Pigou Pieter-Jan Kindermans Nam Le Ling Shao and 2 more

This paper describes a novel method called Deep Dynamic Neural Networks (DDNN) for multimodal gesture recognition. A semi-supervised hierarchical dynamic framework based on Hidden Markov Model (HMM) is proposed simultaneous segmentation and recognition where skeleton joint information, depth RGB images, are the input observations. Unlike most traditional approaches that rely construction of complex handcrafted features, our approach learns high-level spatio-temporal representations using...

10.1109/tpami.2016.2537340 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2016-03-02

Concealed Object Detection

OPENALEX - Publications

Deng-Ping Fan Ge-Peng Ji Ming–Ming Cheng Ling Shao

We present the first systematic study on concealed object detection (COD), which aims to identify objects that are visually embedded in their background. The high intrinsic similarities between and background make COD far more challenging than traditional detection/segmentation. To better understand this task, we collect a large-scale dataset, called COD10K, consists of 10,000 images covering diverse real-world scenarios from 78 categories. Further, provide rich annotations including...

10.1109/tpami.2021.3085766 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2021-06-01

Design and performance evaluation of a whole-body Ingenuity TF PET–MRI system

OPENALEX - Publications

Habib Zaidi N Ojha Michael Morich J.J. Griesmer Zhiqiang Hu and 5 more

The Ingenuity TF PET–MRI is a newly released whole-body hybrid PET–MR imaging system with Philips time-of-flight GEMINI PET and Achieva 3T X-series MRI system. Compared to PET–CT, modifications the positron emission tomography (PET) gantry were made avoid mutual interference deliver uncompromising performance which equivalent standalone systems. was redesigned introduce magnetic shielding for photomultiplier tubes (PMTs). Stringent electromagnetic noise requirements of MR necessitated...

10.1088/0031-9155/56/10/013 article EN Physics in Medicine and Biology 2011-04-20

Consistent Video Saliency Using Local Gradient Flow Optimization and Global Refinement

OPENALEX - Publications

Wenguan Wang Jianbing Shen Ling Shao

We present a novel spatiotemporal saliency detection method to estimate salient regions in videos based on the gradient flow field and energy optimization. The proposed incorporates two distinctive features: 1) intra-frame boundary information 2) inter-frame motion together for indicating regions. Based effective utilization of both field, our algorithm is robust enough object background complex scenes with various patterns appearances. Then, we introduce local as well global contrast...

10.1109/tip.2015.2460013 article EN IEEE Transactions on Image Processing 2015-07-22

Object-Centric Auto-Encoders and Dummy Anomalies for Abnormal Event Detection in Video

OPENALEX - Publications

Radu Tudor Ionescu Fahad Shahbaz Khan Mariana-Iuliana Georgescu Ling Shao

Abnormal event detection in video is a challenging vision problem. Most existing approaches formulate abnormal as an outlier task, due to the scarcity of anomalous data during training. Because lack prior information regarding events, these methods are not fully-equipped differentiate between normal and events. In this work, we formalize one-versus-rest binary classification Our contribution two-fold. First, introduce unsupervised feature learning framework based on object-centric...

10.1109/cvpr.2019.00803 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Crowd Counting and Density Estimation by Trellis Encoder-Decoder Networks

OPENALEX - Publications

Xiaolong Jiang Zehao Xiao Baochang Zhang Xiantong Zhen Xianbin Cao and 2 more

Crowd counting has recently attracted increasing interest in computer vision but remains a challenging problem. In this paper, we propose trellis encoder-decoder network (TEDnet) for crowd counting, which focuses on generating high-quality density estimation maps. The major contributions are four-fold. First, develop new architecture that incorporates multiple decoding paths to hierarchically aggregate features at different encoding stages, improves the representative capability of...

10.1109/cvpr.2019.00629 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Coming Soon ...