Shan Liu

ORCID: 0000-0003-1847-8769
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Vision and Imaging
  • Video Coding and Compression Technologies
  • Advanced Image Processing Techniques
  • Advanced Data Compression Techniques
  • 3D Shape Modeling and Analysis
  • Image and Video Quality Assessment
  • Computer Graphics and Visualization Techniques
  • Remote Sensing and LiDAR Applications
  • 3D Surveying and Cultural Heritage
  • Image Enhancement Techniques
  • Image and Signal Denoising Methods
  • Generative Adversarial Networks and Image Synthesis
  • Advanced Image and Video Retrieval Techniques
  • Advanced Neural Network Applications
  • Multimedia Communication and Technology
  • Image Processing Techniques and Applications
  • Visual Attention and Saliency Detection
  • Crystallization and Solubility Studies
  • X-ray Diffraction in Crystallography
  • Human Pose and Action Recognition
  • Advanced Computational Techniques and Applications
  • Robotics and Sensor-Based Localization
  • Air Quality and Health Impacts
  • Image Processing and 3D Reconstruction
  • Optical measurement and interference techniques

Marine Biomedical Research Institute of Qingdao
2025

Sichuan Academy of Medical Sciences & Sichuan Provincial People's Hospital
2018-2025

University of Electronic Science and Technology of China
2012-2025

Fujian Agriculture and Forestry University
2024-2025

Fujian Medical University
2025

Tencent (China)
2018-2024

Southwest University
2022-2024

Rutgers New Jersey Medical School
2024

KLA (United States)
2018-2024

Hubei University of Technology
2024

Versatile Video Coding (VVC) was finalized in July 2020 as the most recent international video coding standard. It developed by Joint Experts Team (JVET) of ITU-T Group (VCEG) and ISO/IEC Moving Picture (MPEG) to serve an ever-growing need for improved compression well support a wider variety today’s media content emerging applications. This paper provides overview novel technical features new applications core technologies achieving significant bit rate reductions neighborhood 50% over its...

10.1109/tcsvt.2021.3101953 article EN cc-by IEEE Transactions on Circuits and Systems for Video Technology 2021-08-02

Video anomaly detection under weak labels is formulated as a typical multiple-instance learning problem in previous works. In this paper, we provide new perspective, i.e., supervised task noisy labels. such viewpoint, long cleaning away label noise, can directly apply fully action classifiers to weakly detection, and take maximum advantage of these well-developed classifiers. For purpose, devise graph convolutional network correct Based upon feature similarity temporal consistency, our...

10.1109/cvpr.2019.00133 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Image inpainting techniques have shown significant improvements by using deep neural networks recently. However, most of them may either fail to reconstruct reasonable structures or restore fine-grained textures. In order solve this problem, in paper, we propose a two-stage model which splits the task into two parts: structure reconstruction and texture generation. first stage, edge-preserved smooth images are employed train reconstructor completes missing inputs. second based on...

10.1109/iccv.2019.00027 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Generating portrait images by controlling the motions of existing faces is an important task great consequence to social media industries. For easy use and intuitive control, semantically meaningful fully disentangled parameters should be used as modifications. However, many techniques do not provide such fine-grained controls or indirect editing methods i.e. mimic other individuals. In this paper, a Portrait Image Neural Renderer (PIRenderer) proposed control face with three-dimensional...

10.1109/iccv48922.2021.01350 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

In point cloud compression, sufficient contexts are significant for modeling the distribution. However, gathered by previous voxel-based methods decrease when handling sparse clouds. To address this problem, we propose a multiple-contexts deep learning framework called OctAttention employing octree structure, memory-efficient representation Our approach encodes symbol sequences in lossless way gathering information of sibling and ancestor nodes. Expressly, first represent clouds with to...

10.1609/aaai.v36i1.19942 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2022-06-28

With the emerging applications such as online gaming and Wi-Fi display, screen content video, including computer generated text, graphics animations, becomes more popular than ever. Traditional video coding technologies typically were developed based on models that fit into natural, camera-captured video. The distinct characteristics exhibited between these two types of contents necessitate exploration efficiency improvement given new tools can be specially for HEVC Screen Content Coding...

10.1109/jetcas.2016.2597645 article EN IEEE Journal on Emerging and Selected Topics in Circuits and Systems 2016-08-19

An explainable machine learning method for point cloud classification, called the PointHop method, is proposed in this work. The consists of two stages: 1) local-to-global attribute building through iterative one-hop information exchange, and 2) classification ensembles. In stage, we address problem unordered data using a space partitioning procedure developing robust descriptor that characterizes relationship between its neighbor unit. When put multiple units cascade, attributes will grow...

10.1109/tmm.2019.2963592 article EN IEEE Transactions on Multimedia 2020-01-01

Generic object detection algorithms have proven their excellent performance in recent years. However, on underwater datasets is still less explored. In contrast to generic datasets, images usually color shift and low contrast; sediment would cause blurring images. addition, creatures often appear closely each other due living habits. To address these issues, our work investigates augmentation policies simulate overlapping, occluded blurred objects, we construct a model capable of achieving...

10.1109/icassp40776.2020.9053829 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020-04-09

This paper presents the intra prediction and mode coding of Versatile Video Coding (VVC) standard. standard was collaboratively developed by Joint Experts Team (JVET). It follows traditional architecture a hybrid block-based codec that also basis previous standards. Almost all features VVC either contain substantial modifications in comparison with its predecessor H.265/HEVC or were newly added. The key aspects these tools are following: 65 angular modes block shape-adaptive directions 4-tap...

10.1109/tcsvt.2021.3072430 article EN cc-by IEEE Transactions on Circuits and Systems for Video Technology 2021-04-12

Instance segmentation in high-resolution (HR) remote sensing imagery is one of the most challenging tasks and more difficult than object detection semantic tasks. It aims to predict class labels pixel-wise instance masks locate instances an image. However, there are rare methods currently suitable for HR images. Meanwhile, it implement due complex background In this article, a novel approach based on Cascade Mask R-CNN proposed, which called high-quality network (HQ-ISNet). scheme, HQ-ISNet...

10.3390/rs12060989 article EN cc-by Remote Sensing 2020-03-19

In the past decade, development of transform coding techniques has achieved significant progress and several advanced tools have been adopted in new generation Versatile Video Coding (VVC) standard. this paper, a brief history during VVC standardization is presented, standard are described detail together with their initial design, incremental improvements implementation aspects. To improve efficiency, four introduced VVC, which namely Multiple Transform Selection (MTS), Low-Frequency...

10.1109/tcsvt.2021.3087706 article EN cc-by IEEE Transactions on Circuits and Systems for Video Technology 2021-06-09

<h3>Objective</h3> Precise genetic analyses were conducted with ring finger protein 213 (<i>RNF213</i>) in relation to a particular clinical phenotype Chinese patients moyamoya disease (MMD) determine whether heterozygosity is responsible for the early-onset and severe form of this disease. <h3>Methods</h3> A case–control study <i>RNF213</i> p.R4810K involving 1,385 MMD 2,903 normal control participants was performed. Correlation between genotype or different features also statistically...

10.1212/wnl.0000000000008901 article EN cc-by-nc-nd Neurology 2020-01-17

The previous deep video compression approaches only use the single scale motion compensation strategy and rarely adopt mode prediction technique from traditional standards like H.264/H.265 for both residual compression. In this work, we first propose a coarse-to-fine (C2F) framework better compensation, in which perform estimation, twice coarse to fine manner. Our C2F can achieve results without significantly increasing bit costs. Observing hyperprior information (i.e., mean variance values)...

10.1109/cvpr52688.2022.00583 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

A steady increase in available processing power continues to drive advances video compression technology. The recently completed Versatile Video Coding (VVC) standard aims double the efficiency of HEVC and deliver a same quality at half bitrate. To achieve this goal, VVC includes several new methods that improve coding cost increased complexity. This paper provides complexity analysis its VTM reference software. Whereas is more complex than HEVC, it remains readily implementable software on...

10.1109/tcsvt.2021.3072204 article EN cc-by IEEE Transactions on Circuits and Systems for Video Technology 2021-04-09

Inspired by the recent PointHop classification method, an unsupervised 3D point cloud registration called R-PointHop, is proposed in this work. R-PointHop first determines a local reference frame (LRF) for every using its nearest neighbors and finds attributes. Next, obtains local-to-global hierarchical features downsampling, neighborhood expansion, attribute construction dimensionality reduction steps. Thus, correspondences are built feature space neighbor rule. Afterwards, subset of...

10.1109/tip.2022.3160609 article EN IEEE Transactions on Image Processing 2022-01-01

Mapping tree crown is critical for estimating the functional and spatial distribution of ecosystem services. However, accurate up-to-date urban mapping remains a challenge due to time-consuming nature field sampling heterogeneity. Another data cost, which always concern low-cost processing forest maps on large scales. Here, we developed novel working framework by integrating an advanced deep learning technology, Mask Region-based Convolutional Neural Network (Mask R-CNN) model with Google...

10.1016/j.jag.2022.102764 article EN cc-by-nc-nd International Journal of Applied Earth Observation and Geoinformation 2022-04-01

We deal with the controllable person image synthesis task which aims to re-render a human from reference explicit control over body pose and appearance. Observing that images are highly structured, we propose generate desired by extracting distributing semantic entities of images. To achieve this goal, neural texture extraction distribution operation based on double attention is described. This first extracts textures feature maps. Then, it distributes extracted according spatial...

10.1109/cvpr52688.2022.01317 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

With the rapid development of 3D vision, point cloud has become an increasingly popular visual media content. Due to irregular structure, posed novel challenges related research, such as compression, transmission, rendering and quality assessment. In these latest researches, assessment (PCQA) attracted wide attention due its significant role in guiding practical applications, especially many cases where reference is unavailable. However, current no-reference metrics which based on prevalent...

10.1109/tvcg.2023.3282802 article EN IEEE Transactions on Visualization and Computer Graphics 2023-06-28

Block-based compression scheme shows remarkable success in image and video coding. However, existing tree-type block partition methods usually divide point clouds into clusters with few or disjoint points due to the irregular sampling, which is adverse for subsequent transform exploit local region correlation. Moreover, widely-used optimal transform, e.g., Discrete Cosine Transform (DCT), deduced assumption of specified probability model. Thus these transforms cannot adequately conform...

10.1109/tcsvt.2023.3235891 article EN IEEE Transactions on Circuits and Systems for Video Technology 2023-01-10

There is an urgent need from various multimedia applications to efficiently compress point clouds. The Moving Picture Experts Group has released a standard platform called geometry-based cloud compression (G-PCC). However, its <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">k</i> -nearest neighbor (k-NN) based attribute prediction limited efficiency for clouds with rich texture and directional information. To overcome this problem, we propose...

10.1109/tcsvt.2024.3396694 article EN IEEE Transactions on Circuits and Systems for Video Technology 2024-05-03

The substantial data volume within dynamic point clouds representing three-dimensional moving entities necessitates advancements in compression techniques. Motion estimation (ME) is crucial for reducing cloud temporal redundancy. Standard block-based ME schemes, which typically utilize the previously decoded as inter-reference frames, often yield inaccurate and translation-only estimates clouds. To overcome this limitation, we propose an advanced patch-based affine scheme geometry...

10.3390/s24103142 article EN cc-by Sensors 2024-05-15

There is a pressing need across various applications for efficiently compressing point clouds. While the Moving Picture Experts Group introduced geometry-based cloud compression (G-PCC) standard, its attribute scheme falls short of eliminating signal frequency-domain redundancy. This paper proposes texture-guided graph transform optimization compression. We formulate coding task as problem, considering both decorrelation capability and sparsity optimized within tailored joint framework....

10.3390/app14104094 article EN cc-by Applied Sciences 2024-05-11
Coming Soon ...