- Video Coding and Compression Technologies
- Advanced Vision and Imaging
- Advanced Data Compression Techniques
- Advanced Image Processing Techniques
- Image and Video Quality Assessment
- Image Enhancement Techniques
- Image and Signal Denoising Methods
- Image Retrieval and Classification Techniques
- AI in cancer detection
- Advanced Steganography and Watermarking Techniques
- Color Science and Applications
- Multimedia Communication and Technology
- Advanced Image and Video Retrieval Techniques
- Digital Imaging for Blood Diseases
- Advanced Algorithms and Applications
- COVID-19 diagnosis using AI
- Digital Filter Design and Implementation
- Advanced Sensor and Control Systems
- Face recognition and analysis
- Synthesis of Organic Compounds
- Telecommunications and Broadcasting Technologies
- Speech and Audio Processing
- Underwater Vehicles and Communication Systems
- Machine Learning in Healthcare
- Advanced Computational Techniques and Applications
Dolby (United States): 2013-2024
Ministry of Education of the People's Republic of China: 2023
Jilin University: 2022-2023
Maebashi Institute of Technology: 2023
Union Hospital: 2022
Sichuan University: 2014
Dolby (Netherlands): 2013
Télécom Paris: 2013
Danaher (United Kingdom): 2012
Princeton University: 2002-2011
The JVT/H.264 video coding standard achieves considerably higher coding efficiency than previous standards. Unfortunately, this comes at the cost of increased complexity at the encoder, mainly due to motion estimation and mode decision. In this paper, we propose a new scheme to jointly optimize motion estimation and mode decision. Simulation results show that we achieve up to 90% complexity reduction while maintaining coding efficiency.
This paper presents an overview of the technologies for in-loop processing and filtering in the Versatile Video Coding (VVC) standard. These processes comprise luma mapping with chroma scaling, the deblocking filter, sample adaptive offset, the adaptive loop filter, and the cross-component adaptive loop filter. They are qualified as "in-loop" because they are applied inside the encoding and decoding loops, before storing the pictures in the decoded picture buffer. The filters are complementary and address different purposes. Luma mapping with chroma scaling aims at adaptively...
This paper discusses the problem of reduced-resolution transcoding of compressed video bitstreams. An analysis of drift errors is provided to identify the sources of quality degradation when transcoding to a lower spatial resolution. Two types of drift error are considered: a reference picture error, which has been identified in previous works, and an error due to the noncommutative property of motion compensation and down-sampling, which is unique to this work. To overcome these sources of error, four novel architectures are presented. One architecture attempts to compensate for reduced...
The H.264 standard achieves much higher coding efficiency than previous video coding standards, due to its improved inter and intra prediction modes, which come at the cost of higher computational complexity. When transcoding from MPEG-2 to H.264, motion information from MPEG-2 can be used to speed up the H.264 motion search. Fast mode decision algorithms are proposed for B and P frames, and a fast intra mode decision algorithm is developed for intra coding. In addition, fast motion estimation is proposed by reusing motion information from MPEG-2. Simulation results demonstrate that we achieve significant complexity...
Frame partitioning is a process of key importance in efficient video coding. Most recent video compression technologies, like H.264/AVC, use tree-based frame partitioning. This proves to be more efficient than the simple uniform block partitioning typically used in older video coding standards such as MPEG-2 or H.263. However, tree-based frame partitioning still does not code the video information efficiently enough, as it is unable to capture the geometric structure of 2D data. During the last years, several works have been developed, mainly in the domain of image representation and...
This paper describes a new video coding tool in the Versatile Video Coding (VVC) standard named luma mapping with chroma scaling (LMCS). Experimental compression performance results for LMCS and non-normative examples of deriving LMCS parameter values are also provided. LMCS has two main components: 1) a process for mapping input luma code values to a new set of code values for use inside the coding loop; and 2) a luma-dependent process for scaling chroma residue values. The first process, luma mapping, aims at improving coding efficiency for high dynamic range signals by making better use of the luma code values allowed at a specified bit...
In network delivery of digital video, if the bandwidth required for a video is not available, the video has to be recoded at a reduced bit rate. It is highly desirable that the transcoding be carried out in real time while maintaining reasonable image quality. In this paper, we propose a fast approach to derive from an MPEG stream a new MPEG stream with half the spatial resolution. For the downsized video, we first generate from the original compressed video an improved estimate of the motion vectors. We then propose a compressed-domain approach with data hiding to produce the new DCT residues by an open-loop method. The...
We propose new tools for multiview video coding (MVC) that aim to compensate for mismatches between frames corresponding to different views. Such mismatches could be caused by different shooting positions of the cameras and/or heterogeneous camera settings. In particular, we consider illumination and focus mismatches across views, i.e., such that different portions of a frame can undergo illumination and blurriness/sharpness changes with respect to the corresponding areas in frames from other views. Models for these mismatches are proposed, and new coding tools are developed based on the proposed models. We propose a block-based illumination compensation (IC) technique and a depth-dependent...
Error concealment plays an important role in combating transmission errors. Methods of error concealment that produce better quality are generally of higher complexity, thus making some of the more sophisticated algorithms unsuitable for real-time applications or for client devices with limited capability. We propose an approach that uses data hiding to facilitate error concealment at the decoder. A set of features is extracted at the encoder and embedded imperceptibly into the host media. If part of the media is damaged during transmission, the embedded features can be used...
We consider the effect of depth-image compression artifacts on the quality of virtual views rendered using neighboring views. Such view rendering processes are utilized in new video applications such as 3D television (3DTV) and free viewpoint video (FVV). We first analyze how compressed depth-images result in rendering distortions and show that the rendered position error is a monotonic function of the depth coding error. For the scenario in which cameras are arranged with parallel optical axes, we further demonstrate specific properties of the resulting distortions. Exploiting special...
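The relation between depth coding error and rendered position error can be illustrated with a minimal sketch. It assumes a convention that is not stated in the abstract: 8-bit depth values quantize inverse depth uniformly between `z_near` and `z_far`, and the cameras are parallel with focal length `focal_px` (in pixels) and baseline `baseline`. The function names and parameters are hypothetical, and this is not necessarily the model used in the paper.

```python
def disparity_from_depth_value(v, focal_px, baseline, z_near, z_far, levels=255):
    """Disparity in pixels for a quantized depth value v.

    Assumption: v quantizes inverse depth uniformly between 1/z_far (v = 0)
    and 1/z_near (v = levels); cameras have parallel optical axes.
    """
    inv_z = (v / levels) * (1.0 / z_near - 1.0 / z_far) + 1.0 / z_far
    return focal_px * baseline * inv_z


def position_error(v, depth_error, focal_px, baseline, z_near, z_far):
    """Horizontal shift of a rendered pixel caused by a depth coding error."""
    coded = disparity_from_depth_value(v + depth_error, focal_px, baseline, z_near, z_far)
    exact = disparity_from_depth_value(v, focal_px, baseline, z_near, z_far)
    return coded - exact


if __name__ == "__main__":
    # Hypothetical camera setup: 1000 px focal length, 5 cm baseline, scene depth 1 m to 10 m.
    for e in (-4, -2, 0, 1, 3, 8):
        print(e, position_error(100, e, 1000.0, 0.05, 1.0, 10.0))
```

Under these assumptions the position error is linear in the depth error, hence monotonic, and it does not depend on the depth value itself.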
Modern video coding standards such as H.264/AVC use intra prediction for efficient coding of Intra pictures. These prediction modes usually exploit local directional signal correlations. More recently, intra prediction modes using non-local information have been introduced. A very popular approach is the so-called template matching prediction (TMP), which uses template matching based texture synthesis for prediction. This, combined with regular intra prediction, significantly improves coding efficiency compared to H.264/AVC. However, current TMP techniques have trouble synthesizing...
This paper describes a new video coding scheme based on a simplified block structure that significantly outperforms the coding efficiency of the ISO/IEC 14496-10 | ITU-T H.264 advanced video coding (AVC) standard. Its conceptual design is similar to a typical block-based hybrid coder applying prediction and subsequent prediction error coding. The basic coding unit is an 8 × 8 block for inter, or a 16 × 16 block for intra, instead of the usual 16 × 16 macroblock. No larger block sizes are considered for prediction and transform. Based on this simplified block structure, the coding scheme uses simple and fundamental coding tools with optimized encoding...
The ITU-T Video Coding Experts Group and the ISO/IEC Moving Picture Experts Group issued a Call for Proposals (CfP) on video compression with capability beyond HEVC in October 2017. The CfP considered three categories of video content: Standard Dynamic Range, High Dynamic Range with Wide Colour Gamut (HDR/WCG), and 360° Omni-directional video. As a result of the CfP process, the development of a new video coding standard, named Versatile Video Coding (VVC), was initiated. The goal of this paper is to provide an overview of the CfP responses in the HDR/WCG category. This includes a summary of the work leading...
The paper describes a new compression method for handling local brightness variations in video sequences. The method does not explicitly code and transmit the weighting parameters used to predict the current block. Instead, the weighting parameters are estimated by considering the correlation of the current block with its spatial neighbors and their temporal predictors. Since the information for deriving these parameters is available at both the encoder and the decoder, no additional bits are required to be coded and transmitted. Compared to weighted prediction in the H.264 standard, our approach can further...
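The idea of deriving weighting parameters without signaling can be sketched as follows. This is a minimal illustration, not the paper's algorithm: it assumes a linear gain/offset brightness model fitted by least squares, using already-reconstructed pixels neighboring the current block and the co-located neighbors of the reference block. The function names `derive_weights` and `predict_block` are hypothetical.

```python
import numpy as np


def derive_weights(cur_neighbors, ref_neighbors):
    """Least-squares gain and offset mapping reference-side neighbors to
    current-side neighbors. Only reconstructed pixels are used, so a decoder
    can repeat the same derivation and no parameters need to be transmitted."""
    x = np.asarray(ref_neighbors, dtype=float).ravel()
    y = np.asarray(cur_neighbors, dtype=float).ravel()
    var = np.var(x)
    if var < 1e-9:
        # Flat neighborhood: fall back to an offset-only model.
        return 1.0, float(y.mean() - x.mean())
    gain = float(np.mean((x - x.mean()) * (y - y.mean())) / var)
    offset = float(y.mean() - gain * x.mean())
    return gain, offset


def predict_block(ref_block, gain, offset, bit_depth=8):
    """Apply the derived brightness model to the motion-compensated block."""
    pred = gain * np.asarray(ref_block, dtype=float) + offset
    return np.clip(np.rint(pred), 0, (1 << bit_depth) - 1).astype(np.int64)


if __name__ == "__main__":
    ref_nb = [10, 20, 30, 40]   # neighbors of the reference block
    cur_nb = [19, 34, 49, 64]   # neighbors of the current block (1.5 * ref + 4)
    g, o = derive_weights(cur_nb, ref_nb)
    print(g, o, predict_block([[20, 40]], g, o))
```

Because the encoder and decoder run the same derivation on the same reconstructed data, their predictions stay in sync without any side information.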
Block-based motion and disparity compensation are popular techniques to exploit correlation between video frames. Block sizes used for compensation can be chosen to achieve a good trade-off between signaling overhead and prediction accuracy. However, motion and disparity field boundaries correspond to objects having arbitrary shapes; this limits the accuracy of block-based compensation, even when small block sizes are chosen. In this paper we seek to enable compensation based on arbitrarily-shaped regions, while preserving an essentially block-based architecture. To do so, we propose...
Many modern video coding strategies, such as the H.264/AVC standard, use quadtree-based partition structures for coding intra macroblocks. Such a structure allows the coding algorithm to adapt to the complicated and non-stationary nature of natural images. Despite the adaptation flexibility of quadtree partitions, recent studies have shown that these partitions are not efficient enough (in terms of rate-distortion performance) when images can be locally modeled as 2D piecewise-smooth signals. These observations motivate us to investigate...
In backward-compatible HDR image/video compression, it is a general approach to reconstruct HDR from compressed LDR as a prediction of the original HDR, which is referred to as inverse tone mapping. Experimental results show that a 2-piecewise 2nd-order polynomial has better mapping accuracy than a 1-piece high-order polynomial or a 2-piecewise linear mapping, but it is also the most time-consuming method, because finding the optimal pivot point to split the range into 2 pieces requires an exhaustive search. In this paper, we propose a fast algorithm that completes in...
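The exhaustive baseline mentioned above can be sketched as follows; this is the slow search that motivates the paper, not its fast algorithm. It assumes paired LDR/HDR sample values and least-squares fitting with NumPy's `polyfit`. The names `fit_two_piece_quadratic` and `inverse_tone_map` are hypothetical.

```python
import numpy as np


def fit_two_piece_quadratic(ldr, hdr, min_points=3):
    """Try every split position, fit one 2nd-order polynomial per piece,
    and keep the split with the smallest total squared error."""
    x = np.asarray(ldr, dtype=float)
    y = np.asarray(hdr, dtype=float)
    order = np.argsort(x)
    x, y = x[order], y[order]
    best = None
    for k in range(min_points, len(x) - min_points + 1):
        low = np.polyfit(x[:k], y[:k], 2)
        high = np.polyfit(x[k:], y[k:], 2)
        sse = float(np.sum((np.polyval(low, x[:k]) - y[:k]) ** 2)
                    + np.sum((np.polyval(high, x[k:]) - y[k:]) ** 2))
        if best is None or sse < best[3]:
            best = (float(x[k]), low, high, sse)
    if best is None:
        raise ValueError("need at least 2 * min_points samples")
    return best  # (pivot, low_coeffs, high_coeffs, sse)


def inverse_tone_map(ldr, pivot, low, high):
    """Predict HDR values: low polynomial below the pivot, high one from it upward."""
    x = np.asarray(ldr, dtype=float)
    return np.where(x < pivot, np.polyval(low, x), np.polyval(high, x))


if __name__ == "__main__":
    x = np.arange(40.0)
    y = np.where(x < 20, x ** 2, 450 + 5 * (x - 20) + 3 * (x - 20) ** 2)
    pivot, low, high, sse = fit_two_piece_quadratic(x, y)
    print(pivot, sse, inverse_tone_map([5.0, 25.0], pivot, low, high))
```

Each candidate pivot costs two polynomial fits over the samples, so the total cost grows roughly quadratically with the number of samples, which is the expense a fast algorithm would aim to avoid.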
Geometry-adaptive block partitioning (GEO) can greatly enhance video coding efficiency, but at the expense of significantly increased computational complexity. Instead of proposing a fast searching algorithm for encoding only, this paper proposes to reduce the size of the GEO partition set at both the encoder and the decoder. The proposed scheme only searches the partitions recognized as the most valuable, a set which is derived by analyzing the contribution of different GEO partitions to coding efficiency. On one hand, with a limited number of partitions, the computation burden is much...
The ITU-T Video Coding Experts Group (VCEG) and the ISO/IEC Moving Picture Experts Group (MPEG) issued in October 2017 a joint Call for Proposals (CfP) on video compression with capability beyond HEVC. The CfP included three categories of video content: standard dynamic range (SDR), high dynamic range with wide color gamut (HDR/WCG), and 360° omni-directional video (360°). This paper describes a response to the CfP that considers all three categories of content. The core codec is designed based on the joint exploration model (JEM) reference software. The key coding tools in JEM are significantly...
This paper provides a brief overview of the H.264 video coding standard, focusing on its new features and applications. Outperforming all previous video coding standards over a wide range of bit-rates, H.264 is expected to flood the market in a large number of applications ranging from real-time conversational services to TV broadcasting or Internet streaming. At the basis of its high performance there is: a layered structure, improving network friendliness; the adoption of clean and simple solutions, enabling efficient implementations; the capability...
The H.264/MPEG-4 AVC video coding standard is very promising due to its high coding efficiency. In this paper, we propose a new constant bit rate control method based on the rate-distortion model of TMN8. The novelty of our approach is the use of simple preprocessing to achieve the target bit rate more accurately, better bit allocation and buffer management for both frame-level and macroblock-level rate control, improved perceptual quality, and the adoption of virtual frame skipping. Simulations show that our method can meet the target bitrate even for content with scene changes...
The paper presents the joint proposal by Arris, Dolby and InterDigital as a response to the Call-for-Evidence of High Dynamic Range and Wide Color Gamut (HDR/WCG) video compression in MPEG. The proposal introduces a set of new HDR coding technologies, including the IPT-PQ color space, an adaptive reshaping process, enhancement filters, and a transfer function. These technologies are applied to the decoded output of an HEVC decoder. Hence, no changes to the lower-level logic of the HEVC decoder are required to implement the proposal. Formal subjective tests conducted...
Histopathological images contain rich phenotypic information, which is beneficial for classifying tumor subtypes and predicting the development of diseases. The vast size of pathological slides makes it impossible to directly train convolutional neural networks (CNNs) on whole slide images (WSIs). Most previous weakly supervised works divide high-resolution WSIs into small patches and separately input them into the CNN to classify them as tumors or normal areas. The first difficulty is that although the method based on this framework...