- Video Coding and Compression Technologies
- Advanced Vision and Imaging
- Advanced Image Processing Techniques
- Advanced Data Compression Techniques
- Image and Video Quality Assessment
- Video Analysis and Summarization
- Image and Signal Denoising Methods
- Multimodal Machine Learning Applications
- Advanced Image and Video Retrieval Techniques
- Digital Filter Design and Implementation
- Image Retrieval and Classification Techniques
- Domain Adaptation and Few-Shot Learning
- Visual Attention and Saliency Detection
- Generative Adversarial Networks and Image Synthesis
- Digital Media Forensic Detection
- Computer Graphics and Visualization Techniques
- Pressure Ulcer Prevention and Management
- Integrated Circuits and Semiconductor Failure Analysis
- Structural Health Monitoring Techniques
- Cancer Treatment and Pharmacology
- Hand Gesture Recognition Systems
- Optical Systems and Laser Technology
- Color Science and Applications
- Optical Coherence Tomography Applications
- Pelvic floor disorders treatments
Kyungpook National University
2020-2024
Ewha Womans University
2019
Yonsei University
2018
Korea Electronics Technology Institute
2018
Hanyang University
2006-2017
Anyang University
2016
Software (Spain)
2016
In this paper, we propose a fast decision scheme using lightweight neural network (LNN) to avoid redundant block partitioning in versatile video coding (VVC). A more structure, named the multi-type tree (MTT) which includes binary trees (BTs) and ternary (TTs), is adopted by VCC, addition traditional quadtree structure. The MTT improved efficiency compared with previous standards. However, new structures, mainly TT, significantly increased complexity of VVC encoder. Although widespread...
This paper presents a fast encoding method for versatile video coding (VVC) using an early determination scheme that skips redundant multi-type tree (MTT) pruning. MTT consists of binary and ternary trees (TTs) with traditional quadtrees has recently attracted considerable VVC research interest due to efficiency beyond HEVC. However, the additional trees, particularly TTs, significantly increased complexity, which rarely been studied previously. Therefore, we identified TT characteristics in...
In this paper, we propose a fast encoding method to facilitate an affine motion estimation (AME) process in versatile video coding (VVC) encoders. The recently-launched VVC project for next-generation standardization far outperforms the High Efficiency Video Coding (HEVC) standard terms of efficiency. first version test model (VTM) displays superior efficiency yet requires higher complexity due advanced inter-prediction techniques multi-type tree (MTT) structure. particular, AME technique is...
The recent development of video-based content platforms led the easy access to videos decades ago. However, some past have a old screen ratio. If an image with this ratio is executed on display wider ratio, excessively stretched horizontally or creates black box, which prevents efficient viewing content. In paper, we propose method for retargeting video frames while maintaining original important objects in using deep learning-based semantic segmentation and inpainting techniques. Our...
Recently, deep learning-based super-resolution (SR) models have been used to improve SR performance by equipping preprocessing networks with baseline networks. In particular, in video SR, which creates a high-resolution (HR) image multiple frames, optical flow extraction is accompanied process. These work effectively terms of quality, but at the cost increased network parameters, increase computational complexity and memory consumption for tasks restricted resources. One well-known approach...
To address the diversified needs of Internet, ISO/IEC JTC1/SC29/WG11 Moving Picture Experts Group (MPEG) started project Internet video coding (IVC) in July 2011. It is anticipated that any patent declaration associated with baseline profile this standard will indicate owner prepared to grant a free-ofcharge license an unrestricted number applicants worldwide. IVC has been developed MPEG from scratch by combining well-known existing technology elements and new contributions free-of-charge...
In this paper, an efficient motion estimation method for quadtree plus binary tree (QTBT) structure is presented JVET future video coding (FVC). To exploit the possibility to design a new standard better than HEVC, activity called has been active in MPEG and VCEG. One of most influential technologies proposed FVC QTBT structure, which can give more flexibility on prediction block quadtree-based HEVC. advance compression efficiency QTBT, we propose that, accurate estimation, sets point...
In this paper, we provide comments on the recent paper by Pan <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">et al.</i> that proposed an initial search point-based motion estimation skipping (ISP-MES) method. We found some discrepancies of method and its experimental results, especially in setting ISPs. clarify these issues and, as a result, enhanced results from work
Internet video coding (IVC) is an exploration within MPEG to develop a compression technology that expected of royalty-free and targeted the performance comparable MPEG-4 AVC/H.264 Constrained Baseline Profile (CBP). IVC codec has been steadily enhanced since 2011, so it valuable report comparison results in progress. In this paper, we evaluate comparing with AVC CBP terms bitrate PSNR together. Furthermore, also subjective quality two types viewers: experts non-experts. The show overall...
This paper introduces effective test patterns for system-on-chip and board interconnects. Initially, “ <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$6n$</tex> ” are introduced to completely detect diagnose both static crosstalk faults, where xmlns:xlink="http://www.w3.org/1999/xlink">$n$</tex> is the total number of interconnect nets. Then, more economic xmlns:xlink="http://www.w3.org/1999/xlink">$4n + 1$</tex> described faults nets separated...
An efficient biprediction decision scheme of high efficiency video coding (HEVC) is proposed for fast-encoding applications. For low-delay applications, bidirectional prediction can be used to increase compression performance efficiently with previous reference frames. However, at the same time, computational complexity HEVC encoder significantly increased due additional search. Although a some research has attempted reduce this complexity, whether strongly related both motion and modes in...
최신 동영상 압축 표준 기술인 HEVC (High Efficiency Video Coding)는 기존의 AVC/H.264와 비교하여 동일 화질 대비 약 2배의 높은 압축률을 보여준다. 하지만 이러한 성능을 얻기 위하여 복잡한 연산이 필요한 기법들을 많이 도입한 결과, HEVC의 시간 복잡도는 AVC/H.264보다 더욱 증가하게 되었다. 문제를 해결하기 다양한 고속 알고리즘 연구가 진행되고 있다. 본 논문에서는 HEVC에 구현된 RMD (Rough Mode Decision)의 결과와 MPM (Most Probable Mode)을 활용하여 고속화된 최적 예측 모드 결정 방법을 제안한다. 제안한 방법은 과정에서 계산한 방향과 도출 방향을 선정한다. 이 All-Intra 환경에서 실험한 평균 0.8%의 BD-rate 손실이 발생하였고 전체 부호화 실행 시간은 26% 감소하였다. High Coding (HEVC), the latest video coding standard, has...
In this paper, we propose a method that skips the complex encoding processes of coding unit (CU) for HEVC intra frame coding. To speed-up process recursively explore all sizes CUs, most researchers have exploited spatial information thus far. On other hand, temporal correlation among frames has not been thoroughly investigated. We as an early termination method, which ineffective and associated cost. Simulation results are provided to verify efficiency proposed showing 32% time saving with...
MPEG has produced standards that have provided the industry with best video compression technologies. In order to address diversified needs of Internet, issued Call for Proposals (CfP) internet coding in July, 2011. It is anticipated any patent declaration associated Baseline Profile this standard will indicate owner prepared grant a free charge license an unrestricted number applicants on worldwide, non-discriminatory basis and under other reasonable terms conditions make, use, sell...
Autism Spectrum Disorder (ASD) can often make life difficult for children, therefore early diagnosis is necessary proper treatment and care. Thus, in this work, we consider the problem of detecting or classifying ASD children to aid medical professionals detection. To end, develop a deep learning model that analyzes video clips reacting sensory stimuli, with intent on capturing key differences reactions behavior between non-ASD patients. Unlike many works classification, their data consist...
텍스트-비디오 검색 문제는 주어진 텍스트 쿼리를 활용하여 관련 비디오를 검색하는 연구 분야이다. 이를 위해 텍스트와 비디오 데이터 데이터의 의미가 잘 표상된 공통-임베딩 공간을 구축하여 검색에 활용하는 기반 방법들이 널리 사용된다. 그러나 두 종류의 입력 데이터는 본질적으로 서로 다른 특성을 가지고 있기 때문에 공간에서의 분포 차이가 발생하고 이는 성능의 저하로 이어질 수 있다. 이러한 문제를 극복하기 본 연구는 비디오의 시각적 특징과 언어적 특징을 결합하는 새로운 특징 표현 방법을 제안한다. 구체적으로, 제안하는 모델은 비디오로부터 캡션을 생성하고 이 결합하여 정보와 정보가 결합된 개선된 벡터를 생성하였다. 강화된 벡터는 추론 과정에서 쿼리로 주어지는 후보 비디오간의 모달리티 간격을 완화시킴으로써, 성능을 향상시킨다. 제안된 방법의 검증하기 수행한 두가지 벤치마크 데이터셋에 대한 실험에서 베이스라인 모델 대비 Recall@sum 지표로 3.7%(MSR-VTT),...
Abstract The authors propose a compression strategy for 3D human pose estimation model based on transformer which yields high accuracy but increases the size. This approach involves pruning‐guided determination of search range to achieve lightweight under limited training time and identify optimal In addition, transformer‐based feature distillation (TFD) method, efficiently exploits in terms both size by leveraging architecture characteristics. Pruning‐guided TFD is first that employs...
<title>Abstract</title> In the video compression industry, tailored to machine vision tasks has recently emerged as a critical area of focus. Given unique characteristics vision, current practice directly employing conventional codecs reveals inefficiency, which requires compressing unnecessary regions. this paper, we propose framework that more aptly encodes regions distinguished by enhance coding efficiency. For that, proposed consists deep learning-based adaptive switch networks guide...