- Advanced Vision and Imaging
- Video Coding and Compression Technologies
- Robotics and Sensor-Based Localization
- Advanced Data Compression Techniques
- Video Surveillance and Tracking Methods
- Indoor and Outdoor Localization Technologies
- Advanced Image Processing Techniques
- Video Analysis and Summarization
- Remote Sensing and LiDAR Applications
- Image and Video Quality Assessment
- Advanced Image and Video Retrieval Techniques
- Image Enhancement Techniques
- Digital Filter Design and Implementation
- Multimedia Communication and Technology
- Color Science and Applications
- Computer Graphics and Visualization Techniques
- Robotic Path Planning Algorithms
- Advanced Steganography and Watermarking Techniques
- Target Tracking and Data Fusion in Sensor Networks
- Image Processing Techniques and Applications
- 3D Shape Modeling and Analysis
- 3D Surveying and Cultural Heritage
- Wireless Communication Security Techniques
- Visual Attention and Saliency Detection
- Integrated Energy Systems Optimization
Dankook University
2024
Electronics and Telecommunications Research Institute
2006-2024
York University
2018-2024
National University of Singapore
2024
Wuhan University
2020
Sejong University
2017
Korea Advanced Institute of Science and Technology
2007-2015
Carnegie Mellon University
2011
Korea Institute of Robot and Convergence
2011
Georgia Institute of Technology
2002
The DeepGlobe Building Extraction Challenge poses the problem of localizing all building polygons in given satellite images. We can create using an existing instance segmentation algorithm based on Mask R-CNN. However, produced from have irregular shapes, which are far different real footprint boundaries and therefore cannot be directly applied to many cartographic engineering applications. Hence, we present a method combining R-CNN with boundary regularization. Through experiments, find...
Inverted planar perovskite solar cells with PCBM ETL have poor film formation and charge transfer. Adding MgO improves photoluminescence, carrier lifetime, efficiency to 15.12%, enhances X-ray detector performance.
Versatile Video Coding (H.266/VVC) is the newest video coding standard jointly developed by Joint Experts Team (JVET), which organized ITU-T Group (VCEG) and ISO/IEC Moving Picture (MPEG). H.266/VVC provides about 40% bitrate reduction compared with High Efficiency (H.265/HEVC) for same visual quality. This paper introduces in detail core structure of highlighting its features within block partitioning structure, intra/inter prediction, transform, quantization, in-loop filtering, to...
A LiDAR-assisted panoramic visual simultaneous localization and mapping (SLAM) system for a mobile (MMS) is presented in this paper. The feasibility research on the SLAM MMS with camera tilted LiDAR without GPS/IMU sparked our interest. Because of significant disparity spatial sensing coverage, we show that employing as primary sensor more suitable than using particular combination. Existing systems, other hand, produce up-to-scale results, making them inappropriate many applications require...
Abstract. Ultrawide-band (UWB) ranging technology and multilateration techniques have recently been emerging solutions for positioning unmanned aerial vehicles (UAVs) in GNSS-denied environments. This solution offers cm-level accuracy considerable robustness to multipath receptions. UWB modules are commonly used an anchor-based configuration; i.e., one tag is mounted on the UAV, several anchors installed ground. In real-world operational conditions, can form a planar or near-planar surface....
Abstract To represent immersive media providing six degree‐of‐freedom experience, moving picture experts group (MPEG) video (MIV) was developed to compress multiview videos. Meanwhile, the state‐of‐the‐art versatile coding (VVC) also supports multilayer (ML) functionality, enabling of In this study, we designed experimental conditions assess performance these two standards in terms objective and subjective quality. We observe that their performances are highly dependent on input source, such...
The York University Teledyne Optech (YUTO) Mobile Mapping System (MMS) Dataset, encompassing four sequences totaling 20.1 km, was thoroughly assembled through two data collection expeditions on August 12, 2020, and June 21, 2019. Acquisitions were performed using a uniquely equipped vehicle, fortified with panoramic camera, tilted LiDAR, Global Positioning (GPS), an Inertial Measurement Unit (IMU), journeying strategic locations: the Keele Campus in Toronto headquarters City of Vaughan,...
The detection of free space and obstacles in a scene is essential for safe driving. Among sensors environment perception, stereo-vision promising as it provides 3D perception information. Moreover current decreasing price camera module makes vision sensor attractive, taking into account the consumer product. In this paper we propose an algorithm scene, using stereo-vision. Contrary to previous generic obstacle methods that have strong assumption placement with respect ground or estimate...
In this paper, we present a novel smoothing approach for ultra-wideband (UWB) aided unmanned aerial vehicle (UAV) positioning. Existing works based on or filtering estimate 3D position of UAV by updating solution each single 1D low-dimensional UWB range measurement. However, measurement merely acts as weak constraint in space estimation, and thus it can often lead to incorrect estimation unfavorable conditions. Inspired the idea that multilateration outcome be utilized providing strong...
In this paper, we introduced a novel voxel-wise UV parameterization and view-dependent texture synthesis for the immersive rendering of truncated signed distance field (TSDF) scene model. The proposed delegates precomputed map to each voxel using lookup table consequently, enabling efficient high-quality mapping without complex process. By leveraging convenient parameterization, our method extracts set local maps from multiview color images separates them into single view-independent diffuse...
Textured meshes are widely used in computer graphics to represent 3D scenes, with UV mapping playing a crucial role establishing bijective between the mesh surface and 2D texture. This not only allows for enhancement of rendering quality but also enables compression textures using standard image or video codecs. However, when reconstructing from real-world multiview images, resulting texture maps often suffer fragmentation due geometric inaccuracies excessive tessellation reconstructed...
An efficient distributed video coding algorithm using symmetric motion estimation and channel division is proposed in this work. We employ the to generate high quality side information for Wyner-Ziv frames. Also, division, we classify blocks into reliable ones unreliable ones. Then, transmit parity bits only, achieving a gain. Simulation results demonstrate that provides up 4 dB better PSNR performance than conventional algorithms.
The emerging versatile video coding (VVC) standard currently adopts 67 intra-prediction modes in order to improve the performance. most probable mode (MPM) is used encode prediction efficiently based on of neighbouring blocks. Due an increase number intra-modes and resolution input sequence, it necessary intra-mode current block. This Letter proposes efficient method extending MPM called frequent (MFM), which exploits occurrences proposed MFM derives intra-mode. Then derived signaled by a...
A novel concept of channel division to improve the performance distributed video coding is proposed in this work. At decoder, algorithm partitions each side information frame into multiple regions with different expected distortions. It shown that partitioning equivalent virtual noisy from encoder decoder channels. Then, analyzes noise characteristics those channels, and allocates a limited bit budget channels adaptively rate-distortion performance. Simulation results demonstrate provides up...
High Efficiency Video Coding is the latest video compression standard, which achieves best coding performance up until now. Specifically, intra prediction a tool that removes spatial redundancy in single frame and then predictor generated from its neighboring reference samples based on specific interpolation scheme. In this paper, we propose an edge-preserving sample filtering method using bilateral filter, implemented as hardware-friendly. Two parameters of filter are modeled by block size...
Precise positioning of the Unmanned Aerial Vehicle (UAV) is critical to conduct many sophisticated civil and military applications in challenging environments. Many the-state-of-the-art methods rely on active range sensors. Among available ranging sensors, Ultra-wideband (UWB) can provide benefits such as high precision, power efficiency, not prone multipath propagation noise. Thus, UWB has recently been attracting interests from research community a complementary sensor. However, there...
This paper describes a novel sensor system for 3D world modeling of an autonomous vehicle in large-scale outdoor environments. When performs path planning and following, well-constructed model target environment is very important analyze the track determined path. To generate well-construct model, we develop system. The proposed consists two 2D laser scanners, single cameras, DGPS (Differential Global Positioning System) IMU (Inertial Measurement System). We verify effectiveness through...
In this paper, the technical analysis and characteristics of screen content coding (SCC) based on High efficiency video (HEVC) are presented. For SCC, which is increasingly used these days, HEVC SCC standardization has been proceeded. Technologies such as intra block copy (IBC), palette coding, adaptive color transform developed adopted to standard. This paper examines IBC that significantly impacts RD performance for content. The reference model (SCM) 4.0 was comparatively analyze range...
This paper proposes a Subsampled Sum-Modified-Laplacian (SSML) operator for the block classification of Adaptive Loop Filter (ALF) in Versatile Video Coding (VVC). The VVC Test Model (VTM)-2.0 includes Geometry transformation-based ALF (GALF) with 4 × classification, single 7 Luma diamond-shaped filter, and spatial adaptation at Tree Block (CTB) level to improve coding efficiency VVC. However, 1-D (1-Dimensional) ModifiedLaplacian (ML) values various directions are calculated all sample...