NFDI4DS | UHH-SEMS - Publication Details

Hoseok Do

ORCID: 0000-0003-4005-1999

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5103154878

Research Areas

Computer Graphics and Visualization Techniques
Generative Adversarial Networks and Image Synthesis
Video Surveillance and Tracking Methods
Smart Parking Systems Research
Advanced Steganography and Watermarking Techniques
Face recognition and analysis
Robotics and Sensor-Based Localization
Digital Media Forensic Detection
Vehicle License Plate Recognition
Image Enhancement Techniques
Image Processing Techniques and Applications
Advanced Vision and Imaging
Chaos-based Image/Signal Encryption
Advanced Neural Network Applications
Image Processing and 3D Reconstruction
3D Shape Modeling and Analysis
Speech and Audio Processing
Advanced Image Processing Techniques
Human Pose and Action Recognition
Video Coding and Compression Technologies
Data Management and Algorithms

LG (United States)
2024

LG (South Korea)
2019-2024

Seoul National University
2008-2023

Blending-NeRF: Text-Driven Localized Editing in Neural Radiance Fields

OPENALEX - Publications

Hyeonseop Song Seokhun Choi Hoseok Do Chul Lee Taehyeong Kim

Text-driven localized editing of 3D objects is particularly difficult as locally mixing the original object with intended new and style effects without distorting object's form not a straightforward process. To address this issue, we propose novel NeRF-based model, Blending-NeRF, which consists two NeRF networks: pre-trained editable NeRF. Additionally, introduce blending operations that allow Blending-NeRF to properly edit target regions are by text. By using pretrained vision-language...

10.1109/iccv51070.2023.01323 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

A blind MPEG-2 video watermarking robust to camcorder recording

OPENALEX - Publications

Dooseop Choi Hoseok Do Hyuk Choi Taejeong Kim

10.1016/j.sigpro.2009.10.009 article EN Signal Processing 2009-10-22

Context-Based Parking Slot Detection With a Realistic Dataset

OPENALEX - Publications

Hoseok Do Jin Young Choi

The autonomous parking of vehicles requires the ability to accurately locate an available slot in vicinity a vehicle. Since slots have variety shapes and colors, may be occluded by obstacles, or look different due surroundings such as lighting, locating them can challenging task. In this paper, we propose context-based detection method inspired process human driver finding slot. Our consists two deep network modules: context recognizer detector. identifies environment (type, angle,...

10.1109/access.2020.3024668 article EN cc-by IEEE Access 2020-01-01

Diffusion-Driven GAN Inversion for Multi-Modal Face Image Generation

OPENALEX - Publications

Jihyun Kim Changjae Oh Hoseok Do Soo-Hyun Kim Kwanghoon Sohn

10.1109/cvpr52733.2024.00990 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Digital Video Watermarking Based on Histogram and Temporal Modulation and Robust to Camcorder Recording

OPENALEX - Publications

Hoseok Do Dooseop Choi Taejeong Kim Hyuk Jin Choi

This paper presents a blind digital video watermarking scheme, which is especially robust to camcorder recording attacks and also variety of common processing geometric distortions. Using the fact that nearby frames sequence are quite similar, method embeds watermark by temporal modulation frames. The pattern used in generated based on pixel-value histogram, makes extraction free from synchronization. To make it imperceptible, adjusted according roughly Human Visual System. experimental...

10.1109/isspit.2008.4775680 article EN 2008-12-01

Quantitative Manipulation of Custom Attributes on 3D-Aware Image Synthesis

OPENALEX - Publications

Hoseok Do Eunkyung Yoo Taehyeong Kim Lee Eui Chul Jin young Choi

While 3D-based GAN techniques have been successfully applied to render photo-realistic 3D images with a variety of attributes while preserving view consistency, there has little research on how fine-control without limiting specific category objects their properties. To fill such gap, we propose novel image manipulation model representations for fine-grained control custom attributes. By extending the latest models (e.g., EG3D), our user-friendly quantitative enables fine yet normalized...

10.1109/cvpr52729.2023.00824 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Implementation of CNN-based parking slot type classification using around view images

OPENALEX - Publications

Hoseok Do Ji-Hyun Kim Kwon‐Ho Lee Deukhyeon Kim Kyu-yeol Chae and 1 more

This paper presents a commercial implementation of CNN-based classification parking slot type using around view images. The existing automatic systems use ultrasonic sensors, but they often fail to classify the types slots. Around images can depict slots distinguishably. However, due diverse lighting and ground conditions, it is difficult Moreover, hard find lines since are occluded by vehicle or erased. To overcome these problems, we have constructed an extensive dataset composed labeled...

10.1109/icce46568.2020.9212312 article EN 2023 IEEE International Conference on Consumer Electronics (ICCE) 2020-01-01

Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation

OPENALEX - Publications

Jihyun Kim Changjae Oh Hoseok Do Soo-Hyun Kim Kwanghoon Sohn

We present a new multi-modal face image generation method that converts text prompt and visual input, such as semantic mask or scribble map, into photo-realistic image. To do this, we combine the strengths of Generative Adversarial networks (GANs) diffusion models (DMs) by employing features in DM latent space pre-trained GANs. simple mapping style modulation network to link two convert meaningful representations feature maps attention codes. With GAN inversion, estimated codes can be used...

10.48550/arxiv.2405.04356 preprint EN arXiv (Cornell University) 2024-05-07

Click-Gaussian: Interactive Segmentation to Any 3D Gaussians

OPENALEX - Publications

Seokhun Choi Hyeonseop Song Jaechul Kim Taehyeong Kim Hoseok Do

Interactive segmentation of 3D Gaussians opens a great opportunity for real-time manipulation scenes thanks to the rendering capability Gaussian Splatting. However, current methods suffer from time-consuming post-processing deal with noisy output. Also, they struggle provide detailed segmentation, which is important fine-grained scenes. In this study, we propose Click-Gaussian, learns distinguishable feature fields two-level granularity, facilitating without post-processing. We delve into...

10.48550/arxiv.2407.11793 preprint EN arXiv (Cornell University) 2024-07-16

Effective Photometric Alignment for Surround View Monitoring System

OPENALEX - Publications

Kwon‐Ho Lee Deukhyeon Kim Hoseok Do Jihyun Kim Kyu-yeol Chae

Surround view monitoring (SVM) system provides a composite bird-eye of the vehicle to assist in safe parking. Since each camera independently performs auto exposure (AE) and white balance (AWB), has noticeable boundaries between adjacent views. To achieve seamlessly stitched view, we propose an effective photometric alignment for surround using simple additive gain model. Experimental results show that proposed method view. And processing time achieves 3ms on NVIDIA Tegra CX embedded platform.

10.1109/icce-berlin47944.2019.8966185 article EN 2019-09-08

Blending-NeRF: Text-Driven Localized Editing in Neural Radiance Fields

OPENALEX - Publications

Hyeonseop Song Seokhun Choi Hoseok Do Chul Lee Taehyeong Kim

Text-driven localized editing of 3D objects is particularly difficult as locally mixing the original object with intended new and style effects without distorting object's form not a straightforward process. To address this issue, we propose novel NeRF-based model, Blending-NeRF, which consists two NeRF networks: pretrained editable NeRF. Additionally, introduce blending operations that allow Blending-NeRF to properly edit target regions are by text. By using vision-language aligned CLIP,...

10.48550/arxiv.2308.11974 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Coming Soon ...