NFDI4DS | UHH-SEMS - Publication Details

Seungju Han

ORCID: 0000-0001-7293-1419

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5053165370

Research Areas

Topic Modeling
Natural Language Processing Techniques
Advanced Adaptive Filtering Techniques
Blind Source Separation Techniques
Video Analysis and Summarization
Speech and dialogue systems
Neural Networks and Applications
Face recognition and analysis
Tactile and Sensory Interactions
Music and Audio Processing
Advanced Vision and Imaging
Human Motion and Animation
Multimodal Machine Learning Applications
Speech and Audio Processing
Spectroscopy Techniques in Biomedical and Chemical Research
Biosensors and Analytical Detection
Interactive and Immersive Displays
Video Surveillance and Tracking Methods
Image and Signal Denoising Methods
Control Systems and Identification
Multimedia Communication and Technology
Domain Adaptation and Few-Shot Learning
Video Coding and Compression Technologies
Text and Document Classification Technologies
Human Pose and Action Recognition

Kongju National University
2024

Kyung Hee University
2021-2024

Chungbuk National University
2024

Samsung (South Korea)
2010-2023

Seoul National University
2023

Allen Institute for Artificial Intelligence
2023

National University
2023

University of Washington
2023

Inje University Busan Paik Hospital
2022

Samsung (United States)
2021

Disentangling Label Distribution for Long-tailed Visual Recognition

OPENALEX - Publications

Youngkyu Hong Seungju Han Kwanghee Choi Seokjun Seo Beomsu Kim and 1 more

The current evaluation protocol of long-tailed visual recognition trains the classification model on source label distribution and evaluates its performance uniform target distribution. Such has questionable practicality since may also be long-tailed. Therefore, we formulate as a shift problem where tar-get distributions are different. One significant hurdles in dealing with is entanglement between prediction. In this paper, focus disentangling from We first introduce simple but over-looked...

10.1109/cvpr46437.2021.00656 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

Towards Accurate Facial Landmark Detection via Cascaded Transformers

OPENALEX - Publications

Hui Li Zidong Guo Seon-Min Rhee Seungju Han Jae‐Joon Han

Accurate facial landmarks are essential prerequisites for many tasks related to human faces. In this paper, an accurate landmark detector is proposed based on cascaded transformers. We formulate detection as a coordinate regression task such that the model can be trained end-to-end. With self-attention in transformers, our inherently exploit structured relationships between landmarks, which would benefit under challenging conditions large pose and occlusion. During refinement, able extract...

10.1109/cvpr52688.2022.00414 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Drug classification with a spectral barcode obtained with a smartphone Raman spectrometer

OPENALEX - Publications

Un Jeong Kim Su‐Yeon Lee Hyochul Kim Yeongeun Roh Seungju Han and 6 more

Measuring, recording and analyzing spectral information of materials as its unique finger print using a ubiquitous smartphone has been desired by scientists consumers. We demonstrated it drug classification chemical components with Raman spectrometer. The spectrometer is based on the CMOS image sensor periodic array band pass filters, capturing 2D intensity map, newly defined barcode in this work. Here we show 11 major drugs are classified high accuracy, 99.0%, aid convolutional neural...

10.1038/s41467-023-40925-3 article EN cc-by Nature Communications 2023-08-29

Attentron: Few-Shot Text-to-Speech Utilizing Attention-Based Variable-Length Embedding

OPENALEX - Publications

Seungwoo Choi Seungju Han Dongyoung Kim Sungjoo Ha

On account of growing demands for personalization, the need a so-called few-shot TTS system that clones speakers with only few data is emerging.To address this issue, we propose Attentron, model voices unseen during training.It introduces two special encoders, each serving different purposes.A fine-grained encoder extracts variable-length style information via an attention mechanism, and coarse-grained greatly stabilizes speech synthesis, circumventing unintelligible gibberish even...

10.21437/interspeech.2020-2096 article EN Interspeech 2022 2020-10-25

Rethinking Feature-based Knowledge Distillation for Face Recognition

OPENALEX - Publications

Jingzhi Li Zidong Guo Hui Li Seungju Han Ji-Won Baek and 3 more

With the continual expansion of face datasets, feature-based distillation prevails for large-scale recognition. In this work, we attempt to remove identity supervision in student training, spare GPU memory from saving massive class centers. However, naive removal leads inferior result. We carefully inspect performance degradation perspective intrinsic dimension, and argue that gap namely gap, is intimately connected infamous capacity problem. By constraining teacher's search space with...

10.1109/cvpr52729.2023.01930 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

The correntropy MACE filter

OPENALEX - Publications

Kyu-Hwa Jeong Weifeng Liu Seungju Han Erion Hasanbelliu José C. Prı́ncipe

10.1016/j.patcog.2008.09.023 article EN Pattern Recognition 2008-10-15

BiasAdv: Bias-Adversarial Augmentation for Model Debiasing

OPENALEX - Publications

Jongin Lim Young‐Dong Kim Byungjai Kim Chanho Ahn Jinwoo Shin and 2 more

Neural networks are often prone to bias toward spurious correlations inherent in a dataset, thus failing generalize unbiased test criteria. A key challenge resolving the issue is significant lack of bias-conflicting training data (i. e., samples without correlations). In this paper, we propose novel augmentation approach termed Bias-Adversarial (BiasAdv) that supplements with adversarial images. Our idea an attack on biased model makes decisions based may generate syn-thetic samples, which...

10.1109/cvpr52729.2023.00373 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Quality-Agnostic Image Recognition via Invertible Decoder

OPENALEX - Publications

Insoo Kim Seungju Han Ji-Won Baek Seong-Jin Park Jae‐Joon Han and 1 more

Despite the remarkable performance of deep models on image recognition tasks, they are known to be susceptible common corruptions such as blur, noise, and low-resolution. Data augmentation is a conventional way build robust model by considering these during training. However, naive data scheme may result in non-specialized for particular corruptions, tends learn averaged distribution among corruptions. To mitigate issue, we propose new paradigm training networks that produce clean-like...

10.1109/cvpr46437.2021.01208 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

Pushing the Performance Limit of Scene Text Recognizer without Human Annotation

OPENALEX - Publications

Caiyuan Zheng Hui Li Seon-Min Rhee Seungju Han Jae‐Joon Han and 1 more

Scene text recognition (STR) attracts much attention over the years because of its wide application. Most methods train STR model in a fully supervised manner which requires large amounts labeled data. Although synthetic data contributes lot to STR, it suffers from real-to-synthetic domain gap restricts performance. In this work, we aim boost models by leveraging both and numerous real unlabeled images, exempting human annotation cost thoroughly. A robust con-sistency regularization based...

10.1109/cvpr52688.2022.01372 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Evaluation of human tangential force input performance

OPENALEX - Publications

Bhoram Lee Hyunjeong Lee Soo‐Chul Lim Hyungkew Lee Seungju Han and 1 more

While interacting with mobile devices, users may press against touch screens and also exert tangential force to the display in a sliding manner. We seek guide UI design based on applied by user surface of hand-held device. A prototype an interface using input was implemented utilizing sensitive layer elastic used for experiment. investigated controllability reach maintain target levels considered effects hand pose direction input. Our results imply no significant difference performance when...

10.1145/2207676.2208727 article EN 2012-05-05

Implantable pH Sensing System Using Vertically Stacked Silicon Nanowire Arrays and Body Channel Communication for Gastroesophageal Reflux Monitoring

OPENALEX - Publications

Changhee Kim Seungju Han Tae-Hwan Kim Sangmin Lee

Silicon nanowires (SiNWs) are emerging as versatile components in the fabrication of sensors for implantable medical devices because their exceptional electrical, optical, and mechanical properties. This paper presents a novel top-down method vertically stacked SiNWs, eliminating need wet oxidation, etching, nanolithography. The integration these SiNWs into body channel communication (BCC) circuits was also explored. fabricated were confirmed to be capable forming arrays with multiple layers...

10.3390/s24030861 article EN cc-by Sensors 2024-01-29

Meet Your Favorite Character: Open-domain Chatbot Mimicking Fictional Characters with only a Few Utterances

OPENALEX - Publications

Seungju Han Beomsu Kim Jin Yong Yoo Seokjun Seo Sang‐Bum Kim and 2 more

Seungju Han, Beomsu Kim, Jin Yong Yoo, Seokjun Seo, Sangbum Enkhbayar Erdenee, Buru Chang. Proceedings of the 2022 Conference North American Chapter Association for Computational Linguistics: Human Language Technologies. 2022.

10.18653/v1/2022.naacl-main.377 article EN cc-by Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2022-01-01

A minimum-error entropy criterion with self-adjusting step-size (MEE-SAS)

OPENALEX - Publications

Seungju Han Sudhir Rao Deniz Erdoğmuş Kyu-Hwa Jeong José C. Prı́ncipe

10.1016/j.sigpro.2007.05.003 article EN Signal Processing 2007-05-19

AR Smart Evacuation Map Using QR Code

OPENALEX - Publications

Ji-Hoon Choi Yunji Kim Seungju Han Hyun Lee

10.9728/dcs.2024.25.3.789 article EN cc-by-nc Journal of Digital Contents Society 2024-03-31

The Machine Learning Ensemble for Analyzing Internet of Things Networks: Botnet Detection and Device Identification

OPENALEX - Publications

Seungju Han Seong-Su Yoon Ieck-Chae Euom

10.32604/cmes.2024.053457 article EN Computer Modeling in Engineering & Sciences 2024-01-01

Implantable nanostructured MEA with biphasic current stimulator for retinal prostheses

OPENALEX - Publications

Seungju Han Chang‐Hee Kim Kang‐Il Kim Sangmin Lee

In retinal prosthetic systems on multi-channel microelectrodes to effectively stimulate neurons, the electrode-electrolyte interface impedance of a microelectrode should be minimized drive sufficiently large current at given supply voltage.This paper presents fabrication nanostructured array with simplified and its characteristic evaluation using biphasic stimulator.The base diameter 25 μm, 50 75 μm are fabricated, maximum allowable injection limits measured verify estimated limit. Also,...

10.3233/thc-235001 article EN Technology and Health Care 2023-03-03

Champagne: Learning Real-world Conversation from Large-Scale Web Videos

OPENALEX - Publications

Seungju Han Jack Hessel Nouha Dziri Yejin Choi Youngjae Yu

Visual information is central to conversation: body gestures and physical behaviour, for example, contribute meaning that transcends words alone. To date, however, most neural conversational models are limited just text. We introduce Champagne, a generative model of conversations can account visual contexts. train we collect release YTD-18M, large-scale corpus 18M video-based dialogues. YTD-18M constructed from web videos: crucial our data collection pipeline pretrained language converts...

10.1109/iccv51070.2023.01421 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models

OPENALEX - Publications

Lee Jung Hyun Kim Sung-Bin Seungju Han Youngjae Yu Tae-Hyun Oh

10.18653/v1/2024.findings-naacl.73 article EN Findings of the Association for Computational Linguistics: NAACL 2022 2024-01-01

Virtual world control system using sensed information and adaptation engine

OPENALEX - Publications

Sang‐Kyun Kim Yong Soo Joo Minho Shin Seungju Han Jae‐Joon Han

10.1016/j.image.2012.10.006 article EN Signal Processing Image Communication 2012-11-06

Generative Correlation Discovery Network for Multi-label Learning

OPENALEX - Publications

Lichen Wang Zhengming Ding Seungju Han Jae‐Joon Han Changkyu Choi and 1 more

The goal of Multi-label learning is to predict multiple labels each single instance. This a challenging problem since the training data limited, long-tail label distribution, and complicated correlations. Generally, more samples correlation knowledge would benefit performance. However, it difficult obtain large-scale well-labeled datasets, building such map requires sophisticated semantic knowledge. To this end, we propose an end-to-end Generative Correlation Discovery Network (GCDN) method...

10.1109/icdm.2019.00069 article EN 2021 IEEE International Conference on Data Mining (ICDM) 2019-11-01

Sample-wise Label Confidence Incorporation for Learning with Noisy Labels

OPENALEX - Publications

Chanho Ahn Kikyung Kim Ji-Won Baek Jongin Lim Seungju Han

Deep learning algorithms require large amounts of labeled data for effective performance, but the presence noisy labels often significantly degrade their performance. Although recent studies on designing a robust objective function to label noise, known as loss method, have shown promising results with labels, they suffer from issue underfitting not only samples also clean ones, leading suboptimal model To address this issue, we propose novel framework that selectively suppresses while...

10.1109/iccv51070.2023.00175 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

Coming Soon ...