- Advanced Neural Network Applications
- Domain Adaptation and Few-Shot Learning
- Advanced Image and Video Retrieval Techniques
- Multimodal Machine Learning Applications
- Machine Learning and Data Classification
- Adversarial Robustness in Machine Learning
- Handwritten Text Recognition Techniques
- Anomaly Detection Techniques and Applications
- Face Recognition and Analysis
- Generative Adversarial Networks and Image Synthesis
- Advanced Vision and Imaging
- Text and Document Classification Technologies
- Neural Networks and Applications
- Visual Attention and Saliency Detection
- Image Enhancement Techniques
- Optical Measurement and Interference Techniques
- Natural Language Processing Techniques
- Human Pose and Action Recognition
- Image Retrieval and Classification Techniques
- Image Processing and 3D Reconstruction
- COVID-19 diagnosis using AI
- Robotics and Sensor-Based Localization
- CCD and CMOS Imaging Sensors
- Advancements in Photolithography Techniques
- Educational Technology and Assessment
Sogang University
2021-2024
California State University, Fresno
2024
University of California, San Francisco
2024
Istituto Tecnico Industriale Alessandro Volta
2021
Weatherford College
2021
Naver (South Korea)
2019-2021
Yonsei University
2004-2020
Pohang University of Science and Technology
1998
Regional dropout strategies have been proposed to enhance the performance of convolutional neural network classifiers. They have proved to be effective for guiding the model to attend on less discriminative parts of objects (e.g. the leg as opposed to the head of a person), thereby letting the network generalize better and have better object localization capabilities. On the other hand, current methods for regional dropout remove informative pixels from training images by overlaying a patch of either black pixels or random noise. Such removal is not desirable because it suffers...
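As a concrete illustration of the regional dropout idea described above (not the authors' exact method), here is a minimal PyTorch sketch that overlays a randomly placed square patch of black pixels or random noise on a batch of training images; the patch size and fill mode are hypothetical parameters.

```python
# Minimal sketch of regional dropout by patch overlay (illustrative, not the paper's exact recipe).
import torch

def regional_dropout(images: torch.Tensor, patch_size: int = 56, fill: str = "black") -> torch.Tensor:
    """Overlay a randomly placed square patch of black pixels or random noise.

    images: (N, C, H, W) float tensor with values in [0, 1].
    """
    n, c, h, w = images.shape
    out = images.clone()
    for i in range(n):
        # Sample the top-left corner of the patch uniformly over valid positions.
        y = torch.randint(0, h - patch_size + 1, (1,)).item()
        x = torch.randint(0, w - patch_size + 1, (1,)).item()
        if fill == "black":
            out[i, :, y:y + patch_size, x:x + patch_size] = 0.0
        else:  # random-noise fill
            out[i, :, y:y + patch_size, x:x + patch_size] = torch.rand(c, patch_size, patch_size)
    return out

# Usage: augmented = regional_dropout(torch.rand(8, 3, 224, 224), patch_size=56, fill="noise")
```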
Vision Transformer (ViT) extends the application range of transformers from language processing to computer vision tasks, serving as an alternative architecture to existing convolutional neural networks (CNN). While the transformer-based architecture has been innovative for computer vision modeling, the design convention towards an effective architecture has been less studied yet. From the successful design principles of CNNs, we investigate the role of spatial dimension conversion and its effectiveness on transformer-based architecture. We particularly attend to the dimension reduction principle of CNNs;...
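To make the spatial dimension reduction mentioned above concrete, the following is a minimal sketch (under my own assumptions, not the paper's exact architecture) of reshaping a sequence of ViT tokens back into a 2-D grid, pooling it, and flattening it again, which halves the spatial resolution between transformer stages.

```python
# Minimal sketch of spatial dimension reduction between transformer stages (illustrative assumption).
import torch
import torch.nn as nn

class TokenPool(nn.Module):
    """Reshape (N, H*W, C) tokens into a 2-D grid, pool spatially, and flatten back."""

    def __init__(self, channels: int):
        super().__init__()
        # Depthwise strided convolution halves the spatial resolution of the token grid.
        self.pool = nn.Conv2d(channels, channels, kernel_size=3, stride=2,
                              padding=1, groups=channels)

    def forward(self, tokens: torch.Tensor, height: int, width: int) -> torch.Tensor:
        n, _, c = tokens.shape
        grid = tokens.transpose(1, 2).reshape(n, c, height, width)
        grid = self.pool(grid)                      # (N, C, H/2, W/2)
        return grid.flatten(2).transpose(1, 2)      # (N, H/2 * W/2, C)

# Usage: TokenPool(192)(torch.rand(2, 14 * 14, 192), 14, 14).shape  -> (2, 49, 192)
```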
Weakly Supervised Object Localization (WSOL) techniques learn the object location using only image-level labels, without location annotations. A common limitation of these techniques is that they cover only the most discriminative part of the object, not the entire object. To address this problem, we propose an Attention-based Dropout Layer (ADL), which utilizes the self-attention mechanism to process the feature maps of the model. The proposed method is composed of two key components: 1) hiding the most discriminative part from the model to capture the integral extent of the object, and 2)...
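The sketch below is my reading of the attention-based dropout idea, not the official implementation: a channel-averaged self-attention map is turned either into a drop mask that hides the most discriminative region or into an importance map that highlights informative regions, chosen stochastically per step. The drop rate and threshold are hypothetical defaults.

```python
# Hedged sketch of an attention-based dropout layer (illustrative, not the authors' code).
import torch
import torch.nn as nn

class AttentionBasedDropout(nn.Module):
    def __init__(self, drop_rate: float = 0.75, drop_threshold: float = 0.8):
        super().__init__()
        self.drop_rate = drop_rate            # probability of applying the drop mask
        self.drop_threshold = drop_threshold  # fraction of the peak activation to hide

    def forward(self, feat: torch.Tensor) -> torch.Tensor:
        if not self.training:
            return feat
        # Self-attention map: channel-wise average of the feature maps, shape (N, 1, H, W).
        attention = feat.mean(dim=1, keepdim=True)
        if torch.rand(1).item() < self.drop_rate:
            # Drop mask: hide regions whose attention exceeds a fraction of the per-image maximum.
            peak = attention.amax(dim=(2, 3), keepdim=True)
            mask = (attention < self.drop_threshold * peak).float()
        else:
            # Importance map: highlight informative regions to keep classification power.
            mask = torch.sigmoid(attention)
        return feat * mask

# Usage: AttentionBasedDropout()(torch.rand(4, 512, 14, 14))
```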
Weakly-supervised object localization (WSOL) has gained popularity over the last years for its promise to train localization models with only image-level labels. Since the seminal WSOL work on class activation mapping (CAM), the field has focused on how to expand the attention regions to cover objects more broadly and localize them better. However, these strategies rely on full localization supervision to validate hyperparameters and perform model selection, which is in principle prohibited under the WSOL setup. In this paper, we argue that the task is ill-posed with only image-level labels,...
Weakly supervised semantic segmentation (WSSS) methods are often built on pixel-level localization maps obtained from a classifier. However, trained on class labels only, classifiers suffer from the spurious correlation between foreground and background cues (e.g. train and rail), fundamentally bounding the performance of WSSS. There have been previous endeavors to address this issue with additional supervision. We propose a novel source of information to distinguish foreground from the background: Out-of-Distribution...
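The abstract stops mid-sentence, but the stated idea of exploiting images that contain background cues without any of the foreground classes can be sketched as an auxiliary classification loss; the all-zero target for out-of-distribution images below is my own simplification, not necessarily the paper's formulation.

```python
# Hedged sketch: train a multi-label classifier so it stays silent on out-of-distribution images.
import torch
import torch.nn.functional as F

def wsss_classifier_loss(logits_in: torch.Tensor, labels_in: torch.Tensor,
                         logits_ood: torch.Tensor, ood_weight: float = 1.0) -> torch.Tensor:
    """logits_in/labels_in: (N, K) image-level multi-label data; logits_ood: (M, K) scores on OoD images."""
    # Standard multi-label classification loss on in-distribution images.
    loss_in = F.binary_cross_entropy_with_logits(logits_in, labels_in)
    # OoD images contain background cues but none of the K foreground classes,
    # so push all class scores toward zero to break spurious background correlations.
    loss_ood = F.binary_cross_entropy_with_logits(logits_ood, torch.zeros_like(logits_ood))
    return loss_in + ood_weight * loss_ood
```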
Both weakly supervised single object localization and semantic segmentation techniques learn an object's location using only image-level labels. However, these techniques are limited to covering the most discriminative part of the object, not the entire object. To address this problem, we propose an attention-based dropout layer, which utilizes the attention mechanism to locate the entire object efficiently. To achieve this, we devise two key components: 1) hiding the most discriminative part from the model to capture the entire object, and 2) highlighting the informative region to improve the classification power...
ImageNet has been the most popular image classification benchmark, but it is also one with a significant level of label noise. Recent studies have shown that many samples contain multiple classes, despite being assumed to be a single-label benchmark. They have thus proposed to turn the evaluation into a multi-label task, with exhaustive annotations per image. However, they have not fixed the training set, presumably because of the formidable annotation cost. We argue that the mismatch between single-label annotations and effectively multi-label images is equally, if not more,...
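As a small illustration of the multi-label evaluation this abstract refers to, the sketch below counts a top-1 prediction as correct if it matches any of the exhaustive labels for an image; this is a common way to score such benchmarks, though the paper's exact protocol may differ.

```python
# Sketch of multi-label top-1 accuracy: a prediction is correct if it hits any annotated class.
import torch

def multilabel_top1_accuracy(logits: torch.Tensor, label_sets: list[set[int]]) -> float:
    """logits: (N, K) class scores; label_sets: per-image sets of valid class indices."""
    top1 = logits.argmax(dim=1).tolist()
    correct = sum(pred in labels for pred, labels in zip(top1, label_sets))
    return correct / len(label_sets)

# Usage:
# multilabel_top1_accuracy(torch.tensor([[0.1, 0.9], [0.8, 0.2]]), [{1}, {0, 1}])  # -> 1.0
```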
Recently, low-shot learning has been proposed for handling the lack of training data in machine learning. Despite the importance of this issue, relatively little effort has been made to study the problem. In this paper, we aim to increase the size of the training dataset in various ways to improve the accuracy and robustness of face recognition. In detail, we adapt a generator from a Generative Adversarial Network (GAN) to the dataset, which includes a base set, a widely available dataset, and a novel set, a given limited dataset, while adopting transfer learning as a backend. Based on extensive...
Despite apparent human-level performances of deep neural networks (DNN), they behave fundamentally differently from humans. They easily change predictions when small corruptions such as blur and noise are applied on the input (lack of robustness), and they often produce confident predictions on out-of-distribution samples (improper uncertainty measure). While a number of researches have aimed to address those issues, the proposed solutions are typically expensive and complicated (e.g. Bayesian inference and adversarial training)....
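To make the robustness notion in this abstract concrete, here is a minimal sketch that compares a classifier's accuracy on clean inputs with its accuracy after adding Gaussian noise; the corruption type and severity are illustrative choices, not the paper's benchmark.

```python
# Sketch: accuracy drop under a simple Gaussian-noise corruption as a robustness probe.
import torch

@torch.no_grad()
def accuracy(model: torch.nn.Module, images: torch.Tensor, labels: torch.Tensor) -> float:
    preds = model(images).argmax(dim=1)
    return (preds == labels).float().mean().item()

@torch.no_grad()
def robustness_gap(model: torch.nn.Module, images: torch.Tensor, labels: torch.Tensor,
                   noise_std: float = 0.1) -> float:
    """Difference between clean accuracy and accuracy on noise-corrupted copies of the same images."""
    corrupted = (images + noise_std * torch.randn_like(images)).clamp(0.0, 1.0)
    return accuracy(model, images, labels) - accuracy(model, corrupted, labels)
```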
Weakly-supervised object localization (WSOL) enables finding an object using a dataset without any localization information. By simply training a classification model with only image-level annotations, the feature map of the model can be utilized as a score map for localization. In spite of many WSOL methods proposing novel strategies, there has not been a de facto standard about how to normalize the class activation map (CAM). Consequently, many methods have failed to fully exploit their own capacity because of a misused normalization method. In this paper, we review...
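Because this abstract is about how the class activation map is normalized before thresholding, the sketch below shows two common per-image normalization choices (min-max and max) whose difference can change which pixels survive a fixed threshold; the function name and defaults are mine.

```python
# Sketch of two per-image CAM normalization choices; a fixed threshold behaves differently under each.
import torch

def normalize_cam(cam: torch.Tensor, mode: str = "minmax", eps: float = 1e-8) -> torch.Tensor:
    """cam: (H, W) raw class activation map; returns values scaled into [0, 1]."""
    if mode == "minmax":
        # Min-max: stretches the map so its minimum hits 0 and its maximum hits 1.
        return (cam - cam.min()) / (cam.max() - cam.min() + eps)
    if mode == "max":
        # Max: divides by the peak only, clamping negative activations at 0.
        return (cam / (cam.max() + eps)).clamp(min=0.0)
    raise ValueError(f"unknown mode: {mode}")

# A binary localization mask then comes from a fixed threshold, e.g. normalize_cam(cam) >= 0.2
```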
State-of-the-art techniques in weakly-supervised semantic segmentation (WSSS) using image-level labels exhibit severe performance degradation on driving scene datasets such as Cityscapes. To address this challenge, we develop a new WSSS framework tailored to driving scene datasets. Based on extensive analysis of dataset characteristics, we employ Contrastive Language-Image Pre-training (CLIP) as our baseline to obtain pseudo-masks. However, CLIP introduces two key challenges: (1) pseudo-masks from CLIP lack in representing...
The class activation mapping, or CAM, has been the cornerstone of feature attribution methods for multiple vision tasks. Its simplicity and effectiveness have led to wide applications in the explanation of visual predictions and weakly-supervised localization tasks. However, CAM has its own shortcomings. The computation of attribution maps relies on ad-hoc calibration steps that are not part of the training computational graph, making it difficult for us to understand the real meaning of the attribution values. In this paper, we improve CAM by explicitly incorporating a...
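For reference, the vanilla CAM baseline this abstract builds on is a weighted sum of the last convolutional feature maps with the classifier weights of the target class, followed by post-hoc calibration steps of the kind the abstract criticizes; the sketch below shows that baseline, not the improved method.

```python
# Sketch of the vanilla CAM baseline: class-weighted sum of the final feature maps.
import torch

def class_activation_map(features: torch.Tensor, fc_weight: torch.Tensor, class_idx: int) -> torch.Tensor:
    """features: (C, H, W) last conv feature maps; fc_weight: (num_classes, C) linear classifier weights."""
    weights = fc_weight[class_idx]                      # (C,)
    cam = torch.einsum("c,chw->hw", weights, features)  # weighted sum over channels
    # Ad-hoc calibration outside the training graph: clamp negatives, then min-max rescale to [0, 1].
    cam = cam.clamp(min=0.0)
    return (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)
```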
The goal of unsupervised co-localization is to locate the object in a scene under the assumptions that 1) the dataset consists of only one superclass, e.g., birds, and 2) there are no human-annotated labels in the dataset. The most recent method achieves impressive performance by employing self-supervised representation learning approaches such as predicting rotation. In this paper, we introduce a new contrastive objective directly on the attention maps to enhance the performance. Our loss function exploits rich information...
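The abstract truncates before the loss details, so the sketch below is only a generic contrastive objective on flattened attention maps, pulling two augmented views of the same image together and pushing other images away; treat the pairing scheme and temperature as my assumptions rather than the paper's loss.

```python
# Hedged sketch: InfoNCE-style contrastive loss over flattened attention maps of two augmented views.
import torch
import torch.nn.functional as F

def attention_contrastive_loss(attn_a: torch.Tensor, attn_b: torch.Tensor,
                               temperature: float = 0.1) -> torch.Tensor:
    """attn_a, attn_b: (N, H, W) attention maps of two views; matching indices are positives."""
    za = F.normalize(attn_a.flatten(1), dim=1)          # (N, H*W), unit norm
    zb = F.normalize(attn_b.flatten(1), dim=1)
    logits = za @ zb.t() / temperature                  # (N, N) cosine similarities
    targets = torch.arange(za.size(0), device=za.device)
    # Symmetric cross-entropy: each map should be most similar to its own other view.
    return 0.5 * (F.cross_entropy(logits, targets) + F.cross_entropy(logits.t(), targets))
```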