Sung Ju Hwang

ORCID: 0000-0002-9675-2324
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Domain Adaptation and Few-Shot Learning
  • Topic Modeling
  • Advanced Neural Network Applications
  • Multimodal Machine Learning Applications
  • Machine Learning and Data Classification
  • Adversarial Robustness in Machine Learning
  • Natural Language Processing Techniques
  • Anomaly Detection Techniques and Applications
  • Advanced Image and Video Retrieval Techniques
  • Machine Learning and ELM
  • Generative Adversarial Networks and Image Synthesis
  • Advanced Graph Neural Networks
  • Explainable Artificial Intelligence (XAI)
  • Speech Recognition and Synthesis
  • Privacy-Preserving Technologies in Data
  • Neural Networks and Applications
  • Music and Audio Processing
  • Machine Learning in Healthcare
  • COVID-19 diagnosis using AI
  • Text and Document Classification Technologies
  • Machine Learning and Algorithms
  • Speech and dialogue systems
  • Data Stream Mining Techniques
  • Computational Drug Discovery Methods
  • Image Retrieval and Classification Techniques

Kootenay Association for Science & Technology
2019-2024

Korea Advanced Institute of Science and Technology
2017-2024

International Graduate School of English
2024

Amazon (Germany)
2023

Naver (South Korea)
2023

Samsung (United States)
2020

Samsung (South Korea)
2020

Ulsan National Institute of Science and Technology
2015-2017

Walt Disney (United States)
2014

The University of Texas at Austin
2010-2013

We propose a novel deep network architecture for lifelong learning which we refer to as Dynamically Expandable Network (DEN), that can dynamically decide its capacity it trains on sequence of tasks, learn compact overlapping knowledge sharing structure among tasks. DEN is efficiently trained in an online manner by performing selective retraining, expands upon arrival each task with only the necessary number units, and effectively prevents semantic drift splitting/duplicating units...

10.48550/arxiv.1708.01547 preprint EN other-oa arXiv (Cornell University) 2017-01-01

The goal of few-shot learning is to learn a classifier that generalizes well even when trained with limited number training instances per class. recently introduced meta-learning approaches tackle this problem by generic across large multiclass classification tasks and generalizing the model new task. Yet, such meta-learning, low-data in novel task still remains. In paper, we propose Transductive Propagation Network (TPN), framework for transductive inference classifies entire test set at...

10.48550/arxiv.1805.10002 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Reducing bit-widths of activations and weights deep networks makes it efficient to compute store them in memory, which is crucial their deployments resource-limited devices, such as mobile phones. However, decreasing with quantization generally yields drastically degraded accuracy. To tackle this problem, we propose learn quantize via a trainable quantizer that transforms discretizes them. Specifically, parameterize the intervals obtain optimal values by directly minimizing task loss...

10.1109/cvpr.2019.00448 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Dense computer vision tasks such as object detection and segmentation require effective multi-scale feature representation for detecting or classifying objects regions with varying sizes. While Convolutional Neural Networks (CNNs) have been the dominant architectures tasks, recently introduced Vision Transformers (ViTs) aim to replace them a backbone. Similar CNNs, ViTs build simple multi-stage structure (i.e., fine-to-coarse) single-scale patches. In this work, different perspective from...

10.1109/cvpr52688.2022.00714 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Visual attributes expose human-defined semantics to object recognition models, but existing work largely restricts their influence mid-level cues during classifier training. Rather than treat as intermediate features, we consider how learning visual properties in concert with categories can regularize the models for both. Given a low-level feature space together attribute-and object-labeled image data, learn shared lower-dimensional representation by optimizing joint loss function that...

10.1109/cvpr.2011.5995543 article EN 2011-06-01

Existing adversarial learning approaches mostly use class labels to generate samples that lead incorrect predictions, which are then used augment the training of model for improved robustness. While some recent works propose semi-supervised methods utilize unlabeled data, they still require labels. However, do we really need at all, adversarially robust deep neural networks? In this paper, a novel attack makes confuse instance-level identities perturbed data samples. Further, present...

10.48550/arxiv.2006.07589 preprint EN other-oa arXiv (Cornell University) 2020-01-01

While semi-supervised learning (SSL) has proven to be a promising way for leveraging unlabeled data when labeled is scarce, the existing SSL algorithms typically assume that training class distributions are balanced. However, these trained under imbalanced can severely suffer generalizing balanced testing criterion, since they utilize biased pseudo-labels of toward majority classes. To alleviate this issue, we formulate convex optimization problem softly refine generated from model, and...

10.48550/arxiv.2007.08844 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Summary Plants, as a sessile organism, produce various secondary metabolites to interact with the environment. These chemicals have fascinated plant science community because of their ecological significance and notable biological activity. However, predicting complete biosynthetic pathways from target molecules metabolic building blocks remains challenge. Here, we propose retrieval‐augmented dual‐view retrosynthesis (READRetro) practical bio‐retrosynthesis tool predict natural products....

10.1111/nph.20012 article EN New Phytologist 2024-07-30

Current uses of tagged images typically exploit only the most explicit information: link between nouns named and objects present somewhere in image. We propose to leverage "unspoken" cues that rest within an ordered list image tags so as improve object localization. define three novel implicit features from image's tags-the relative prominence each signified by its order mention, scale constraints implied unnamed objects, loose spatial links hinted at proximity names on list. By learning a...

10.1109/tpami.2011.190 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2011-10-05

As the application of deep learning has expanded to real-world problems with insufficient volume training data, transfer recently gained much attention as means improving performance in such small-data regime. However, when existing methods are applied between heterogeneous architectures and tasks, it becomes more important manage their detailed configurations often requires exhaustive tuning on them for desired performance. To address issue, we propose a novel approach based meta-learning...

10.48550/arxiv.1905.05901 preprint EN other-oa arXiv (Cornell University) 2019-01-01

In practical settings, a speaker recognition system needs to identify given short utterance, while the enrollment utterance may be relatively long.However, existing models perform poorly with such utterances.To solve this problem, we introduce meta-learning framework for imbalance length pairs.Specifically, use Prototypical Networks and train it support set of long utterances query varying lengths.Further, since optimizing only classes in episode insufficient learning discriminative...

10.21437/interspeech.2020-1283 article EN Interspeech 2022 2020-10-25

While tasks could come with varying the number of instances and classes in realistic settings, existing meta-learning approaches for few-shot classification assume that per task class is fixed. Due to such restriction, they learn equally utilize meta-knowledge across all tasks, even when largely varies. Moreover, do not consider distributional difference unseen on which may have less usefulness depending relatedness. To overcome these limitations, we propose a novel model adaptively balances...

10.48550/arxiv.1905.12917 preprint EN other-oa arXiv (Cornell University) 2019-01-01

One of the most crucial challenges in question answering (QA) is scarcity labeled data, since it costly to obtain question-answer pairs for a target text domain with human annotation. An alternative approach tackle problem use automatically generated QA from either context or large amount unstructured texts (e.g. Wikipedia). In this work, we propose hierarchical conditional variational autoencoder (HCVAE) generating given as contexts, while maximizing mutual information between ensure their...

10.18653/v1/2020.acl-main.20 article EN cc-by 2020-01-01

We introduce a method for image retrieval that leverages the implicit information about object importance conveyed by list of keyword tags person supplies an image.We propose unsupervised learning procedure based on Kernel Canonical Correlation Analysis discovers relationship between how humans tag images (e.g., order in which words are mentioned) and relative objects their layout scene.Using this discovered connection, we show to boost accuracy novel queries, such search results may more...

10.5244/c.24.58 article EN 2010-01-01
Coming Soon ...