Sunghwan Hong

ORCID: 0000-0003-0685-3779
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Neural Network Applications
  • Advanced Image and Video Retrieval Techniques
  • Rare-earth and actinide compounds
  • Multimodal Machine Learning Applications
  • Topic Modeling
  • Human Pose and Action Recognition
  • Natural Language Processing Techniques
  • Iron-based superconductors research
  • Generative Adversarial Networks and Image Synthesis
  • Video Surveillance and Tracking Methods
  • Magnetic and transport properties of perovskites and related materials
  • Advanced Vision and Imaging
  • High Entropy Alloys Studies
  • Image Enhancement Techniques
  • Domain Adaptation and Few-Shot Learning
  • Image Retrieval and Classification Techniques
  • Advanced materials and composites
  • Supercapacitor Materials and Fabrication
  • Polymer Surface Interaction Studies
  • Nanocomposite Films for Food Packaging
  • Inorganic Chemistry and Materials
  • Visual Attention and Saliency Detection
  • Advancements in Battery Materials
  • High-Temperature Coating Behaviors
  • Video Analysis and Summarization

Samsung (South Korea)
2023-2025

Korea University
2021-2022

Sejong University
2022

Sungkyunkwan University
2002-2021

Suwon Research Institute
2019-2020

Government of the Republic of Korea
2019-2020

Tokushima University
1991

10.1109/cvpr52733.2024.00394 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Achieving low thermal conductivity and high mechanical strength presents a material design challenge due to intrinsic trade-offs, such as the aerogel's porosity, impeding applications in construction, industry, aerospace. This study composite that incorporates silica aerogel within thermally expanded 2D layered vermiculite matrix. overcomes limitations imposed by van der Waals bonding lengths, typically less than 10 Å, which hinder integration with crystals. Our method employs spark reaction...

10.1021/acs.nanolett.4c04767 article EN Nano Letters 2025-01-09

Polymeric three-dimensional inverse-opal (IO) structures provide unique structural properties useful for various applications ranging from optics to separation technologies. Despite vast needs IO functionalization impart additional chemical properties, this task has been seriously challenged by the intrinsic limitation of polymeric porous materials that do not allow easy penetration waterborne moieties or precursors. To overcome restriction, we present a robust and straightforward method...

10.1021/acsami.5b11021 article EN ACS Applied Materials & Interfaces 2016-01-18

We propose a novel cost aggregation network, called Cost Aggregation Transformers (CATs), to find dense correspondences between semantically similar images with additional challenges posed by large intra-class appearance and geometric variations. is highly important process in matching tasks, which the accuracy depends on quality of its output. Compared hand-crafted or CNN-based methods addressing aggregation, that either lacks robustness severe deformations inherit limitation CNNs fail...

10.48550/arxiv.2106.02520 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Zwitterionic sulfobetaine moieties were anchored on a separator surface to concurrently facilitate selective Li-ion transport and polysulfide conversion, finally resulting in significantly enhanced performance for durable Li–S battery operation.

10.1039/d1ta08422d article EN Journal of Materials Chemistry A 2021-01-01

With growing concerns over electronic device malfunction and the resulting information loss caused by electromagnetic interference (EMI), extensive studies have been performed in developing EMI shielding techniques.

10.1039/d0tc02483j article EN Journal of Materials Chemistry C 2020-01-01

The proteolytic activities of the 20 S proteasome were found to change in their levels during development chick embryonic muscle.The peptide-cleaving against N-succinyl-Leu-Leu-Val-Tyr-7amido-4-methylcoumarin and N-benzyloxycarbonyl-Ala-Arg-Arg-4-methoxy-&naphthylamide gradually decreased with time development.On other hand, casein-degrading activity presence poly-L-lysine markedly increased from day 11 reached a maximal level by 17.These changes appeared be tissue-specific because little or...

10.1016/s0021-9258(18)98472-3 article EN cc-by Journal of Biological Chemistry 1991-08-01

Binder-free layer-by-layer assembled multilayers consisting of reduced graphene oxide and alumina nanoparticles are prepared for implementing heat dissipation films with outstandingly high in-plane cross-plane thermal conductivities.

10.1039/c9tc02143d article EN Journal of Materials Chemistry C 2019-01-01

Cost aggregation is a process in image matching tasks that aims to disambiguate the noisy scores. Existing methods generally tackle this by hand-crafted or CNN-based methods, which either lack robustness severe deformations inherit limitation of CNNs fail discriminate incorrect matches due limited receptive fields and inadaptability. In paper, we introduce Aggregation with Transformers (CATs) exploring global consensus among initial correlation map help some architectural designs allow us...

10.1109/tpami.2022.3218727 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2022-11-04

In the paradigm of AI-generated content (AIGC), there has been increasing attention to transferring knowledge from pre-trained text-to-image (T2I) models text-to-video (T2V) generation. Despite their effectiveness, these frameworks face challenges in maintaining consistent narratives and handling shifts scene composition or object placement a single abstract user prompt. Exploring ability large language (LLMs) generate time-dependent, frame-by-frame prompts, this paper introduces new...

10.48550/arxiv.2305.14330 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Conventional techniques to establish dense correspondences across visually or semantically similar images focused on designing a task-specific matching prior, which is difficult model in general. To overcome this, recent learning-based methods have attempted learn good prior within itself large training data. The performance improvement was apparent, but the need for sufficient data and intensive learning hinders their applicability. Moreover, using fixed at test time does not account fact...

10.1109/iccv48922.2021.00976 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

We introduce a novel cost aggregation network, dubbed Volumetric Aggregation with Transformers (VAT), to tackle the few-shot segmentation task by using both convolutions and transformers efficiently handle high dimensional correlation maps between query support. In specific, we propose our encoder consisting of volume embedding module not only transform into more tractable size but also inject some convolutional inductive bias volumetric transformer for aggregation. Our has pyramidal...

10.48550/arxiv.2112.11685 preprint EN cc-by arXiv (Cornell University) 2021-01-01

We present a novel architecture for dense correspondence. The current state-of-the-art are Transformer-based approaches that focus on either feature descriptors or cost volume aggregation. However, they generally aggregate one the other but not both, though joint aggregation would boost each by providing information has lacks, i.e., structural semantic of an image, pixel-wise matching similarity. In this work, we propose network interleaves both forms aggregations in way exploits their...

10.48550/arxiv.2209.08742 preprint EN cc-by arXiv (Cornell University) 2022-01-01

Open-vocabulary semantic segmentation presents the challenge of labeling each pixel within an image based on a wide range text descriptions. In this work, we introduce novel cost-based approach to adapt vision-language foundation models, notably CLIP, for intricate task segmentation. Through aggregating cosine similarity score, i.e., cost volume between and embeddings, our method potently adapts CLIP segmenting seen unseen classes by fine-tuning its encoders, addressing challenges faced...

10.48550/arxiv.2303.11797 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Successful sequential recommendation systems rely on accurately capturing the user's short-term and long-term interest. Although Transformer-based models achieved state-of-the-art performance in task, they generally require quadratic memory time complexity to sequence length, making it difficult extract interest of users. On other hand, Multi-Layer Perceptrons (MLP)-based models, renowned for their linear complexity, have recently shown competitive results compared Transformer various tasks....

10.48550/arxiv.2108.07505 preprint EN cc-by arXiv (Cornell University) 2021-01-01

This paper introduces a Transformer-based integrative feature and cost aggregation network designed for dense matching tasks. In the context of matching, many works benefit from one two forms aggregation: aggregation, which pertains to alignment similar features, or procedure aimed at instilling coherence in flow estimates across neighboring pixels. this work, we first show that exhibit distinct characteristics reveal potential substantial benefits stemming judicious use both processes. We...

10.48550/arxiv.2403.11120 preprint EN arXiv (Cornell University) 2024-03-17

Large-scale vision-language models like CLIP have demonstrated impressive open-vocabulary capabilities for image-level tasks, excelling in recognizing what objects are present. However, they struggle with pixel-level recognition tasks semantic segmentation, which additionally require understanding where the located. In this work, we propose a novel method, PixelCLIP, to adapt image encoder by guiding model on where, is achieved using unlabeled images and masks generated from vision...

10.48550/arxiv.2409.19846 preprint EN arXiv (Cornell University) 2024-09-29

In this work, we explore new perspectives on cross-view completion learning by drawing an analogy to self-supervised correspondence learning. Through our analysis, demonstrate that the cross-attention map within models captures more effectively than other correlations derived from encoder or decoder features. We verify effectiveness of evaluating both zero-shot matching and learning-based geometric multi-frame depth estimation. Project page is available at https://cvlab-kaist.github.io/ZeroCo/.

10.48550/arxiv.2412.09072 preprint EN arXiv (Cornell University) 2024-12-12
Coming Soon ...