NFDI4DS | UHH-SEMS - Publication Details

Jaemin Cho

ORCID: 0000-0003-1148-5413

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5101760065

Research Areas

Multimodal Machine Learning Applications
Topic Modeling
Domain Adaptation and Few-Shot Learning
Advanced Image and Video Retrieval Techniques
Natural Language Processing Techniques
Video Analysis and Summarization
Perovskite Materials and Applications
Fuel Cells and Related Materials
Robotics and Sensor-Based Localization
Computer Graphics and Visualization Techniques
Chalcogenide Semiconductor Thin Films
Human Motion and Animation
Advancements in Solid Oxide Fuel Cells
Speech and dialogue systems
Corporate Finance and Governance
Private Equity and Venture Capital
Recycling and Waste Management Techniques
Conducting polymers and applications
Integrated Energy Systems Optimization
Advanced Neural Network Applications
Reinforcement Learning in Robotics
Embedded Systems Design Techniques
Advancements in Photolithography Techniques
Visual Attention and Saliency Detection
Quantum Dots Synthesis And Properties

Korea Advanced Institute of Science and Technology
2021-2025

University of North Carolina at Chapel Hill
2020-2024

University of North Carolina Health Care
2020-2024

Korea University of Technology and Education
2024

Inha University
2022-2024

Korea Institute of Ocean Science and Technology
2024

Seoul National University
2016-2023

Pohang University of Science and Technology
2012-2020

University of Washington
2020

Naver (South Korea)
2019

VL-ADAPTER: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks

OPENALEX - Publications

Yi-Lin Sung Jaemin Cho Mohit Bansal

Recently, fine-tuning language models pre-trained on large text corpora have provided huge improvements vision-and-language (V&L) tasks as well pure tasks. However, the entire parameter set of becomes impractical since model size is growing rapidly. Hence, in this paper, we introduce adapter-based parameter-efficient transfer learning techniques to V&L such VL-BART and VL-T5. We evaluate our methods a unified multi-task setup both image-text video-text benchmarks. For tasks, use four diverse...

10.1109/cvpr52688.2022.00516 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

DALL-EVAL: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models

OPENALEX - Publications

Jaemin Cho Abhay Zala Mohit Bansal

Recently, DALL-E [45], a multimodal transformer language model, and its variants including diffusion models have shown high-quality text-to-image generation capabilities. However, despite the realistic image results, there has not been detailed analysis of how to evaluate such models. In this work, we investigate visual reasoning capabilities social biases different models, covering both First, measure three skills: object recognition, counting, spatial relation understanding. For this,...

10.1109/iccv51070.2023.00283 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

Development of a new technology product evaluation model for assessing commercialization opportunities using Delphi method and fuzzy AHP approach

OPENALEX - Publications

Jaemin Cho Jaeho Lee

10.1016/j.eswa.2013.03.038 article EN Expert Systems with Applications 2013-04-06

A Hierarchical Latent Structure for Variational Conversation Modeling

OPENALEX - Publications

Yookoon Park Jaemin Cho Gunhee Kim

Yookoon Park, Jaemin Cho, Gunhee Kim. Proceedings of the 2018 Conference North American Chapter Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers). 2018.

10.18653/v1/n18-1162 article EN cc-by 2018-01-01

X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers

OPENALEX - Publications

Jaemin Cho Jiasen Lu Dustin Schwenk Hannaneh Hajishirzi Aniruddha Kembhavi

Mirroring the success of masked language models, vision-and-language counterparts like VILBERT, LXMERT and UNITER have achieved state art performance on a variety multimodal discriminative tasks visual question answering grounding. Recent work has also successfully adapted such models towards generative task image captioning. This begs question: Can these go other way generate images from pieces text? Our analysis popular representative this model family – finds that it is unable to rich...

10.18653/v1/2020.emnlp-main.707 article EN cc-by 2020-01-01

LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning

OPENALEX - Publications

Yi-Lin Sung Jaemin Cho Mohit Bansal

Fine-tuning large pre-trained models on downstream tasks has been adopted in a variety of domains recently. However, it is costly to update the entire parameter set models. Although recently proposed parameter-efficient transfer learning (PETL) techniques allow updating small subset parameters (e.g. only using 2% parameters) inside backbone network for new task, they reduce training memory requirement by up 30%. This because gradient computation trainable still requires backpropagation...

10.48550/arxiv.2206.06522 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Mixture Content Selection for Diverse Sequence Generation

OPENALEX - Publications

Jaemin Cho Minjoon Seo Hannaneh Hajishirzi

Jaemin Cho, Minjoon Seo, Hannaneh Hajishirzi. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint (EMNLP-IJCNLP). 2019.

10.18653/v1/d19-1308 article EN cc-by 2019-01-01

Fine-grained Image Captioning with CLIP Reward

OPENALEX - Publications

Jaemin Cho Seunghyun Yoon Ajinkya Kale Franck Dernoncourt Trung Bui and 1 more

Modern image captioning models are usually trained with text similarity objectives. However, since reference captions in public datasets often describe the most salient common objects, objectives tend to ignore specific and detailed aspects of an that distinguish it from others. Towards more descriptive distinctive caption generation, we propose use CLIP, a multimodal encoder on huge image-text pairs web, calculate multi-modal as reward function. We also simple finetuning strategy CLIP...

10.18653/v1/2022.findings-naacl.39 article EN cc-by Findings of the Association for Computational Linguistics: NAACL 2022 2022-01-01

Design of SnO2 Electron Transport Layer in Perovskite Solar Cells to Achieve 2000 h Stability Under 1 Sun Illumination and 85 °C

OPENALEX - Publications

Bumjin Gil Alan Jiwan Yun Jiheon Lim Jaemin Cho Beomsoo Kim and 3 more

Abstract In order to realize both efficient and stable perovskite solar cells, designing electron transport layer (ETL) is of crucial importance withstand constant light illumination thermal stress while maintaining high charge extractability. Herein, commonly used SnO 2 nanoparticle‐based ETL for cells modified by ionic‐salt ammonium chloride (NH 4 Cl) tin dihydrate (SnCl ∙2H O) as additives, which easily fabricated simple one‐step spin coating single precursor solution. With the presence...

10.1002/admi.202202148 article EN cc-by Advanced Materials Interfaces 2023-03-03

Self-Chained Image-Language Model for Video Localization and Question Answering

OPENALEX - Publications

Shoubin Yu Jaemin Cho Prateek Yadav Mohit Bansal

Recent studies have shown promising results on utilizing large pre-trained image-language models for video question answering. While these can efficiently bootstrap the representation learning of video-language models, they typically concatenate uniformly sampled frames as visual inputs without explicit language-aware, temporal modeling. When only a portion input is relevant to language query, such uniform frame sampling often lead missing important cues. Although humans find moment focus...

10.48550/arxiv.2305.06988 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Hierarchical Video-Moment Retrieval and Step-Captioning

OPENALEX - Publications

Abhay Zala Jaemin Cho Satwik Kottur Xilun Chen Barlas Oğuz and 2 more

There is growing interest in searching for information from large video corpora. Prior works have studied relevant tasks, such as text-based retrieval, moment summarization, and captioning isolation, without an end-to-end setup that can jointly search corpora generate summaries. Such would allow many interesting applications, e.g., a finds corpus, extracts the most video, segments into important steps with captions. To address this, we present HIREST (HIerarchical REtrieval STep-captioning)...

10.1109/cvpr52729.2023.02208 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Microplastic Contamination of a Benthic Ecosystem in a Hydrothermal Vent

OPENALEX - Publications

Byeongyong Park Boongho Cho Jaemin Cho Tae Won Kim

Plastic contamination is a global pervasive issue, extending from coastal areas and open oceans to polar regions even the deep sea. Microplastic (MP) in hydrothermal vents, which are known for their high biodiversity under extreme conditions, has remained largely unexplored. Here, we present, first time, MP pollution deep-sea vent at one of hotspots─the Central Indian Ridge. Not only environment (seawater: 2.08 ± 1.04 MPs/L, surface sediments: 0.57 0.19 MP/g) but also all six major benthic...

10.1021/acs.est.4c02811 article EN Environmental Science & Technology 2024-04-17

CuCrO2 Nanoparticles Incorporated into PTAA as a Hole Transport Layer for 85 °C and Light Stabilities in Perovskite Solar Cells

OPENALEX - Publications

Bumjin Gil Jinhyun Kim Alan Jiwan Yun Ki‐Min Park Jaemin Cho and 2 more

High-mobility inorganic CuCrO2 nanoparticles are co-utilized with conventional poly(bis(4-phenyl)(2,5,6-trimethylphenyl)amine) (PTAA) as a hole transport layer (HTL) for perovskite solar cells to improve device performance and long-term stability. Even though can be readily synthesized by hydrothermal reaction, it is difficult form uniform HTL alone due the severe agglomeration of nanoparticles. Herein, both PTAA sequentially deposited on simple spin-coating process, forming excellent...

10.3390/nano10091669 article EN cc-by Nanomaterials 2020-08-26

A Non‐thermal Plasma Seed Treatment Method for Management of a Seedborne Fungal Pathogen on Rice Seed

OPENALEX - Publications

Young‐Ki Jo Jaemin Cho Tsung‐Chan Tsai David Staack Mi‐Hyung Kang and 4 more

ABSTRACT Seeds contaminated with pathogens are the primary inoculum for plant diseases in many food crops. Conventional treatments seedborne use hot water, chlorine or fungicide applications. A novel seed treatment method based on non‐thermal plasma generated by an air dielectric barrier discharge (DBD) device was evaluated this study as alternative to these conventional treatments. The at atmospheric pressure and room temperature consisted of partially‐ionized gases that chemically...

10.2135/cropsci2013.05.0331 article EN Crop Science 2014-02-27

Incorporation of Lithium Fluoride Restraining Thermal Degradation and Photodegradation of Organometal Halide Perovskite Solar Cells

OPENALEX - Publications

Alan Jiwan Yun Jinhyun Kim Bumjin Gil Hyungsub Woo Ki‐Min Park and 2 more

Because of the facile formation defects in organometal halide perovskites, defect passivation has become an important prerequisite for stable and efficient perovskite solar cell (PSC). Regarding that ionic perovskites play a significant role on performance stability PSCs, we introduce lithium fluorides as effective passivators based their strong characteristics small radii. Both Li+ F– are observed to successfully incorporate within layer, improving device performances with best efficiency...

10.1021/acsami.0c14218 article EN ACS Applied Materials & Interfaces 2020-10-29

The venture capital certification role in R&D: Evidence from IPO underpricing in Korea

OPENALEX - Publications

Jaemin Cho Jae-Ho Lee

10.1016/j.pacfin.2013.01.005 article EN Pacific-Basin Finance Journal 2013-01-30

Comparison of the Electrochemical Reaction Parameter of Graphite and Sub-bituminous Coal in a Direct Carbon Fuel Cell

OPENALEX - Publications

Seongyong Eom Jaemin Cho Seongyool Ahn Yonmo Sung Gyungmin Choi and 1 more

A direct carbon fuel cell (DCFC) system directly converts the chemical energy of solid carbonaceous into electrical energy. The electrochemical reaction this has an influence on properties fuel, such as crystal structure, element composition, and surface properties. In addition, when using raw coals DCFC volatile gases released from coal at a high temperature affect performance. purpose study is to investigate effect characteristics resistance inner by impedance spectroscopy (EIS) equivalent...

10.1021/acs.energyfuels.5b02904 article EN Energy & Fuels 2016-03-05

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models

OPENALEX - Publications

Jaemin Cho Abhay Zala Mohit Bansal

Recently, DALL-E, a multimodal transformer language model, and its variants, including diffusion models, have shown high-quality text-to-image generation capabilities. However, despite the realistic image results, there has not been detailed analysis of how to evaluate such models. In this work, we investigate visual reasoning capabilities social biases different covering both models First, measure three skills: object recognition, counting, spatial relation understanding. For this, propose...

10.48550/arxiv.2202.04053 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Multifunctional Green Solvent for Efficient Perovskite Solar Cells

OPENALEX - Publications

Jaemin Cho Beomsoo Kim Seokjoo Ryu Alan Jiwan Yun Bumjin Gil and 4 more

10.1007/s13391-023-00410-x article EN Electronic Materials Letters 2023-03-03

Unifying Vision-and-Language Tasks via Text Generation

OPENALEX - Publications

Jaemin Cho Jie Lei Hao Tan Mohit Bansal

Existing methods for vision-and-language learning typically require designing task-specific architectures and objectives each task. For example, a multi-label answer classifier visual question answering, region scorer referring expression comprehension, language decoder image captioning, etc. To alleviate these hassles, in this work, we propose unified framework that learns different tasks single architecture with the same modeling objective, i.e., multimodal conditional text generation,...

10.48550/arxiv.2102.02779 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Editing Scene Illumination and Material Appearance of Light-Field Images

OPENALEX - Publications

Jaemin Cho Dongyoung Choi Dahyun Kang Gun Bang Min Kim

10.5220/0013145500003912 article EN Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications 2025-01-01

Efficient way of reducing contaminant induced by interactions between EUV photoresist and electron beam in CD-SEM: the route to carryover free SEM metrology

OPENALEX - Publications

Seung Min Park I. Nir Seung-Chan Kwak Woosik Yoo Hyunwoo Kim and 25 more

10.1117/12.3047636 article EN 2025-04-24

MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding

OPENALEX - Publications

Revant Gangi Reddy Xilin Rui Manling Li Xudong Lin Haoyang Wen and 7 more

Recently, there has been an increasing interest in building question answering (QA) models that reason across multiple modalities, such as text and images. However, QA using images is often limited to just picking the answer from a pre-defined set of options. In addition, real world, especially news, have objects are co-referential text, with complementary information both modalities. this paper, we present new evaluation benchmark 1,384 questions over news articles require cross-media...

10.1609/aaai.v36i10.21370 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2022-06-28

Mixed-Valence iron phosphate as an effective catalytic host for the High-Rate Lithium-Sulfur battery

OPENALEX - Publications

Ki‐Min Park Bumjin Gil Alan Jiwan Yun Jaemin Cho Jinhyun Kim and 1 more

10.1016/j.cej.2022.134814 article EN Chemical Engineering Journal 2022-01-22

Coming Soon ...