NFDI4DS | UHH-SEMS - Publication Details

Large Language Models are Strong Audio-Visual Speech Recognition Learners

OPENALEX - Publications

Umberto Cappellazzo Minsu Kim Honglie Chen Pingchuan Ma Stavros Petridis and 3 more

10.1109/icassp49660.2025.10889251 article EN ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025-03-12

Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition

OPENALEX - Publications

Minsu Kim Hyung-Il Kim Yong Man Ro

Visual Speech Recognition (VSR) aims to infer speech as text from lip movements alone. As it focuses on visual information to model the speech, its performance is inherently sensitive to personal appearances and movements, and this makes VSR models show degraded performance when they are applied to unseen speakers. In this paper, to remedy the performance degradation on unseen speakers, we propose prompt tuning methods of Deep Neural Networks (DNNs) for speaker-adaptive VSR. Specifically, motivated by recent advances in Natural Language...

10.1109/tpami.2024.3484658 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2024-01-01

Contextual Speech Extraction: Leveraging Textual History as an Implicit Cue for Target Speech Extraction

OPENALEX - Publications

Minsu Kim Rodrigo Mira Honglie Chen Stavros Petridis Maja Pantić

10.1109/icassp49660.2025.10887655 article EN ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025-03-12

A 125 GOPS 583 mW Network-on-Chip Based Parallel Processor With Bio-Inspired Visual Attention Engine

OPENALEX - Publications

Kwanho Kim Seungjin Lee Joo-Young Kim Minsu Kim Hoi‐Jun Yoo

A network-on-chip (NoC) based parallel processor is presented for bio-inspired real-time object recognition with a visual attention algorithm. It contains an ARM10-compatible 32-bit main processor, 8 single-instruction multiple-data (SIMD) clusters with processing elements in each cluster, a cellular neural network based visual attention engine (VAE), a matching accelerator, and a DMA-like external interface. The VAE 2-D shift...

10.1109/jssc.2008.2007157 article EN IEEE Journal of Solid-State Circuits 2009-01-01

An Embedded NAND Flash-Based Compute-In-Memory Array Demonstrated in a Standard Logic Process

OPENALEX - Publications

Minsu Kim Muqing Liu Luke Everson Chris H. Kim

A neural network hardware inspired by the 3-D NAND flash array structure was experimentally demonstrated in a standard 65-nm CMOS process. Logic-compatible embedded flash memory cells were used for storing multi-level synaptic weights while a bit-serial architecture enables 8 bit × 8 bit multiply-and-accumulate operation. A novel...

10.1109/jssc.2021.3098671 article EN IEEE Journal of Solid-State Circuits 2021-07-29

Ultralow-k Amorphous Boron Nitride Film for Copper Interconnect Capping Layer

OPENALEX - Publications

Kiryong Kim Hyeong-Joon Kim Sun-Woo Lee Min Yung Lee Gyusoup Lee and 9 more

We report the feasibility of ultralow-k amorphous boron nitride (α-BN) film as a new capping layer for copper (Cu) interconnects. α-BN thin films were successfully deposited using a plasma-enhanced chemical vapor deposition (PECVD) process. The CVD-grown α-BN showed a k-value as low as 2.0 at 3 nm thickness, leakage...

10.1109/ted.2023.3258403 article EN IEEE Transactions on Electron Devices 2023-03-23

Large Language Models Are Strong Audio-Visual Speech Recognition Learners

OPENALEX - Publications

Umberto Cappellazzo Minsu Kim Honglie Chen Pingchuan Ma Stavros Petridis and 3 more

Multimodal large language models (MLLMs) have recently become a focal point of research due to their formidable multimodal understanding capabilities. For example, in the audio and speech domains, an LLM can be equipped with (automatic) speech recognition (ASR) abilities by just concatenating the audio tokens, computed with an audio encoder, and the text tokens to achieve state-of-the-art results. On the contrary, tasks like visual and audio-visual speech recognition (VSR/AVSR), which also exploit noise-invariant lip movement information, have received little or...

10.48550/arxiv.2409.12319 preprint EN arXiv (Cornell University) 2024-09-18

The brain mimicking Visual Attention Engine: An 80×60 digital Cellular Neural Network for rapid global feature extraction

OPENALEX - Publications

Seungjin Lee Kwanho Kim Minsu Kim Joo-Young Kim Hoi‐Jun Yoo

The visual attention engine (VAE), an 80 × 60 digital cellular neural network, rapidly extracts global features used as attentional cues to streamline detailed object recognition. A peak performance of 24 GOPS is achieved by 120 processing elements (PE) shared by the cells. 2D shift register based data transactions enable 93% PE utilization. Integrated within an object recognition SoC, the 4.5 mm² VAE...

10.1109/vlsic.2008.4585938 article EN 2008-06-01

Effective threshold voltage modulation technique for steep-slope 2D atomic threshold switching field-effect transistor

OPENALEX - Publications

Seong‐Hyun Hwang Seung‐Hwan Kim Seung-Geun Kim Minsu Kim Kyu‐Hyun Han and 5 more

10.1016/j.mtadv.2023.100367 article EN cc-by-nc-nd Materials Today Advances 2023-04-13

A Counter based ADC Non-linearity Measurement Circuit and Its Application to Reliability Testing

OPENALEX - Publications

Gyusung Park Minsu Kim Nakul Pande Po-Wei Chiu Jeehwan Song and 1 more

In this paper, we present a counter based measurement circuit for in-situ characterization of analog-to-digital converter (ADC) differential non-linearity (DNL) and integral non-linearity (INL). An array of counters collects the histogram of the ADC output code for a triangular input voltage. Since the ADC operation and the data transfer are separated in time, the DNL and INL results are immune to noise from the setup. Using the proposed method, we studied short-term bias temperature instability (BTI) effects in a successive-approximation-register ADC under different...

10.1109/cicc.2019.8780279 article EN 2019 IEEE Custom Integrated Circuits Conference (CICC) 2019-04-01
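
The histogram idea mentioned in the abstract above can be illustrated with the standard code-density calculation. This is only a minimal sketch, not the circuit or procedure from the paper: it assumes an ideal linear (triangular) input, for which every interior code should be hit equally often, and the function name `dnl_inl_from_histogram` and the sample counts are hypothetical.

```python
from typing import List, Tuple


def dnl_inl_from_histogram(hist: List[int]) -> Tuple[List[float], List[float]]:
    """Estimate DNL and INL (in LSB) from per-code hit counts.

    Assumes a linear ramp or triangular input, so an ideal ADC would
    produce the same count for every interior code.
    """
    # Drop the first and last codes: they absorb out-of-range samples.
    inner = hist[1:-1]
    if not inner or sum(inner) == 0:
        raise ValueError("histogram needs at least one non-empty interior code")

    # Ideal count per code is the average interior count.
    ideal = sum(inner) / len(inner)

    # DNL: relative deviation of each code width from the ideal width.
    dnl = [count / ideal - 1.0 for count in inner]

    # INL: running sum of DNL.
    inl = []
    running = 0.0
    for d in dnl:
        running += d
        inl.append(running)
    return dnl, inl


# Hypothetical counter readings for a 6-code example.
hist = [50, 100, 120, 80, 100, 50]
dnl, inl = dnl_inl_from_histogram(hist)
print([round(x, 3) for x in dnl])
print([round(x, 3) for x in inl])
```

A wide code (more hits than average) gives a positive DNL and a narrow code a negative one; the INL is the accumulated error along the transfer curve.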

Scan-controlled pulse flip-flops for mobile application processors

OPENALEX - Publications

Minsu Kim HyoungWook Lee Jin‐Soo Park Chung Hee Kim Juhyun Kang and 6 more

Novel high-speed, low-power pulse-based flip-flops having a pulse generator controlled by scan input and enable signals are presented. The proposed scheme enables the reduction of data-to-output delay and the elimination of MUX-scan logic from the setup time path of the flip-flop, at the cost of a small power overhead. The comparison results using a 45 nm CMOS process indicate that the worst-case DQ delay of the flip-flop is reduced by up to 59% while the energy-delay product is improved by 80% compared to the conventional master-slave flip-flop. The silicon results show the new...

10.1109/iscas.2013.6571960 article EN 2013 IEEE International Symposium on Circuits and Systems (ISCAS) 2013-05-01

A Layout-to-Generator Conversion Framework With Graphical User Interface for Visual Programming of Analog Layout Generators

OPENALEX - Publications

Sungyu Jeong Chanhyong Lee Minsu Kim Iksu Jang Myungguk Lee and 2 more

We propose a visual programming framework that helps a designer easily convert an existing analog layout into a generator. Using a graphical user interface (GUI), designers can load a layout, convert it into a generator, and visually verify the generated result. A GUI-supported visual programming method enables intuitive and straightforward conversion to significantly reduce the required programming skills and coding workload. Through program blocks, designers can describe and compile the generator. Layout-code synchronization updates the blocks automatically when layout elements are created, edited,...

10.36227/techrxiv.24189216.v2 preprint EN cc-by-nc-sa 2023-10-18

DevFormer: A Symmetric Transformer for Context-Aware Device Placement

OPENALEX - Publications

Haeyeon Kim Minsu Kim Joungho Kim Jinkyoo Park

In this paper, we present DevFormer, a novel transformer-based architecture for addressing the complex and computationally demanding problem of hardware design optimization. Despite the demonstrated efficacy of transformers in domains including natural language processing and computer vision, their use in hardware design has been limited by the scarcity of offline data. Our approach addresses this limitation by introducing strong inductive biases such as relative positional embeddings and action-permutation symmetricity that effectively...

10.48550/arxiv.2205.13225 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Timely Update Probability Analysis of Blockchain Ledger in UAV-assisted Data Collection Networks

OPENALEX - Publications

Sungho Lee Minsu Kim Mingun Kim Jemin Lee

In recent years, blockchain technology has been frequently exploited to address new security requirements for unmanned aerial vehicle (UAV)-assisted data collection (U-DC). However, the latency to commit data to the blockchain ledger has emerged as a new issue. In this paper, therefore, we analyze the timely update probability (TUP) of the blockchain ledger, which is the probability that data collected from a UAV is updated in the ledger within a given target latency. For the analysis, we first define the TUP for U-DC networks, using both communication and blockchain latencies. We then derive closed-form...

10.1109/icc45855.2022.9838787 article EN ICC 2022 - IEEE International Conference on Communications 2022-05-16

All-digital PLL frequency and phase noise degradation measurements using simple on-chip monitoring circuits

OPENALEX - Publications

Gyusung Park Minsu Kim Chris H. Kim Bongjin Kim Vijay Reddy

Using simple on-chip monitoring circuits, we precisely characterized the impact of hot carrier injection and bias temperature instability on the frequency and phase noise degradation of a 65 nm all-digital PLL circuit. Experimental data shows that phase noise degrades with aging even though the output frequency is maintained constant due to the PLL feedback operation. Results show that applying high temperature annealing can recover most of the degradation.

10.1109/irps.2018.8353613 article EN 2018 IEEE International Reliability Physics Symposium (IRPS) 2018-03-01

MAPIM: Mat Parallelism for High Performance Processing in Non-volatile Memory Architecture

OPENALEX - Publications

Joonseop Sim Minsu Kim Yeseong Kim Saransh Gupta Behnam Khaleghi and 1 more

In the Internet of Things (IoT) era, data movement between processing units and memory is a critical factor in overall system performance. Processing-in-Memory (PIM) is a promising solution to address this bandwidth bottleneck by performing a portion of the computation inside the memory. Many prior studies have enabled various PIM operations on nonvolatile memory (NVM) by modifying sense amplifiers (SA). They exploit a single sense amplifier to handle multiple bitlines with a multiplexer (MUX) since the SA circuit takes much larger...

10.1109/isqed.2019.8697441 article EN 2019-03-01

Evaluation of Airflow Changes according to the Geometry of Airway after Left Upper Lobectomy based on Computational Fluid Dynamics

OPENALEX - Publications

Minsu Kim Soojin Lee Hyo Yeong Ahn Chi‐Seung Lee

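Left upper lobectomy is a surgery that resects the largest of the lung lobes. After surgery, the remaining lung expands and induces changes in the shape of the bronchi. The structure covering the bronchi, pulmonary arteries, pulmonary veins, and so on is called the pulmonary ligament. Surgical approaches include dissecting the ligament and preserving it. In this study, bronchial data before and after surgery were analyzed from CT images of patients whose pulmonary ligament was dissected during lobectomy. The model was simplified, and the airflow changes due to each factor were evaluated through computational fluid dynamics analysis. Shape changes were observed in length, curvature, and cross-sectional area; models were built by applying a fixed amount of change to each of these three factors, and the simulations confirmed that the change in cross-sectional area affects the airflow. When the inner diameter of the cross-section decreases to 0.5 times its original value, the volumetric flow rate ... about 64% ...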

10.3795/ksme-b.2023.47.9.439 article KO Transactions of the Korean Society of Mechanical Engineers B 2023-09-13

A Layout-to-Generator Conversion Framework With Graphical User Interface for Visual Programming of Analog Layout Generators

OPENALEX - Publications

Sungyu Jeong Chanhyong Lee Minsu Kim Iksu Jang Myungguk Lee and 2 more

We propose a visual programming framework that helps a designer easily convert an existing analog layout into a generator. Using a graphical user interface (GUI), designers can load a layout, convert it into a generator, and visually verify the generated result. A GUI-supported visual programming method enables intuitive and straightforward conversion to significantly reduce the required programming skills and coding workload. Through program blocks, designers can describe and compile the generator. Layout-code synchronization updates the blocks automatically when layout elements are created, edited,...

10.36227/techrxiv.24189216.v1 preprint EN cc-by-nc-sa 2023-10-02

A Layout-to-Generator Conversion Framework With Graphical User Interface for Visual Programming of Analog Layout Generators

OPENALEX - Publications

Sungyu Jeong Chanhyong Lee Minsu Kim Iksu Jang Myungguk Lee and 2 more

We propose a visual programming framework that helps a designer easily convert an existing analog layout into a generator. Using a graphical user interface (GUI), designers can load a layout, convert it into a generator, and visually verify the generated result. A GUI-supported visual programming method enables intuitive and straightforward conversion to significantly reduce the required programming skills and coding workload. Through program blocks, designers can describe and compile the generator. Layout-code synchronization updates the blocks automatically when layout elements are created, edited,...

10.36227/techrxiv.24189216 preprint EN cc-by-nc-sa 2023-10-02

A study of Comparative Analysis of CPV and PV Module through Long-term Outdoor Testing

OPENALEX - Publications

Minsu Kim Yuri Lee Min-Je Cho Soo Young Oh Jae Hak Jung

10.21218/cpr.2017.5.1.033 article EN Current Photovoltaic Research 2017-01-01

Transformer Network-based Reinforcement Learning Method for Power Distribution Network (PDN) Optimization of High Bandwidth Memory (HBM)

OPENALEX - Publications

Hyunwook Park Minsu Kim Seongguk Kim Keunwoo Kim Haeyeon Kim and 7 more

In this article, for the first time, we propose a transformer network-based reinforcement learning (RL) method for power distribution network (PDN) optimization of high bandwidth memory (HBM). The proposed method can provide an optimal decoupling capacitor (decap) design to maximize the reduction of PDN self- and transfer impedance seen at multiple ports. An attention-based transformer network is implemented to directly parameterize the decap optimization policy. The optimality performance is significantly improved since the attention mechanism has powerful...

10.48550/arxiv.2203.15722 preprint EN other-oa arXiv (Cornell University) 2022-01-01