- Advanced Memory and Neural Computing
- Fluid Dynamics Simulations and Interactions
- Ferroelectric and Negative Capacitance Devices
- Advanced Neural Network Applications
- Ship Hydrodynamics and Maneuverability
- Spacecraft and Cryogenic Technologies
- Neural Networks and Reservoir Computing
- Advanced Image and Video Retrieval Techniques
- CCD and CMOS Imaging Sensors
- Parallel Computing and Optimization Techniques
- Computer Graphics and Visualization Techniques
- Advanced Vision and Imaging
- 3D Shape Modeling and Analysis
- Semiconductor Materials and Devices
- Robotics and Sensor-Based Localization
- Advancements in Semiconductor Devices and Circuit Design
- Generative Adversarial Networks and Image Synthesis
- Methane Hydrates and Related Phenomena
- Silicon Carbide Semiconductor Technologies
- Advanced Image Processing Techniques
- Reinforcement Learning in Robotics
- Modular Robots and Swarm Intelligence
- Earthquake and Tsunami Effects
- Neural Dynamics and Brain Function
- Face and Expression Recognition
Korea Institute of Ocean Science and Technology
2025
Korea Advanced Institute of Science and Technology
2018-2025
Korean Register (South Korea)
2019-2024
Kumoh National Institute of Technology
2023-2024
Sejong University
2023
Seoul National University
2011-2019
Deep neural network (DNN) accelerators [1-3] have been proposed to accelerate deep learning algorithms, from face recognition to emotion recognition, in mobile or embedded environments [3]. However, most works support only the convolutional layers (CLs) and fully-connected layers (FCLs), and different DNNs, such as those containing recurrent layers (RLs) (useful for speech recognition), are not supported in hardware. A combined CNN-RNN accelerator [1], separately optimizing the computation-dominant CLs and the memory-dominant RLs and FCLs, was reported to increase...
An energy-efficient deep neural network (DNN) accelerator, the unified neural processing unit (UNPU), is proposed for mobile deep learning applications. The UNPU can support both convolutional layers (CLs) and recurrent or fully connected layers (FCLs) to cover versatile workload combinations and accelerate various DNNs. In addition, it is the first DNN accelerator ASIC that supports fully variable weight bit precision from 1 to 16 bit, which enables it to operate on the accuracy-energy optimal point. Moreover, the lookup table (LUT)-based bit-serial processing element (LBPE) in...
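The idea behind bit-serial processing of variable-precision weights can be illustrated with a small sketch (a simplified functional model, not the UNPU's LUT-based datapath): weights are consumed one bit plane at a time, so the same hardware serves any precision from 1 to 16 bit.

```python
def bit_serial_mac(inputs, weights, w_bits):
    """Bit-serial multiply-accumulate: weights are processed one bit
    plane at a time, so one datapath serves any weight precision."""
    acc = 0
    for b in range(w_bits):                      # LSB plane first
        plane = [(w >> b) & 1 for w in weights]  # bit b of each weight
        partial = sum(x * p for x, p in zip(inputs, plane))
        if b == w_bits - 1:
            acc -= partial << b   # MSB is negative in two's complement
        else:
            acc += partial << b
    return acc

# Matches a direct dot product at any precision, e.g.
# bit_serial_mac([3, -2, 5], [7, -4, 1], w_bits=8) -> 34
```

Lowering `w_bits` trades accuracy for fewer bit-plane passes, which is the knob a variable-precision accelerator exposes to sit on the accuracy-energy optimal point.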
Generative adversarial networks (GANs) have a wide range of applications, from image style transfer to synthetic voice generation [1]. GAN applications on mobile devices, such as face-to-emoji conversion and super-resolution imaging, enable more engaging user interaction. As shown in Fig. 7.4.1, a GAN consists of 2 competing deep neural networks (DNNs): a generator and a discriminator. The discriminator is trained, while the generator is fixed, to distinguish whether a generated image is real or fake. The generator, on the other hand, is trained to generate fake images...
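The two competing objectives can be written down concretely; the sketch below shows the standard discriminator loss and the non-saturating generator loss (an illustrative formulation, not the specific training recipe of the processor described here).

```python
import numpy as np

def discriminator_loss(d_real, d_fake):
    """Discriminator objective: push scores on real images toward 1
    and scores on generated (fake) images toward 0."""
    eps = 1e-12  # avoid log(0)
    return -np.mean(np.log(d_real + eps) + np.log(1.0 - d_fake + eps))

def generator_loss(d_fake):
    """Non-saturating generator objective: with the discriminator held
    fixed, push its score on fake images toward 1 (i.e. fool it)."""
    eps = 1e-12
    return -np.mean(np.log(d_fake + eps))

# A confident, correct discriminator has low loss; a fooled
# discriminator gives the generator a low loss instead.
```

Training alternates between the two losses, which is why the abstract describes one network as trained while the other is fixed.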
Spiking neural networks (SNNs) have been studied for a long time, and have recently been shown to achieve the same accuracy as convolutional neural networks (CNNs). By using CNN-to-SNN conversion, SNNs become a promising candidate for ultra-low-power AI applications [1]. For example, compared with BNNs or XNOR-nets, SNNs provide lower power consumption and higher accuracy [2]. This is because SNNs perform spike-based event-driven operation with high spike sparsity, unlike the CNN's frame-driven operation. Fig. 22.5.1 shows that the energy of an SNN...
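The frame-driven versus event-driven distinction can be made concrete with a small sketch (a functional model only, assuming binary spikes): the event-driven path touches only the weight rows where a spike occurred, so work scales with spike count rather than layer size.

```python
import numpy as np

def frame_driven(acts, weights):
    # CNN-style: every input participates in every MAC.
    return acts @ weights

def event_driven(spike_idx, weights):
    # SNN-style: accumulate only the weight rows with a spike, so the
    # number of additions tracks spike sparsity, not the layer width.
    return weights[spike_idx].sum(axis=0)

acts = np.array([0, 1, 0, 0, 1, 0, 0, 0])   # binary spike frame
w = np.arange(8 * 3).reshape(8, 3)
spikes = np.flatnonzero(acts)               # event list: [1, 4]
assert np.array_equal(frame_driven(acts, w), event_driven(spikes, w))
```

Here 2 of 8 inputs spike, so the event-driven path does a quarter of the row accumulations, which is the mechanism behind the energy advantage the abstract describes.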
In-memory computing (IMC) processors show significant energy and area efficiency for deep neural network (DNN) processing [1–3]. As shown in Fig. 16.5.1, despite promising macro-level throughput, three main challenges remain in extending these gains to system performance with a high integration level. First, most previous works had a fixed configuration size of IMC macros; when the macro was smaller than the DNN layer's dimension, repetitive memory accesses were required for the input/output activations (IA/OA), consuming >40% of power....
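The repetitive-access overhead from fixed-size macros follows from simple tiling arithmetic; the sketch below is an illustrative model (not the paper's exact accounting) of how often input activations must be re-fetched when a layer is larger than the macro.

```python
import math

def ia_fetch_count(rows, cols, macro_rows, macro_cols):
    """Input-activation fetches when a rows x cols weight matrix is
    tiled over a fixed macro: every column tile re-reads the inputs."""
    row_tiles = math.ceil(rows / macro_rows)
    col_tiles = math.ceil(cols / macro_cols)
    ia_fetches = rows * col_tiles        # inputs streamed once per column tile
    return ia_fetches, row_tiles * col_tiles

# A 1024x1024 layer on a 256x256 macro needs 4x4 = 16 macro tiles,
# and the 1024 inputs are streamed in 4 times instead of once.
fetches, tiles = ia_fetch_count(1024, 1024, 256, 256)
```

A reconfigurable macro size shrinks the tile count for a given layer, which is exactly the mismatch the abstract identifies as costing >40% of power.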
An activatable fluorescent probe derived from indocyanine was developed for the detection of tumor-enriched γ-glutamyltranspeptidase (γGT). The probe exhibited a dramatic fluorescence enhancement (F/F0 = 10) as well as a bathochromic shift (>100 nm) upon treatment with γGT, with a low detection limit of 0.15 unit/L, and was further successfully applied to sensitive detection in a mouse model of colon cancer.
An energy-efficient floating-point DNN training processor is proposed with a heterogeneous bfloat16 computing architecture using exponent computing-in-memory (CIM) and a mantissa processing engine. Mantissa-free exponent calculation enables pipelining of the operation while reducing MAC power by 14.4%. 6T SRAM bitline charge reusing reduces memory access power by 46.4%. The processor, fabricated in 28 nm CMOS technology, occupies 1.62×3.6 mm²...
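The exponent/mantissa split that such a heterogeneous bfloat16 datapath exploits can be seen by unpacking the format (a sketch using Python's struct module; the chip's CIM/engine partition is not modeled here): bfloat16 keeps float32's 8-bit exponent but truncates the mantissa to 7 bits, so the two fields are small enough to route to separate compute paths.

```python
import struct

def bfloat16_fields(x):
    """Split a float into bfloat16 sign / exponent / mantissa fields.

    bfloat16 is the top 16 bits of the IEEE-754 binary32 encoding:
    1 sign bit, 8 exponent bits, 7 mantissa bits.
    """
    bits = struct.unpack(">I", struct.pack(">f", x))[0] >> 16
    sign = bits >> 15
    exponent = (bits >> 7) & 0xFF   # biased by 127, as in float32
    mantissa = bits & 0x7F          # truncated 7-bit fraction
    return sign, exponent, mantissa

# 1.0 encodes as sign 0, exponent 127 (bias), mantissa 0.
```

Because the exponent field is identical to float32's, exponent-only arithmetic (e.g. in a CIM array) needs no renormalization logic, while the short mantissa keeps the multiplier small.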
Recently, transformer-based large language models (LLMs), shown in Fig. 20.5.1, are widely used, and even on-device LLM systems with real-time responses are anticipated [1]. Many transformer processors [2–4] enhance energy efficiency by increasing hardware utilization and reducing power consumption, but their system power consumption and response time are still not suitable for mobile devices. Since LLMs, such as GPT-2, have many parameters (400-700M), external memory access (EMA) consumes 68% of the total power....
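Why EMA dominates is clear from a back-of-envelope model; the sketch below is illustrative only, and the 10 pJ/byte DRAM access cost is an assumed round number, not a figure from the paper.

```python
def ema_energy_mj(params_millions, bytes_per_param, pj_per_byte, tokens):
    """Rough external-memory-access energy for autoregressive decoding.

    Worst-case assumption: every weight is streamed from DRAM once
    per decoded token (no on-chip weight reuse across tokens).
    """
    bytes_moved = params_millions * 1e6 * bytes_per_param * tokens
    return bytes_moved * pj_per_byte * 1e-9   # pJ -> mJ

# 400M parameters in 16-bit weights, one decoded token, at an assumed
# 10 pJ/byte: roughly 8 mJ of DRAM energy per decoding step.
```

With hundreds of millions of parameters re-read every token, weight traffic swamps compute energy, which is why mobile LLM processors attack EMA first.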
This study presents a comparison of pressure sensors for the measurement of sloshing impact pressure. For the study, four sensors are used: one piezoresistive sensor, one piezoelectric sensor, and two integrated circuit piezoelectric (ICP) sensors. They are installed on the wall and ceiling of a rectangular tank with narrow breadth. Several types of studies are carried out, including sensitivity to temperature differences between the sensor and the test medium. Forced regular and irregular motions are then applied with partial water filling, and the signals due to sloshing impacts are measured at different filling...
This article presents the generative adversarial network processing unit (GANPU), an energy-efficient multiple-DNN training processor for GANs. It enables on-device training of GANs on performance- and battery-limited mobile devices, without sending user-specific data to servers, fully evading privacy concerns. Training GANs requires a massive amount of computation; therefore, it is difficult to accelerate on a resource-constrained platform. Besides, networks and layers show dramatically changing operational...
Seoul National University has conducted a considerable number of six-degree-of-freedom irregular small-scale sloshing model tests at 1/70–1/25 scales, particularly focusing on the tanks of liquefied natural gas (LNG) carriers. An experimental database has been created to provide information on load severity, obtained from a large set of post-processed results. In this paper, a summary of the database is described. An artificial neural network is trained on the database to predict sloshing load severity. Various attributes that affect the results...
An energy-efficient neuromorphic computing-in-memory (CIM) processor is proposed with four key features: 1) most significant bit (MSB) word skipping to reduce the BL activity; 2) early stopping to enable lower power; 3) mixed-mode firing for multi-macro aggregation; and 4) voltage folding to extend the dynamic range. The CIM achieves state-of-the-art energy efficiency of 62.1 TOPS/W (I=4b, W=8b) and 310.4 TOPS/W (W=1b).
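MSB-first processing with early stopping can be illustrated by a small functional sketch (a hypothetical simplification assuming unsigned weights and non-negative inputs, not the chip's datapath): scanning weight bits from the MSB down, the firing decision is often fixed before the low-order bits are ever read.

```python
def fires_msb_first(inputs, weights, w_bits, threshold):
    """Decide whether a neuron fires, scanning weight bits MSB-first.

    After each bit plane, bound what the remaining low bits could
    still contribute; stop as soon as the outcome cannot change.
    Returns (fires, bit_index_where_decision_was_made).
    """
    acc = 0
    total_in = sum(inputs)
    for b in range(w_bits - 1, -1, -1):
        plane = [(w >> b) & 1 for w in weights]
        acc += sum(x * p for x, p in zip(inputs, plane)) << b
        remaining_max = total_in * ((1 << b) - 1)  # all lower bits = 1
        if acc >= threshold:
            return True, b              # already over threshold
        if acc + remaining_max < threshold:
            return False, b             # can never reach threshold
    return acc >= threshold, 0
```

Skipping the low bit planes saves exactly the bitline activity that features 1) and 2) in the abstract target.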
A highly energy-efficient neuromorphic computing-in-memory (Neuro-CIM) processor is proposed for ultralow-power deep learning applications. Neuro-CIM supports spiking neural networks (SNNs) to eliminate the power and area overhead of previous CIM processors. Sign-extended bit gating reduces the bitline (BL) voltage switching rate for negative and small-magnitude weights, allowing a 38% reduction at the 8-b weight condition and 25% at the 4-b condition. In addition, it replaces high-precision analog-to-digital...
A low-power face recognition (FR) convolutional neural network (CNN) processor is proposed with high efficiency to achieve always-on FR in mobile devices. Three key features enable a power-efficient CNN. First, tile-based hierarchical clustering (THC) reduces the computation overhead of hierarchical clustering; it generates an average of 37.2% duplicated input across the entire network. Second, a low-latency core is proposed. It supports an approximated method that removes distance updates and increases pipeline utilization by...
An energy-efficient deep-neural-network (DNN) learning processor is proposed for on-chip and iterative weight pruning (WP). This work has three key features: 1) stochastic coarse-fine pruning reduces the computation workload by 99.7% compared with the previous WP algorithm while maintaining high sparsity; 2) adaptive input/output/weight skipping (AIOWS) achieves 30.1× higher throughput than a previous DNN processor [1], not only in inference but also in learning; and 3) a memory-shared pruning unit removes memory access for WP. As a result, this work shows...
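Iterative weight pruning can be sketched in a few lines (a generic magnitude-based schedule for illustration; the paper's stochastic coarse-fine algorithm and on-chip retraining are not modeled): sparsity is ramped up over several steps rather than applied in one shot.

```python
import numpy as np

def iterative_prune(w, target_sparsity, steps):
    """Iteratively zero the smallest-magnitude weights.

    Each step prunes to an intermediate sparsity level, mimicking a
    coarse-to-fine schedule; retraining between steps is omitted.
    """
    w = w.copy()
    for s in range(1, steps + 1):
        sparsity = target_sparsity * s / steps
        k = int(sparsity * w.size)        # weights to zero this step
        if k == 0:
            continue
        thresh = np.sort(np.abs(w), axis=None)[k - 1]
        w[np.abs(w) <= thresh] = 0.0
        # on-chip retraining of surviving weights would go here
    return w

rng = np.random.default_rng(0)
w = rng.normal(size=100)
pruned = iterative_prune(w, 0.9, steps=3)   # ~90% of weights zeroed
```

The zeroed weights are what input/output/weight skipping then exploits: any MAC whose weight is zero can be gated off in both inference and learning.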
This article presents DynaPlasia, a reconfigurable eDRAM-based in-memory computing (IMC) processor with a novel triple-mode cell. It enables higher system-level performance and efficiency in resource-limited environments. DynaPlasia proposes five key features that can enhance the energy and area efficiency of an IMC accelerator: 1) dynamic reconfigurable core architecture (DRECA), which dynamically reconfigures the effective macro size according to DNN workloads; 2) a triple-mode cell that can be reconfigured as a PE, a memory unit, or a DAC to optimize system resource...
This paper considers scale effects on three-dimensional (3D) sloshing flows. A series of model tests were conducted for three differently scaled tanks. The tanks considered in this study are 1:70, 1:50, and 1:30 scale membrane-type tanks based on a 138,000 m³ liquefied natural gas carrier model. The tests were carried out under harmonic sway and roll motions at different filling depths with various excitation frequencies. The pressure measuring points are the same, as if they were scaled up to actual size. The main parameters investigated are the peak pressure, rise time, and sampled...
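Comparing differently scaled tanks presumes a similarity law; under Froude scaling (the usual choice for gravity-dominated sloshing), model-scale pressures and times map to full scale as sketched below. The helper and its kPa/ms units are illustrative, not the paper's procedure.

```python
import math

def froude_full_scale(p_model_kpa, t_model_ms, scale):
    """Convert model-scale peak pressure and rise time to full scale
    under Froude similarity with the same fluid and gravity.

    For a length-scale ratio lam = L_full / L_model:
    pressure scales by lam, time by sqrt(lam).
    """
    lam = float(scale)
    return p_model_kpa * lam, t_model_ms * math.sqrt(lam)

# A 10 kPa peak with a 2 ms rise time in a 1:50 model maps to
# 500 kPa and about 14.1 ms at full scale.
p_full, t_full = froude_full_scale(10.0, 2.0, 50)
```

Deviations of measured peaks from this mapping across the 1:70, 1:50, and 1:30 tanks are precisely what a scale-effect study quantifies.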
This article presents a low-power, low-distortion, and compact mixed-signal sinusoidal current generator (CG) IC for bio-impedance (Bio-Z) sensing applications. By utilizing digital ΔΣ modulation to bridge digitally synthesized sinewave data to an analog-domain voltage output, the implementation of a low-distortion lookup...
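The digital ΔΣ bridge can be illustrated with a first-order software model (a behavioral sketch, not the IC's modulator order or LUT size, which are assumptions here): a lookup-table sinewave is oversampled and encoded as a ±1 bitstream whose low-frequency content tracks the sine.

```python
import math

def delta_sigma_1bit(samples):
    """First-order digital delta-sigma modulator: encode samples in
    [-1, 1] as a +/-1 stream; the quantization error is integrated
    and fed back, pushing the noise to high frequencies."""
    out, integ, y = [], 0.0, 0.0
    for x in samples:
        integ += x - y                  # accumulate quantization error
        y = 1.0 if integ >= 0 else -1.0
        out.append(y)
    return out

# 64-entry sinewave LUT, zero-order-hold oversampled 16x.
n, osr = 64, 16
lut = [0.8 * math.sin(2 * math.pi * k / n) for k in range(n)]
stream = delta_sigma_1bit([v for v in lut for _ in range(osr)])
# A short moving average of the bitstream recovers each LUT value;
# an analog low-pass filter plays that role on chip.
```

Driving a 1-bit DAC with such a stream needs only a low-pass reconstruction filter, which is how a digital LUT can produce a low-distortion analog sinewave cheaply.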
This article presents MetaVRain, a low-power neural 3-D rendering processor for metaverse realization on mobile devices. MetaVRain mainly focuses on solving the high operational intensity problem that appears during neural radiance fields (NeRF)-based rendering. It imitates brain-inspired visual perception processes and constructs a new NeRF acceleration architecture, bundle-frame-familiarity (BuFF). A built-in visual perception core (VPC) realizes the BuFF architecture by accelerating three stages: 1) spatial attention...