NFDI4DS | UHH-SEMS - Publication Details

Donghyun Seong

ORCID: 0000-0002-0474-6964

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5019598209

Research Areas

CCD and CMOS Imaging Sensors
Image Processing Techniques and Applications
Natural Language Processing Techniques
Analytical Chemistry and Sensors
Advanced Optical Sensing Technologies
Advanced Memory and Neural Computing
Infrared Target Detection Methodologies
Optical Coherence Tomography Applications
Speech and Audio Processing
Speech Recognition and Synthesis
Digital Holography and Microscopy
Thin-Film Transistor Technologies
Gas Sensing Nanomaterials and Sensors
Music and Audio Processing
Optical measurement and interference techniques
Topic Modeling
Text and Document Classification Technologies

Hanyang University
2024

Kyungpook National University
2018-2019

Adversarial Learning on Compressed Posterior Space for Non-Iterative Score-based End-to-End Text-to-Speech

OPENALEX - Publications

Won-Gook Choi Donghyun Seong Joon‐Hyuk Chang

Score-based generative models have shown the real-like quality of synthesized speech in text-to-speech (TTS) area. However, critical artifact score-based is requirement a high computational cost due to iterative sampling algorithm, and it also makes difficult fine-tune TTS-optimized vocoder. In this study, we propose method joint training TTS model HiFi-GAN using compressed log-mel features, guarantees significant even on non-iterative sampling. As result, proposed overcomes some digital...

10.1109/icassp48485.2024.10446958 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024-03-18

In-Pixel Aperture CMOS Image Sensor for 2-D and 3-D Imaging

OPENALEX - Publications

Byoung-Soo Choi Sang‐Hwan Kim Jimin Lee Donghyun Seong Jang‐Kyoo Shin and 5 more

This paper presents a CMOS image sensor with the in-pixel aperture technique for single-chip 2-D and 3-D imaging. In conventional sensors, is located at camera lens. However, in proposed sensor, integrated on chip formed metal layer of (CIS) process. A pixel array composed W, R, B PA pixels (W aperture) extracting color depth information. While W becomes blurred increasing distance from focused object, maintains sharpness. Therefore, can be obtained using defocus method. The size pixel,...

10.1109/jsen.2018.2869383 article EN IEEE Sensors Journal 2018-09-21

CMOS image sensor for extracting depth information using pixel aperture technique

OPENALEX - Publications

Byoung-Soo Choi Sang‐Hwan Kim Jimin Lee Donghyun Seong Jang‐Kyoo Shin and 3 more

In this paper, complementary metal oxide semiconductor (CMOS) image sensors that can extract depth information using the pixel aperture technique is presented. The array of proposed sensor composed blue, red, and white pixels, as well apertures. apertures are formed by pattern in pixels. focused defocused images obtained, simultaneously, sensor, used for calculating information. was designed fabricated 0.11-μm CMOS process, its performance evaluated.

10.1109/i2mtc.2018.8409654 article EN 2022 IEEE International Instrumentation and Measurement Technology Conference (I2MTC) 2018-05-01

TSP-TTS: Text-based Style Predictor with Residual Vector Quantization for Expressive Text-to-Speech

OPENALEX - Publications

Donghyun Seong Ho‐Young Lee Joon‐Hyuk Chang

Expressive text-to-speech (TTS) aims to synthesize better human-like speech by incorporating diverse styles or emotions. While most expressive TTS models rely on reference condition the style of generated speech, they often fail generate regular quality. To ensure consistent quality, we propose an conditioned representation extracted from text itself. implement this text-based predictor, design a module residual vector quantization. Furthermore, is enhanced through style-to-text alignment...

10.21437/interspeech.2024-1734 article EN Interspeech 2022 2024-09-01

H4C-TTS: Leveraging Multi-Modal Historical Context for Conversational Text-to-Speech

OPENALEX - Publications

Donghyun Seong Joon‐Hyuk Chang

Conversational text-to-speech (TTS) aims to synthesize natural voices appropriate a situation by considering the context of past conversations as well current text. However, analyzing and modeling conversation remains challenging. Most conversational TTS use content historical recent without distinguishing between them often generate speech that does not fit situation. Hence, we introduce novel TTS, H4C-TTS, leverages multi-modal realize contextually synthesis. To facilitate modeling, design...

10.21437/interspeech.2024-1480 article EN Interspeech 2022 2024-09-01

Effects of Aperture Diameter on Image Blur of CMOS Image Sensor With Pixel Apertures

OPENALEX - Publications

Byoung-Soo Choi Jang‐Kyoo Shin Sang‐Hwan Kim Jimin Lee Donghyun Seong and 5 more

This paper presents the effects of aperture diameter on image blur complementary metal-oxide-semiconductor (CMOS) sensor with pixel apertures for depth extraction. In a conventional camera system, is located at lens. However, in proposed photodiode. The patterns array fabricated CIS are composed blue, red, and white pixels, as well apertures. formed by metal pattern pixels designed using layer process. focused defocused images simultaneously obtained without can be used reference to extract...

10.1109/tim.2019.2905708 article EN IEEE Transactions on Instrumentation and Measurement 2019-04-17

Wide Dynamic Range CMOS Image Sensor with Adjustable Sensitivity Using Cascode MOSFET and Inverter

OPENALEX - Publications

Donghyun Seong Jimin Lee

In this paper, a wide dynamic range complementary metal-oxide-semiconductor (CMOS) image sensor with the adjustable sensitivity by using cascode field-effect transistor (MOSFET) and inverter is proposed. The characteristics of CMOS were analyzed through experimental results. proposed active pixel consists eight transistors operated under various light intensity conditions. MOSFET as constant current source. generated from varies intensity. has high illumination owing to logarithmic response...

10.5369/jsst.2018.27.3.160 article EN Journal of Sensor Science and Technology 2018-01-01

CMOS Binary Image Sensor with Gate/Body-Tied PMOSFET-Type Photodetector for Low-Power and Low-Noise Operation

OPENALEX - Publications

Junwoo Lee Choipyung Byoung-Soo Choi Donghyun Seong Jewon Lee and 3 more

A complementary metal oxide semiconductor (CMOS) binary image sensor is proposed for low-power and low-noise operation. The has the advantages of reduced power consumption fixed pattern noise (FPN). gate/body-tied (GBT) p-channel metal-oxide-semiconductor field-effect transistor (PMOSFET)-type photodetector used as CMOS sensor. GBT PMOSFET-type a floating gate that amplifies photocurrent generated by incident light. Therefore, sensitivity higher than other photodetectors. consists pixel...

10.5369/jsst.2018.27.6.362 article EN Journal of Sensor Science and Technology 2018-01-01

Effects of aperture size on the performance of CMOS image sensor with pixel aperture for depth extraction

OPENALEX - Publications

Byoung-Soo Choi Sang‐Hwan Kim Jimin Lee Donghyun Seong Seunghyuk Chang and 3 more

Effects of aperture size on the performance CMOS image sensor with pixel for depth extraction are investigated. In general, is related to resolution and sensitivity sensor. As decreases, improved decreases. To optimize size, optical simulation using finite-difference time-domain method was implemented. The performed various sizes from 0.3 μm 1.1 power incidence angle as a function evaluated. Based results, designed fabricated 0.11 process. effects investigated by comparison measurement results.

10.1117/12.2500229 article EN 2018-09-17

Coming Soon ...