Donghyun Seong

ORCID: 0000-0002-0474-6964
Research Areas
  • CCD and CMOS Imaging Sensors
  • Image Processing Techniques and Applications
  • Natural Language Processing Techniques
  • Analytical Chemistry and Sensors
  • Advanced Optical Sensing Technologies
  • Advanced Memory and Neural Computing
  • Infrared Target Detection Methodologies
  • Optical Coherence Tomography Applications
  • Speech and Audio Processing
  • Speech Recognition and Synthesis
  • Digital Holography and Microscopy
  • Thin-Film Transistor Technologies
  • Gas Sensing Nanomaterials and Sensors
  • Music and Audio Processing
  • Optical Measurement and Interference Techniques
  • Topic Modeling
  • Text and Document Classification Technologies

Hanyang University
2024

Kyungpook National University
2018-2019

Score-based generative models have shown real-like quality of synthesized speech in the text-to-speech (TTS) area. However, a critical artifact of score-based models is the requirement of a high computational cost due to the iterative sampling algorithm, and it also makes it difficult to fine-tune a TTS-optimized vocoder. In this study, we propose a method of joint training of the TTS model and HiFi-GAN using compressed log-mel features, which guarantees significant quality even on non-iterative sampling. As a result, the proposed method overcomes some digital...

10.1109/icassp48485.2024.10446958 article EN ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024-03-18

This paper presents a CMOS image sensor with the in-pixel aperture technique for single-chip 2-D and 3-D imaging. In conventional image sensors, the aperture is located at the camera lens. However, in the proposed image sensor, the aperture is integrated on the chip and is formed in the metal layer of the CMOS image sensor (CIS) process. A pixel array is composed of W, R, B, and PA pixels (W pixels with the pixel aperture) for extracting color and depth information. While the W pixel image becomes blurred with increasing distance from the focused object, the PA pixel image maintains sharpness. Therefore, depth information can be obtained using the depth from defocus method. The size of the pixel,...

10.1109/jsen.2018.2869383 article EN IEEE Sensors Journal 2018-09-21
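
The depth-from-defocus idea in the abstract above can be illustrated with a short sketch. It assumes an ideal thin-lens camera, which is a textbook simplification and not the model used in the paper; the function names and all parameter values are hypothetical. Under this assumption the blur-circle diameter grows with distance from the focused plane and scales with the aperture diameter, so a small aperture stays comparatively sharp while a wide one blurs, and the blur of the wide-aperture image can be inverted to a distance.

```python
def blur_diameter(aperture_d, focal_len, focus_dist, object_dist):
    """Blur-circle diameter on the sensor for an ideal thin lens.

    All lengths share one unit (e.g. mm). This is the standard
    circle-of-confusion relation, used here as an assumed model.
    """
    if focus_dist <= focal_len:
        raise ValueError("focus_dist must exceed focal_len")
    if object_dist <= 0:
        raise ValueError("object_dist must be positive")
    return (aperture_d * focal_len * abs(object_dist - focus_dist)
            / (object_dist * (focus_dist - focal_len)))


def depth_from_blur(aperture_d, focal_len, focus_dist, blur_d):
    """Invert blur_diameter for objects farther away than the focused plane.

    Assumption: the object lies beyond focus_dist, which removes the
    near/far ambiguity of the absolute value in blur_diameter.
    """
    denom = aperture_d * focal_len - blur_d * (focus_dist - focal_len)
    if denom <= 0:
        raise ValueError("blur too large for this lens model")
    return aperture_d * focal_len * focus_dist / denom
```

For example, with a hypothetical 25 mm lens focused at 500 mm, `blur_diameter(10.0, 25.0, 500.0, 1000.0)` is ten times larger than `blur_diameter(1.0, 25.0, 500.0, 1000.0)`, and passing the first value to `depth_from_blur` recovers the 1000 mm object distance.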

In this paper, a complementary metal oxide semiconductor (CMOS) image sensor that can extract depth information using the pixel aperture technique is presented. The pixel array of the proposed sensor is composed of blue, red, and white pixels, as well as pixel apertures. The pixel apertures are formed by a metal pattern in the white pixels. The focused and defocused images are obtained simultaneously by the proposed sensor and are used for calculating depth information. The sensor was designed and fabricated using a 0.11-μm CMOS process, and its performance was evaluated.

10.1109/i2mtc.2018.8409654 article EN 2018 IEEE International Instrumentation and Measurement Technology Conference (I2MTC) 2018-05-01

Expressive text-to-speech (TTS) aims to synthesize better human-like speech by incorporating diverse styles or emotions. While most expressive TTS models rely on reference speech to condition the style of the generated speech, they often fail to generate speech of regular quality. To ensure consistent speech quality, we propose an expressive TTS conditioned on a style representation extracted from the text itself. To implement this text-based style predictor, we design a style module with residual vector quantization. Furthermore, the style representation is enhanced through style-to-text alignment...

10.21437/interspeech.2024-1734 article EN Interspeech 2024 2024-09-01
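
Residual vector quantization, named in the abstract above, can be sketched generically as follows. This is a minimal illustration of the general technique, not the paper's style module: the codebooks are random rather than learned, and every function name, size, and seed is a hypothetical choice. Each stage picks the nearest codebook entry to the residual left by the previous stage, and decoding sums the selected entries.

```python
import random


def nearest(codebook, vec):
    """Index of the codebook entry closest to vec in squared L2 distance."""
    def dist(entry):
        return sum((a - b) ** 2 for a, b in zip(entry, vec))
    return min(range(len(codebook)), key=lambda i: dist(codebook[i]))


def rvq_encode(vec, codebooks):
    """Quantize vec stage by stage; each stage codes the previous residual."""
    residual = list(vec)
    indices = []
    for codebook in codebooks:
        idx = nearest(codebook, residual)
        indices.append(idx)
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return indices, residual


def rvq_decode(indices, codebooks):
    """Reconstruct by summing the selected entry from every stage."""
    out = [0.0] * len(codebooks[0][0])
    for idx, codebook in zip(indices, codebooks):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out


def make_codebooks(num_stages, size, dim, seed=0):
    """Random stand-in codebooks (a real system would learn them).

    Entry 0 of each stage is the zero vector, so a stage can never
    make the residual larger; later stages use a smaller scale.
    """
    rng = random.Random(seed)
    books = []
    for stage in range(num_stages):
        scale = 0.5 ** stage
        book = [[0.0] * dim]
        book += [[rng.gauss(0.0, scale) for _ in range(dim)]
                 for _ in range(size - 1)]
        books.append(book)
    return books
```

A vector is then represented by one small integer per stage: `rvq_encode(vec, make_codebooks(4, 16, 8))` returns four indices plus the leftover residual, and `rvq_decode` maps the indices back to an approximation of `vec`.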

Conversational text-to-speech (TTS) aims to synthesize natural voices appropriate to a situation by considering the context of past conversations as well as the current text. However, analyzing and modeling the context of a conversation remains challenging. Most conversational TTS systems use the content of historical and recent conversations without distinguishing between them and often generate speech that does not fit the situation. Hence, we introduce a novel conversational TTS, H4C-TTS, that leverages multi-modal context to realize contextually appropriate speech synthesis. To facilitate context modeling, we design...

10.21437/interspeech.2024-1480 article EN Interspeech 2024 2024-09-01

This paper presents the effects of aperture diameter on the image blur of a complementary metal-oxide-semiconductor (CMOS) image sensor with pixel apertures for depth extraction. In a conventional camera system, the aperture is located at the camera lens. However, in the proposed sensor, the aperture is located at the photodiode. The pixel patterns of the array in the fabricated CIS are composed of blue, red, and white pixels, as well as pixel apertures. The pixel apertures are formed by a metal pattern in the white pixels and are designed using the metal layer of the CIS process. The focused and defocused images are simultaneously obtained with and without the pixel apertures and can be used as a reference to extract...

10.1109/tim.2019.2905708 article EN IEEE Transactions on Instrumentation and Measurement 2019-04-17

In this paper, a wide dynamic range complementary metal-oxide-semiconductor (CMOS) image sensor with adjustable sensitivity, using a cascode metal-oxide-semiconductor field-effect transistor (MOSFET) and an inverter, is proposed. The characteristics of the proposed CMOS image sensor were analyzed through experimental results. The proposed active pixel sensor consists of eight transistors operated under various light intensity conditions. The cascode MOSFET is operated as a constant current source. The current generated from the cascode MOSFET varies with the light intensity. The proposed CMOS image sensor has a wide dynamic range under high illumination owing to the logarithmic response...

10.5369/jsst.2018.27.3.160 article EN Journal of Sensor Science and Technology 2018-01-01

A complementary metal oxide semiconductor (CMOS) binary image sensor is proposed for low-power and low-noise operation. The binary image sensor has the advantages of reduced power consumption and fixed pattern noise (FPN). A gate/body-tied (GBT) p-channel metal-oxide-semiconductor field-effect transistor (PMOSFET)-type photodetector is used as the photodetector of the CMOS binary image sensor. The GBT PMOSFET-type photodetector has a floating gate that amplifies the photocurrent generated by incident light. Therefore, its sensitivity is higher than that of other photodetectors. The CMOS binary image sensor consists of a pixel...

10.5369/jsst.2018.27.6.362 article EN Journal of Sensor Science and Technology 2018-01-01

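Effects of aperture size on the performance of a CMOS image sensor with pixel aperture for depth extraction are investigated. In general, the aperture size is related to the resolution and the sensitivity of the CMOS image sensor. As the aperture size decreases, the resolution is improved and the sensitivity decreases. To optimize the aperture size, an optical simulation using the finite-difference time-domain method was implemented. The optical simulation was performed for various aperture sizes from 0.3 μm to 1.1 μm, and the optical power with the incidence angle was evaluated as a function of the aperture size. Based on the simulation results, the CMOS image sensor was designed and fabricated using a 0.11 μm CMOS image sensor process. The effects of aperture size were investigated by comparison of the simulation and measurement results.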

10.1117/12.2500229 article EN 2018-09-17