NFDI4DS | UHH-SEMS - Publication Details

A. Lee

ORCID: 0000-0001-8032-2170

About

Contact & Profiles

A5016843118

Research Areas

Speech Recognition and Synthesis
Speech and Audio Processing
Music and Audio Processing
Blind Source Separation Techniques
Speech and dialogue systems
Adolescent and Pediatric Healthcare
Mental Health and Patient Involvement
Robotics and Automated Systems
Homelessness and Social Issues
Natural Language Processing Techniques
AI in Service Interactions
Topic Modeling
Advanced Adaptive Filtering Techniques

University Medical Center Utrecht
2024

Nara Institute of Science and Technology
2002-2006

Blind source separation based on a fast-convergence algorithm combining ICA and beamforming

OPENALEX - Publications

Hiroshi Saruwatari Tatsuyuki Kawamura Tsuyoki Nishikawa A. Lee Kiyohiro Shikano

We propose a new algorithm for blind source separation (BSS), in which independent component analysis (ICA) and beamforming are combined to resolve the slow-convergence problem through optimization in ICA. The proposed method consists of the following three parts: (a) frequency-domain ICA with direction-of-arrival (DOA) estimation, (b) null beamforming based on the estimated DOA, and (c) integration of (a) and (b) based on the algorithm diversity in both iteration and frequency domain. The unmixing matrix obtained by ICA is temporally substituted by the matrix based on null beamforming through iterative optimization,...

10.1109/tsa.2005.855832 article EN IEEE Transactions on Audio Speech and Language Processing 2006-02-21

Public speech-oriented guidance system with adult and child discrimination capability

Ryuichi Nisimura A. Lee Hiroshi Saruwatari Kiyohiro Shikano

The Takemaru-kun system is a real-world speech-oriented guidance system located at the Ikoma-City North Community Center. The system has been operated daily from November 2002 to provide visitors with a speech interface for information retrieval. This system also aims at a field test of a speech interface and at collecting actual utterance data. By analyzing and evaluating the collected utterances, flexible processing requirements are discovered according to the user's age group. It becomes impossible to disregard the increase of child users when the system is installed in public...

10.1109/icassp.2004.1326015 article EN IEEE International Conference on Acoustics Speech and Signal Processing 2004-09-28

Gaussian mixture selection using context-independent HMM

A. Lee Tatsuya Kawahara Kiyohiro Shikano

We address a method to efficiently select Gaussian mixtures for fast acoustic likelihood computation. It makes use of context-independent models for selection and back-off of the corresponding triphone models. Specifically, for the k-best phone models by the preliminary evaluation, triphone models of higher resolution are applied, and the others are assigned likelihoods with the monophone models. This selection scheme assigns more reliable likelihoods to the un-selected states than the conventional selection based on a VQ codebook. It can also incorporate efficient pruning at the preliminary evaluation, which offsets the increased size...

10.1109/icassp.2001.940769 article EN 2002-11-13

ASKA: receptionist robot with speech dialogue system

Ryuichi Nisimura Takashi Uchida A. Lee Hiroshi Saruwatari Kiyohiro Shikano and 1 more

We implemented a humanoid robot, ASKA, at our university reception desk for computerized guidance. ASKA can recognize a user's question utterance and answer the question with its text-to-speech voice, hand gestures, and head movement. This paper describes the speech-related parts of ASKA. ASKA can deal with a wide task domain of 20k large vocabulary using a word trigram model and an elaborated speaker-independent acoustic model. ASKA can also make a response using keyword and key-phrase detection in the N-best speech recognition results. The rate is 90.9%,...

10.1109/irds.2002.1043936 article EN 2003-06-25

Real-time word confidence scoring using local posterior probabilities on tree trellis search

A. Lee Kiyohiro Shikano Tatsuya Kawahara

Confidence scoring based on word posterior probability is usually performed as a post-process of speech recognition decoding, and also needs a large number of word hypotheses to get enough confidence quality. We propose a simple way of computing the word confidence using estimated posterior probability while decoding. At the word expansion of the stack decoding search, the local sentence likelihoods that contain heuristic scores of the unreached segment are directly used to compute the posterior probabilities. Experimental results showed that, although the likelihoods are not optimal, we can provide...

10.1109/icassp.2004.1326105 article EN IEEE International Conference on Acoustics Speech and Signal Processing 2004-09-28

Accurate hidden Markov models for non-audible murmur (NAM) recognition based on iterative supervised adaptation

Panikos Heracleous Y. Nakajima A. Lee Hiroshi Saruwatari Kiyohiro Shikano

In previous works, we introduced a special device (Non-Audible Murmur (NAM) microphone) able to detect very quietly uttered speech (murmur), which cannot be heard by listeners near the talker. Experimental results showed the efficiency of the device in NAM recognition. Using normal-speech monophone hidden Markov models (HMM) retrained with NAM data from a specific speaker, we could recognize NAM with high accuracy. Although the results were promising, a serious problem is the HMM retraining, which requires a large amount of training data. In this paper,...

10.1109/asru.2003.1318406 article EN 2004-09-07

Noise-robust hands-free speech recognition based on spatial subtraction array and known noise superimposition

Y. Ohashi Tsuyoki Nishikawa Hiroshi Saruwatari A. Lee Kiyohiro Shikano

We propose a spatial subtraction array (SSA) and known noise superimposition to achieve noise-robust hands-free speech recognition which can be used in human-robot interaction. In the proposed SSA, noise reduction is achieved by subtracting the estimated noise power spectrum from the target speech power spectrum to be enhanced in the mel-scale filter bank domain. This offers a realization of error-robust spectral subtraction with few computational complexities. In addition, we introduce a known noise superimposition technique in the mel-scale filter bank domain, and utilize the matched acoustic model for the known noise. compensate...

10.1109/iros.2005.1545036 article EN 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems 2005-01-01

Listening to the Voice of the Patient in RSV Research

A. Lee Rachael Thomas Bowen Chung Louis Bont

Patient and public involvement in research refers to patients or caregivers with disease experience contributing to the design, conduct, and dissemination of results from research. It has given rise to new fields of healthcare-oriented research with the potential to transform infectious diseases research through interventional trials. Our recommendations and best practices from years of organizing respiratory syncytial virus parent networks are provided.

10.1097/inf.0000000000004512 article EN The Pediatric Infectious Disease Journal 2024-09-04
