A. Lee

ORCID: 0000-0001-8032-2170
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Speech Recognition and Synthesis
  • Speech and Audio Processing
  • Music and Audio Processing
  • Blind Source Separation Techniques
  • Speech and dialogue systems
  • Adolescent and Pediatric Healthcare
  • Mental Health and Patient Involvement
  • Robotics and Automated Systems
  • Homelessness and Social Issues
  • Natural Language Processing Techniques
  • AI in Service Interactions
  • Topic Modeling
  • Advanced Adaptive Filtering Techniques

University Medical Center Utrecht
2024

Nara Institute of Science and Technology
2002-2006

We propose a new algorithm for blind source separation (BSS), in which independent component analysis (ICA) and beamforming are combined to resolve the slow-convergence problem through optimization ICA. The proposed method consists of following three parts: (a) frequency-domain ICA with direction-of-arrival (DOA) estimation, (b) null based on estimated DOA, (c) integration diversity both iteration frequency domain. unmixing matrix obtained by is temporally substituted iterative optimization,...

10.1109/tsa.2005.855832 article EN IEEE Transactions on Audio Speech and Language Processing 2006-02-21

The Takemaru-kun system is a real world speech-oriented guidance located at the Ikoma-City North Community Center. has been operated daily from November, 2002, to provide visitors speech interface for information retrieval. This also aims field test of and collecting actual utterance data. By analyzing evaluating collected utterances, flexible processing requirements are discovered according user's age group. It becomes impossible disregard increase child users when installed in public...

10.1109/icassp.2004.1326015 article EN IEEE International Conference on Acoustics Speech and Signal Processing 2004-09-28

We address a method to efficiently select Gaussian mixtures for fast acoustic likelihood computation. It makes use of context-independent models selection and back-off corresponding triphone models. Specifically, the k-best phone by preliminary evaluation, higher resolution are applied, others assigned likelihoods with monophone This scheme assigns more reliable un-selected states than conventional based on VQ codebook. can also incorporate efficient pruning at which offsets increased size...

10.1109/icassp.2001.940769 article EN 2002-11-13

We implemented a humanoid robot, ASKA, in our university reception desk for the computerized guidance. ASKA can recognize user's question utterance, and answer by its text-to-speech voice, hand gesture head movement. This paper describes speech related parts of ASKA. deal with wide task domain 20k large vocabulary using word trigram model an elaborated speaker-independent acoustic model. also make response keyword key-phrase detection N-best recognition results. The rate is 90.9%,...

10.1109/irds.2002.1043936 article EN 2003-06-25

Confidence scoring based on word posterior probability is usually performed as a post process of speech recognition decoding, and also needs large number hypotheses to get enough confidence quality. We propose simple way computing the using estimated while decoding. At expansion stack decoding search, local sentence likelihoods that contain heuristic scores unreached segment are directly used compute probabilities. Experimental results showed that, although not optimal, we can provide...

10.1109/icassp.2004.1326105 article EN IEEE International Conference on Acoustics Speech and Signal Processing 2004-09-28

In previous works, we introduced a special device (Non-Audible Murmur (NATM) microphone) able to detect very quietly uttered speech (murmur), which cannot be heard by listeners near the talker. Experimental results showed efficiency of in NAM recognition. Using normal-speech monophone hidden Markov models (HMM) retrained with data from specific speaker, could recognize high accuracy. Although were promising, serious problem is HMM retraining, requires large amount training data. this paper,...

10.1109/asru.2003.1318406 article EN 2004-09-07

We propose a spatial subtraction array (SSA) and known noise superimposition to achieve noise-robust hands-free speech recognition which can be used in human-robot interaction. In the proposed SSA, reduction is achieved by subtracting estimated power spectrum from target enhanced mel-scale filter bank domain. This offers realization of error-robust spectral with few computational complexities. addition, we introduce technique domain, utilize matched acoustic model for noise. compensate...

10.1109/iros.2005.1545036 article EN 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems 2005-01-01

Patient and public involvement in research refers to patients or caregivers with disease experience contributing the design, conduct dissemination of results from research. has given rise new fields healthcare-oriented potential transform infectious diseases through interventional trials. Our recommendations best practices years organizing respiratory syncytial virus parent networks are provided.

10.1097/inf.0000000000004512 article EN The Pediatric Infectious Disease Journal 2024-09-04
Coming Soon ...