Carlos Vaquero

ORCID: 0000-0002-4110-7504
Research Areas
  • Speech Recognition and Synthesis
  • Speech and Audio Processing
  • Speech and dialogue systems
  • Music and Audio Processing
  • Phonetics and Phonology Research
  • Natural Language Processing Techniques
  • Voice and Speech Disorders
  • Linguistic Studies and Language Acquisition
  • User Authentication and Security Systems
  • Advanced Clustering Algorithms Research
  • Health and Medical Education
  • Language Development and Disorders

Universidad de Zaragoza
2007-2012

10.21437/interspeech.2015-95 article FR Interspeech 2015 2015-09-06

In this paper, we present a framework for unsupervised domain adaptation of PLDA-based i-vector speaker recognition systems. Given an existing out-of-domain system, we use it to cluster unlabeled in-domain data, and then use this data to adapt the parameters of the system. We explore two versions of agglomerative hierarchical clustering and also study automatic ways to determine the number of clusters in the dataset. The proposed techniques are experimentally validated on a recently introduced challenge. This challenge provides very...

10.21437/odyssey.2014-39 article EN 2014-06-16

This paper addresses the problem that disabled people face when accessing the new systems and technologies that are available nowadays. The use of speech technologies, especially helpful for motor-handicapped people, becomes unapproachable when these people also suffer speech impairments, making the gap in society wider for them. As a way to include speech-impaired people in today's technological society, two lines of work have been carried out. On the one hand, computer-aided speech therapy software has been developed for the speech training of children with different disabilities. This tool,...

10.1109/icassp.2008.4518658 article EN Proceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing 2008-03-01

This work presents the results of the analysis of the acoustic features (formants and the three suprasegmental features: tone, intensity and duration) of vowel production in a group of 14 young speakers suffering different kinds of speech impairments due to physical and cognitive disorders. A corpus of unimpaired children's speech is used to determine reference values for these features in speakers without any kind of speech impairment, within the same domain as the impaired speakers, namely 57 isolated words. The signal processing used to extract the formant and pitch values is based on Linear...

10.1155/2009/159234 article EN cc-by EURASIP Journal on Advances in Signal Processing 2009-05-26

This article presents a low-latency speaker diarization system (“who is speaking now?”) based on a hybrid approach that combines a traditional offline speaker diarization system (“who spoke when?”) with an online speaker identification system. The system fulfills all requirements of the diarization task, i.e. it does not need any a-priori information about the input, including no specific speaker models. After an initialization phase, the approach allows a low-latency decision on the current speaker with an accuracy close to that of the underlying offline diarization system. The article describes the approach, evaluates the robustness of the system, and analyzes the latency/accuracy...

10.21437/interspeech.2010-700 article EN Interspeech 2010 2010-09-26

This paper addresses the problem of speaker segmentation in two-speaker telephone conversations, using an eigenvoice-based factor analysis approach. We present a set of improvements to the system. First, we study two methods to compensate for intra-session variability, that is, the variability during a single session. Secondly, we propose a method to generate segmentation hypotheses combined with a given confidence measure, which enables the selection of the correct hypothesis, improving the overall performance. The proposed improvements are evaluated on the NIST Speaker...

10.1109/icassp.2011.5947362 article EN 2011-05-01

There are many applications related to speaker characterization, especially in telephone environments, where large datasets are available but not directly useful since there are two speakers involved in every recording. Even with very accurate diarization systems, we can expect to find some recordings with low diarization accuracy. The use of these recordings may reduce the accuracy of any speaker characterization technology. Therefore, it is highly desirable to detect those recordings that are correctly segmented, in order to discard or process manually the remaining ones...

10.1109/tasl.2012.2236317 article EN IEEE Transactions on Audio Speech and Language Processing 2012-12-24

This paper presents the results of an experience with "Vocal-izaL2", an application for Second Language (L2) learning of Spanish, in a multilingual environment at the Vienna International School (VIS). For the experiment, a group of 6th-graders at the school practiced with the application during 5 sessions alongside their regular classes. The results of the experiment show, on the one hand, the great motivational power that computer-based L2 tools have for the pronunciation training of young learners, while also proving useful for teachers. On the technical aspect, the tool and...

10.21437/slate.2009-20 article EN 2009-09-03