NFDI4DS | UHH-SEMS - Publication Details

Carlos Vaquero

ORCID: 0000-0002-4110-7504

Contact & Profiles

A5011739249

Research Areas

Speech Recognition and Synthesis
Speech and Audio Processing
Speech and dialogue systems
Music and Audio Processing
Phonetics and Phonology Research
Natural Language Processing Techniques
Voice and Speech Disorders
Linguistic Studies and Language Acquisition
User Authentication and Security Systems
Advanced Clustering Algorithms Research
Health and Medical Education
Language Development and Disorders

Universidad de Zaragoza
2007-2012

The RedDots data collection for speaker recognition

OPENALEX - Publications

Kong Aik Lee, Anthony Larcher, Guangsen Wang, Patrick Kenny, Niko Brümmer and 10 more

10.21437/interspeech.2015-95 article FR Interspeech 2015 2015-09-06

Tools and Technologies for Computer-Aided Speech and Language Therapy

Óscar Saz, Shou-Chun Yin, Eduardo Lleida, Richard C. Rose, Carlos Vaquero and 1 more

10.1016/j.specom.2009.04.006 article EN Speech Communication 2009-04-23

Unsupervised Domain Adaptation for I-Vector Speaker Recognition

Niko Brümmer, Alan McCree, Stephen Shum, Daniel Garcia-Romero, Carlos Vaquero

In this paper, we present a framework for unsupervised domain adaptation of PLDA-based i-vector speaker recognition systems. Given an existing out-of-domain system, we use it to cluster unlabeled in-domain data, and then use that data to adapt the parameters of the system. We explore two versions of agglomerative hierarchical clustering and also study automatic ways to determine the number of clusters in the dataset. The proposed techniques are experimentally validated on a recently introduced challenge. This challenge provides very...

10.21437/odyssey.2014-39 article EN 2014-06-16

E-inclusion technologies for the speech handicapped

Carlos Vaquero, Óscar Saz, Eduardo Lleida, William Ricardo Rodríguez Dueñas

This paper addresses the problem that disabled people face when accessing the new systems and technologies that are available nowadays. The use of speech technologies, especially helpful for motor handicapped people, becomes unapproachable when these people also suffer speech impairments, making the gap in society wider for them. As a way to include speech impaired people in today's technological society, two lines of work have been carried out. On one hand, computer-aided speech therapy software has been developed for the speech training of children with different disabilities. The tool,...

10.1109/icassp.2008.4518658 article EN Proceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing 2008-03-01

Factor analysis with sampling methods for text dependent speaker recognition

Antonio Miguel, Jesús Villalba, Alfonso Ortega, Eduardo Lleida, Carlos Vaquero

10.21437/interspeech.2014-332 article EN Interspeech 2014 2014-09-14

Analysis of Acoustic Features in Speakers with Cognitive Disorders and Speech Impairments

Óscar Saz, Javier Simón, William Ricardo Rodríguez Dueñas, Eduardo Lleida, Carlos Vaquero

This work presents the results of an analysis of acoustic features (formants and three suprasegmental features: tone, intensity and duration) in the vowel production of a group of 14 young speakers suffering from different kinds of speech impairments due to physical and cognitive disorders. A corpus of unimpaired children's speech is used to determine reference values for these features in speakers without any kind of speech impairment within the same domain as the impaired speakers; that is, 57 isolated words. The signal processing used to extract formants and pitch is based on Linear...

10.1155/2009/159234 article EN cc-by EURASIP Journal on Advances in Signal Processing 2009-05-26

A hybrid approach to online speaker diarization

Carlos Vaquero, Oriol Vinyals, Gerald Friedland

This article presents a low-latency speaker diarization system (“who is speaking now?”) based on a hybrid approach that combines a traditional offline speaker diarization system (“who spoke when?”) with an online speaker identification system. The system fulfills all requirements of the diarization task, i.e. it does not need any a-priori information about the input, including no specific speaker models. After an initialization phase, the approach allows a decision on the current speaker with an accuracy close to that of the underlying offline system. The article describes the approach, evaluates the robustness of the system, and analyzes the latency/accuracy...

10.21437/interspeech.2010-700 article EN Interspeech 2010 2010-09-26

Intra-session variability compensation and a hypothesis generation and selection strategy for speaker segmentation

Carlos Vaquero, Alfonso Ortega, Eduardo Lleida

This paper addresses the problem of speaker segmentation in two-speaker telephone conversations, using an eigenvoice-based factor analysis approach. We present a set of improvements to the system. First, we study two methods to compensate for intra-session variability, that is, the variability during a single session. Secondly, we propose a method to generate hypotheses that, combined with a given confidence measure, enables the selection of the correct one, improving the overall performance. The proposed improvements are evaluated on the NIST Speaker...

10.1109/icassp.2011.5947362 article EN 2011-05-01

Partitioning of two-speaker conversation datasets

Carlos Vaquero, Alfonso Ortega, Eduardo Lleida

10.21437/interspeech.2011-136 article EN Interspeech 2011 2011-08-27

Confidence measures for speaker segmentation and their relation to speaker verification

Carlos Vaquero, Alfonso Ortega, Jesús Villalba, Antonio Miguel, Eduardo Lleida

10.21437/interspeech.2010-633 article EN Interspeech 2010 2010-09-26

Quality Assessment for Speaker Diarization and Its Application in Speaker Characterization

Carlos Vaquero, Alfonso Ortega, Antonio Miguel, Eduardo Lleida

There are many applications related to speaker characterization, especially in telephone environments, where large datasets are available but not directly useful, since there are two speakers involved in every recording. Even with very accurate diarization systems, we can expect to find some recordings with low diarization accuracy. The use of these recordings may reduce the accuracy of any speaker characterization technology. Therefore, it is highly desirable to detect those recordings that are correctly segmented, in order to discard or process manually the remaining ones...

10.1109/tasl.2012.2236317 article EN IEEE Transactions on Audio Speech and Language Processing 2012-12-24

Hierarchical audio segmentation with HMM and factor analysis in broadcast news domain

Diego Castán, Carlos Vaquero, Alfonso Ortega, David Martínez, Jesús Villalba and 1 more

10.21437/interspeech.2011-165 article EN Interspeech 2011 2011-08-27

Analysis of the Impact of the Audio Database Characteristics in the Accuracy of a Speaker Clustering System

Jesús Jorrín Prieto, Carlos Vaquero, Leibny Paola Garcia

10.21437/odyssey.2016-57 article EN 2016-06-21

On the need of template protection for voice authentication

Carlos Vaquero, P. Rodriguez

10.21437/interspeech.2015-88 article EN Interspeech 2015 2015-09-06

An experience with a Spanish second language learning tool in a multilingual environment

Óscar Saz, Victoria Rodríguez, Eduardo Lleida, William Ricardo Rodríguez Dueñas, Carlos Vaquero

This paper presents the results of an experience with "Vocal-izaL2", an application for Second Language (L2) learning of Spanish, in a multilingual environment at the Vienna International School (VIS). For the experiment, a group of 6th-graders at the school practiced with the application during 5 sessions, alongside their regular classes. The results of the experiment show, on one hand, the great motivation power that computer-based L2 tools have for pronunciation training in young learners, while also being useful for teachers. On the technical aspect, the tool and...

10.21437/slate.2009-20 article EN 2009-09-03
