NFDI4DS | UHH-SEMS - Publication Details

Amparo Varona

ORCID: 0000-0003-0255-8991

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5057476491

Research Areas

Speech Recognition and Synthesis
Music and Audio Processing
Speech and Audio Processing
Natural Language Processing Techniques
Speech and dialogue systems
Advanced Data Compression Techniques
Neuropeptides and Animal Physiology
Phonetics and Phonology Research
Machine Learning and Algorithms
Subtitles and Audiovisual Media
Topic Modeling
Translation Studies and Practices
Receptor Mechanisms and Signaling
Peptidase Inhibition and Analysis
Algorithms and Data Compression
Radio, Podcasts, and Digital Media
Experimental Learning in Engineering
Language, Linguistics, Cultural Analysis
Interpreting and Communication in Healthcare
Engineering Education and Technology
Neuroscience of respiration and sleep
Data Mining Algorithms and Applications
Protein Hydrolysis and Bioactive Peptides
semigroups and automata theory
Adenosine and Purinergic Signaling

Hospital Riotinto
2025

University of the Basque Country
2009-2024

Software (Spain)
2015

Universidad Politécnica de Madrid
2013

Universitat Politècnica de València
1995

High-performance Query-by-Example Spoken Term Detection on the SWS 2013 evaluation

OPENALEX - Publications

Luis Javier Rodríguez-Fuentes Amparo Varona Mikel Peñagarikano Germán Bordel Mireia Díez

In the last years, task of Query-by-Example Spoken Term Detection (QbE-STD), which aims to find occurrences a spoken query in set audio documents, has gained interest research community for its versatility settings where untranscribed, multilingual and acoustically unconstrained resources, or resources low-resource languages, must be searched. This paper describes reports experimental results QbE-STD system that achieved best performance recent Web Search (SWS) evaluation, held as part...

10.1109/icassp.2014.6855122 article EN 2014-05-01

The 2013 speaker recognition evaluation in mobile environment

OPENALEX - Publications

E. Khoury Boštjan Vesnicer Javier Franco-Pedroso Ricardo P. V. Violato Z. Boulkcnafet and 33 more

This paper evaluates the performance of twelve primary systems submitted to evaluation on speaker verification in context a mobile environment using MOBIO database. The provides challenging and realistic test-bed for current state-of-the-art techniques. Results terms equal error rate (EER), half total (HTER) detection trade-off (DET) confirm that best performing are based variability modeling, fusion several sub-systems. Nevertheless, good old UBM-GMM still competitive. results also show use...

10.1109/icb.2013.6613025 preprint EN 2013-06-01

Neonatal Screening for Spinal Muscular Atrophy and Severe T- and B-Cell Lymphopenias in Andalusia: A Prospective Study

OPENALEX - Publications

Beatriz de Felipe Carmen Delgado‐Pecellín Mercedes López-Lobato Peter Olbrich Pilar Blanco Lobo and 12 more

Spinal muscular atrophy (SMA) and severe T- and/or B-cell lymphopenias (STBCL) in the form of combined immunodeficiencies (SCID) or X-linked agammaglobulinemia (XLA) are rare but potentially fatal pathologies. In January 2021, we initiated first pilot study Spain to evaluate efficacy a very early detection technique for SMA SCID. RT–PCR was performed on prospectively collected dried blood spots (DBSs) from newborns Western Andalusia (Spain). Internal external controls (SCID, XLA SMA) were...

10.3390/ijns11010011 article EN cc-by International Journal of Neonatal Screening 2025-01-30

On the use of phone log-likelihood ratios as features in spoken language recognition

OPENALEX - Publications

Mireia Díez Amparo Varona Mikel Peñagarikano Luis Javier Rodríguez-Fuentes Germán Bordel

This paper presents an alternative feature set to the traditional MFCC-SDC used in acoustic approaches Spoken Language Recognition: log-likelihood ratios of phone posterior probabilities, hereafter Phone Log-Likelihood Ratios (PLLR), produced by a recognizer. In this work, iVector system trained on features (plus dynamic coefficients) is evaluated and compared (1) (trained set) (2) phonotactic (Phone-lattice-SVM) system, using two different benchmarks: NIST 2007 2009 LRE datasets. systems...

10.1109/slt.2012.6424235 article EN 2022 IEEE Spoken Language Technology Workshop (SLT) 2012-12-01

On the calibration and fusion of heterogeneous spoken term detection systems

OPENALEX - Publications

Alberto Abad Luis Javier Rodríguez-Fuentes Mikel Peñagarikano Amparo Varona Germán Bordel

The combination of several heterogeneous systems is known to provide remarkable performance improvements in verification and detection tasks. In Spoken Term Detection (STD), two important issues arise: (1) how define a common set detected candidates, (2) combine system scores produce single score per candidate. this paper, discriminative calibration/fusion approach commonly applied speaker language recognition adopted for STD. Under approach, we first propose heuristics hypothesize that do...

10.21437/interspeech.2013-5 article EN Interspeech 2022 2013-08-25

The Albayzin 2010 language recognition evaluation

OPENALEX - Publications

Luis Javier Rodríguez-Fuentes Mikel Peñagarikano Amparo Varona Mireia Díez Germán Bordel

The Albayzin 2008 Language Recognition Evaluation was held from May to October 2008, and their results presented discussed among the participating teams at 5th Biennial Workshop on Speech Technology [1], organized by Spanish Network Technologies [2] in November 2008.In this paper, we present (for first time) a full description of LRE analyze discuss recognition results.The evaluation designed according test procedures, protocols performance measures used NIST 2007 LRE.The KALAKA database...

10.21437/interspeech.2011-322 article EN Interspeech 2022 2011-08-27

A simple and efficient method to align very long speech signals to acoustically imperfect transcriptions

OPENALEX - Publications

Germán Bordel Mikel Peñagarikano Luis Javier Rodríguez-Fuentes Amparo Varona

In the framework of a contract with Basque Parliament for subtitling videos bilingual plenary sessions, which basically consisted aligning very long (around 3 hours long) audio tracks syntactically correct but acoustically inaccurate text transcriptions (since all disfluencies, mistakes, etc. were edited), simple and efficient procedure (avoiding need language nor lexical models, was key because mix languages) developed as first approach, before trying more complex schemes found in...

10.21437/interspeech.2012-402 article EN Interspeech 2022 2012-09-09

k-TSS language models in speech recognition systems

OPENALEX - Publications

M. Inés Torres Amparo Varona

10.1006/csla.2001.0162 article EN Computer Speech & Language 2001-04-01

Study of different backends in a state-of-the-art language recognition system

OPENALEX - Publications

Mikel Peñagarikano Amparo Varona Mireia Díez Luis Javier Rodríguez-Fuentes Germán Bordel

State of the art language recognition systems usually add a backend prior to linear fusion subsystems scores. The plays dual role. When set languages for which models have been trained does not match target languages, maps available scores space languages. On other hand, serves as precalibration stage that adapts In this work, well known backends (Generative Gaussian Backend, Discriminative Backend and Logistic Regression Backend) newer proposals (Fully Bayesian Mixture are analyzed...

10.21437/interspeech.2012-547 article EN Interspeech 2022 2012-09-09

The albayzin 2012 language recognition evaluation

OPENALEX - Publications

Luis Javier Rodríguez-Fuentes Niko Brümmer Mikel Peñagarikano Amparo Varona Germán Bordel and 1 more

The Albayzin 2012 Language Recognition Evaluation (LRE), carried out from June to October 2012, was the third effort made by Spanish/Portuguese community for benchmarking lan- guage recognition technology. As in previous 2008 and 2010 evaluations, task consisted on deciding whether or not a target language spoken test utterance. pri- mary condition involved 6 languages which there plenty of training data: English, Portuguese four offi- cial Spain (Basque, Catalan, Galician Span- ish). A new...

10.21437/interspeech.2013-387 article EN Interspeech 2022 2013-08-25

Dimensionality reduction of phone log-likelihood ratio features for spoken language recognition

OPENALEX - Publications

Mireia Díez Amparo Varona Mikel Peñagarikano Luis Javier Rodríguez-Fuentes Germán Bordel

In a previous work, we introduced the use of log-likelihood ratios phone posterior probabilities, called Phone LogLikelihood Ratios (PLLR) as features for language recognition under an iVector-based approach, yielding high performance and promising results. However, dimensionality PLLR feature vectors (with regard to MFCC/SDC features) results in comparatively higher computational costs. this several supervised unsupervised reduction techniques are studied, based on either fusions or...

10.21437/interspeech.2013-39 article EN Interspeech 2022 2013-08-25

On the Projection of PLLRs for Unbounded Feature Distributions in Spoken Language Recognition

OPENALEX - Publications

Mireia Díez Amparo Varona Mikel Peñagarikano Luis Javier Rodríguez-Fuentes Germán Bordel

The so called Phone Log-Likelihood Ratio (PLLR) features have been recently introduced as a novel and effective way of retrieving acoustic-phonetic information in spoken language speaker recognition systems. In this letter, an in-depth insight into the PLLR feature space is provided multidimensional distribution these analyzed system. study reveals that are confined subspace strongly bounds distributions. To enhance retrieved by system, projected hyper-plane provides more suitable...

10.1109/lsp.2014.2324819 article EN IEEE Signal Processing Letters 2014-05-16

Query-by-Example Spoken Term Detection ALBAYZIN 2012 evaluation: overview, systems, results, and discussion

OPENALEX - Publications

Javier Tejedor Doroteo T. Toledano Xavier Anguera Amparo Varona Lluís F. Hurtado and 2 more

Query-by-Example Spoken Term Detection (QbE STD) aims at retrieving data from a speech repository given an acoustic query containing the term of interest as input. Nowadays, it has been receiving much due to high volume information stored in audio or audiovisual format. QbE STD differs automatic recognition (ASR) and keyword spotting (KWS)/spoken detection (STD) since ASR is interested all terms/words that appear signal KWS/STD relies on textual transcription search retrieve data. This paper...

10.1186/1687-4722-2013-23 article EN cc-by EURASIP Journal on Audio Speech and Music Processing 2013-09-17

KALAKA-3: a database for the assessment of spoken language recognition technology on YouTube audios

OPENALEX - Publications

Luis Javier Rodríguez-Fuentes Mikel Peñagarikano Amparo Varona Mireia Díez Germán Bordel

10.1007/s10579-015-9324-5 article EN Language Resources and Evaluation 2015-12-19

Probabilistic Kernels for Improved Text-to-Speech Alignment in Long Audio Tracks

OPENALEX - Publications

Germán Bordel Mikel Peñagarikano Luis Javier Rodríguez-Fuentes Aitor Álvarez Amparo Varona

The synchronization of text transcripts with audio tracks is typically solved by forced alignment at the phonetic level. However, when dealing either very long or acoustically inaccurate transcripts, more complex methods are needed, usually based on heavy and costly ASR systems. In a previous work, we showed that simple lightweight method could be effectively applied, free decoding speech signal reference sequences, allowing transfer timestamps from former to latter. This has yielded...

10.1109/lsp.2015.2505140 article EN IEEE Signal Processing Letters 2015-12-03

Improved Modeling of Cross-Decoder Phone Co-Occurrences in SVM-Based Phonotactic Language Recognition

OPENALEX - Publications

Mikel Peñagarikano Amparo Varona Luis Javier Rodríguez-Fuentes Germán Bordel

Most common approaches to phonotactic language recognition deal with several independent phone decodings. These decodings are processed and scored in a fully uncoupled way, their time alignment (and the information that may be extracted from it) being completely lost. Recently, we have presented two new which take into account information, by considering time-synchronous cross-decoder co-occurrences. Experiments on 2007 NIST LRE database demonstrated using co-occurrence statistics could...

10.1109/tasl.2011.2134088 article EN IEEE Transactions on Audio Speech and Language Processing 2011-04-06

A Bilingual Basque–Spanish Dataset of Parliamentary Sessions for the Development and Evaluation of Speech Technology

OPENALEX - Publications

Amparo Varona Mikel Peñagarikano Germán Bordel Luis Javier Rodríguez-Fuentes

The development of speech technology requires large amounts data to estimate the underlying models. Even when relying on multilingual pre-trained models, some amount task-specific target language is needed fine-tune those models and obtain competitive performance. In this paper, we present a bilingual Basque–Spanish dataset extracted from parliamentary sessions. designed develop evaluate automatic recognition (ASR) systems but can be easily repurposed for other speech-processing tasks (such...

10.3390/app14051951 article EN cc-by Applied Sciences 2024-02-27

Using phone log-likelihood ratios as features for speaker recognition

OPENALEX - Publications

Mireia Díez Amparo Varona Mikel Peñagarikano Luis Javier Rodríguez-Fuentes Germán Bordel

The so called Phone Log-Likelihood Ratio (PLLR) features, computed on phone posterior probabilities provided by phonetic decoders, convey acoustic-phonetic information in a sequence of frame-level vectors. Thus, PLLRs can be easily plugged into traditional acoustic systems just replacing MFCCs, PLPs or whatever other representation. PLLR features were used under an iVector-PLDA approach our submission to the NIST 2012 Speaker Recognition Evaluation (SRE). In this work, we present report...

10.21437/interspeech.2013-419 article EN Interspeech 2022 2013-08-25

Distribution of peptidase activity in teleost and rat tissues

OPENALEX - Publications

Naiara Agirregoitia Raúl Laiz‐Carrión Amparo Varona M.P. Martı́n del Rı́o Juan Miguel Mancera and 1 more

10.1007/s00360-005-0011-5 article EN Journal of Comparative Physiology B 2005-07-25

Automatic subtitling of the basque parliament plenary sessions videos

OPENALEX - Publications

Germán Bordel Silvia Nieto Mikel Peñagarikano Luis Javier Rodríguez-Fuentes Amparo Varona

Subtitling of video contents offered in the web by Spanish administration agencies is required law for allowing people with hearing impairments to follow them.The automatic bilingual subtitling system described this paper has been applied on plenary sessions videos that Basque Parliament posts its (http://www.parlamentovasco.euskolegebiltzarra.org/), and running from September 2010.A specific characteristic use a simple phonetic decoder based joint selection phone models, since it not...

10.21437/interspeech.2011-483 article EN Interspeech 2022 2011-08-27

Multi-site heterogeneous system fusions for the Albayzin 2010 Language Recognition Evaluation

OPENALEX - Publications

Luis Javier Rodríguez-Fuentes Mikel Peñagarikano Amparo Varona Mireia Díez Germán Bordel and 16 more

Best language recognition performance is commonly obtained by fusing the scores of several heterogeneous systems. Regardless fusion approach, it assumed that different systems may contribute complementary information, either because they are developed on datasets, or use features modeling approaches. Most authors apply as a final resource for improving based an existing set Though relative gains decrease larger sets considered, best usually attained all available systems, which lead to high...

10.1109/asru.2011.6163961 article EN 2011-12-01

Coming Soon ...