NFDI4DS | UHH-SEMS - Publication Details

Eva Navas

ORCID: 0000-0003-3804-4984

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5085186316

Research Areas

Speech Recognition and Synthesis
Speech and Audio Processing
Natural Language Processing Techniques
Music and Audio Processing
Phonetics and Phonology Research
Speech and dialogue systems
Basque language and culture studies
Spanish Linguistics and Language Studies
Voice and Speech Disorders
Emotion and Mood Recognition
Linguistic Studies and Language Acquisition
Advanced Data Compression Techniques
Subtitles and Audiovisual Media
Semantic Web and Ontologies
Blind Source Separation Techniques
Music Technology and Sound Studies
Journalism and Media Studies
Digital Filter Design and Implementation
Infant Health and Development
Radio, Podcasts, and Digital Media
Video Analysis and Summarization
Social Sciences and Policies
Tracheal and airway disorders
Topic Modeling
Advanced Adaptive Filtering Techniques

University of the Basque Country
2015-2025

Basque Center for Applied Mathematics
2023-2025

Iberdrola (Spain)
2009

Ente Vasco de la Energía
2004

Feature Analysis and Evaluation for Automatic Emotion Identification in Speech

OPENALEX - Publications

Iker Luengo Eva Navas Inma Hernáez

The definition of parameters is a crucial step in the development system for identifying emotions speech. Although there no agreement on which are best features this task, it generally accepted that prosody carries most emotional information. Most works field use some kind prosodic features, often combination with spectral and voice quality parametrizations. Nevertheless, systematic study has been done comparing these features. This paper presents analysis characteristics derived from...

10.1109/tmm.2010.2051872 article EN IEEE Transactions on Multimedia 2010-09-15

Harmonics Plus Noise Model Based Vocoder for Statistical Parametric Speech Synthesis

OPENALEX - Publications

Daniel Erro Iñaki Sainz Eva Navas Inma Hernáez

This article explores the potential of harmonics plus noise model speech in development a high-quality vocoder applicable statistical frameworks, particularly modern synthesizers. It presents an extensive explanation all different alternatives considered during design HNM-based vocoder, together with corresponding objective and subjective experiments, careful description its implementation details. Three aspects analysis have been investigated: refinement pitch estimation using...

10.1109/jstsp.2013.2283471 article EN IEEE Journal of Selected Topics in Signal Processing 2013-09-25

Automatic emotion recognition using prosodic parameters

OPENALEX - Publications

Iker Luengo Eva Navas Inma Hernáez Jon Sánchez

10.21437/interspeech.2005-324 article IT Interspeech 2022 2005-09-04

Toward a Universal Synthetic Speech Spoofing Detection Using Phase Information

OPENALEX - Publications

Jon Sánchez Ibon Saratxaga Inma Hernáez Eva Navas Daniel Erro and 1 more

In the field of speaker verification (SV) it is nowadays feasible and relatively easy to create a synthetic voice deceive speech driven biometric access system. This paper presents detector that can be connected at front-end or back-end standard SV system, will protect from spoofing attacks coming state-of-the-art statistical Text Speech (TTS) systems. The system described Gaussian Mixture Model (GMM) based binary classifier uses natural copy-synthesized signals obtained Wall Street Journal...

10.1109/tifs.2015.2398812 article EN IEEE Transactions on Information Forensics and Security 2015-02-02

Parametric Voice Conversion Based on Bilinear Frequency Warping Plus Amplitude Scaling

OPENALEX - Publications

Daniel Erro Eva Navas Inma Hernáez

Voice conversion methods based on frequency warping followed by amplitude scaling have been recently proposed. These modify the axis of source spectrum in such manner that some significant parts it, usually formants, are moved towards their image target speaker's spectrum. Amplitude is then applied to compensate for differences between warped spectra and spectra. This article presents a fully parametric formulation plus method which bilinear functions used. Introducing this constraint allows...

10.1109/tasl.2012.2227735 article EN IEEE Transactions on Audio Speech and Language Processing 2012-11-14

Improved HNM-based vocoder for statistical synthesizers

OPENALEX - Publications

Daniel Erro Iñaki Sainz Eva Navas Inma Hernáez

Statistical parametric synthesizers have achieved very good performance scores during the last years. Nevertheless, as they require use of vocoders to parameterize speech (during training) and reconstruct waveforms synthesis), generated from statistical models lacks some degree naturalness. In previous works we explored usefulness harmonics plus noise model in design a high-quality vocoder. Quite promising results were when this vocoder was integrated into synthesizer. paper, describe recent...

10.21437/interspeech.2011-35 article EN Interspeech 2022 2011-08-27

An objective and subjective study of the role of semantics and prosodic features in building corpora for emotional TTS

OPENALEX - Publications

Eva Navas Inma Hernáez Iker Luengo

Building a text corpus suitable to be used in corpus-based speech synthesis is time-consuming process that usually requires some human intervention select the desired phonetic content and necessary variety of prosodic contexts. If an emotional text-to-speech (TTS) system desired, complexity generation increases. This paper presents study aiming validate or reject use semantically neutral for recording both (acted) speech. The this kind texts would eliminate need include into corpus. has been...

10.1109/tasl.2006.876121 article EN IEEE Transactions on Audio Speech and Language Processing 2006-06-21

Synthetic speech detection using phase information

OPENALEX - Publications

Ibon Saratxaga Jon Sánchez Zhizheng Wu Inma Hernáez Eva Navas

10.1016/j.specom.2016.04.001 article EN Speech Communication 2016-04-16

Electrode Setup for Electromyography-Based Silent Speech Interfaces: A Pilot Study

OPENALEX - Publications

Inge Salomons Eder del Blanco Eva Navas Inma Hernáez

This paper describes a series of pilot experiments developed to define the electrode setup in order record novel parallel electromyography (EMG)–audio database. The main purpose database is provide data useful for development an EMG-based silent speech interface Spanish laryngectomized speakers. Motivated by scarcity information related studies regarding this important decision-making process, we decided carry out set with multiple recording sessions and different setups. We included types...

10.3390/s25030781 article EN cc-by Sensors 2025-01-28

Simple representation of signal phase for harmonic speech models

OPENALEX - Publications

Ibon Saratxaga Inma Hernáez Daniel Erro Eva Navas Jon Sánchez

A novel representation of the phase information in harmonic speech models is proposed. transformation from instantaneous phases to initial shift differences with respect fundamental frequency provides a clear insight into structure and largely simplifies manipulation this information.

10.1049/el.2009.3328 article EN Electronics Letters 2009-01-01

Perceptual importance of the phase related information in speech

OPENALEX - Publications

Ibon Saratxaga Inma Hernáez Michael Pucher Eva Navas Iñaki Sainz

The importance of phase information in the perceptual quality speech signals is studied this paper. Many synthesisers do not use original assuming their contribution almost inaudible. Relative Phase Shift (RPS) representation allows straightforward structure analysis, manipulation and resynthesis, we these features to a comparative evaluation some modifications usually found models. final intention study get an answer question whether phases deserve elaborate models high synthetic speech, or...

10.21437/interspeech.2012-411 article EN Interspeech 2022 2012-09-09

Evaluation of Pitch Detection Algorithms Under Real Conditions

OPENALEX - Publications

Iker Luengo Ibon Saratxaga Eva Navas Inma Hernáez Jon Sánchez and 1 more

A novel algorithm based on classical cepstrum calculation followed by dynamic programming is presented in this paper. The has been evaluated with a 60-minutes database containing 60 speakers and different recording conditions environments. second reference also used. In addition, the performance of four popular PDA algorithms same databases. results prove good described noisy conditions. Furthermore, paper first initiative to perform an evaluation widely used over extensive realistic database.

10.1109/icassp.2007.367255 article EN 2007-04-01

Emotion Conversion Based on Prosodic Unit Selection

OPENALEX - Publications

Daniel Erro Eva Navas Inma Hernáez Ibon Saratxaga

Voice conversion has been traditionally focused on spectrum. Current systems lack a solid prosody method suitable for different speaking styles. Recently, the unit selection technique applied to transform emotional intonation contours. This paper goes one step beyond: it explores strategies training and configuring cost function in an emotion application. The proposed system, which uses accent groups as basic units performs also phoneme durations intensity, is evaluated by means of carefully...

10.1109/tasl.2009.2038658 article EN IEEE Transactions on Audio Speech and Language Processing 2009-12-16

The AHOLAB RPS SSD spoofing challenge 2015 submission

OPENALEX - Publications

Jon Sánchez Ibon Saratxaga Inma Hernáez Eva Navas Daniel Erro

This paper introduces the Synthetic Speech Detection system developed by Aholab for Automatic Speaker Verification Spoofing and Countermeasures Challenge (ASVspoof 2015). The detector is a classifier based on Gaussian Mixture Models that are created using Relative Phase Shift (RPS) transformation phase information. Different strategies have been evaluated: modeling specific attacks information provided ASVspoof 2015 organizers, vocoders possibly used in spoofing signals, data from previous...

10.21437/interspeech.2015-463 article EN Interspeech 2022 2015-09-06

Use of harmonic phase information for polarity detection in speech signals

OPENALEX - Publications

Ibon Saratxaga Daniel Erro Inma Hernáez Iñaki Sainz Eva Navas

10.21437/interspeech.2009-30 article EN Interspeech 2022 2009-09-06

HNM-based MFCC+F0 extractor applied to statistical speech synthesis

OPENALEX - Publications

Daniel Erro Iñaki Sainz Eva Navas Inma Hernáez

Currently, the statistical framework based on Hidden Markov Models (HMMs) plays a relevant role in speech synthesis, while voice conversion systems Gaussian Mixture (GMMs) are almost standard. In both cases, modeling is applied to learn distributions of acoustic vectors extracted from signals, each vector containing suitable parametric representation one frame. The overall performance often limited by accuracy underlying parameterization and reconstruction method. method presented this paper...

10.1109/icassp.2011.5947411 article EN 2011-05-01

Albayzín-2014 evaluation: audio segmentation and classification in broadcast news domains

OPENALEX - Publications

Diego Castán David Tavárez Paula Lopez‐Otero Javier Franco-Pedroso Héctor Delgado and 6 more

Audio segmentation is important as a pre-processing task to improve the performance of many speech technology tasks and, therefore, it has an undoubted research interest. This paper describes database, metric, systems and results for Albayzín-2014 audio campaign. In contrast previous evaluations where was non-overlapping classes, evaluation proposes delimitation presence speech, music and/or noise that can be found simultaneously. The database used in created by fusing different media noises...

10.1186/s13636-015-0076-3 article EN cc-by EURASIP Journal on Audio Speech and Music Processing 2015-11-30

Combining spectral and prosodic information for emotion recognition in the interspeech 2009 emotion challenge

OPENALEX - Publications

Iker Luengo Eva Navas Inma Hernáez

10.21437/interspeech.2009-108 article EN Interspeech 2022 2009-09-06

NoisenseDB: An Urban Sound Event Database to Develop Neural Classification Systems for Noise-Monitoring Applications

OPENALEX - Publications

Itxasne Díez Ibon Saratxaga Unai Salegi Eva Navas Inma Hernáez

The use of continuous monitoring systems to control aspects such as noise pollution has grown in recent years. commercial used date only provide information on levels but do not identify the sources that generate them. identification is an important aspect order apply corrective measures mitigate levels. In this sense, new technological advances like machine listening can enable addition other capabilities sound detection and classification sources. Despite increasing development these...

10.3390/app13169358 article EN cc-by Applied Sciences 2023-08-17

Personalized synthetic voices for speaking impaired: website and app

OPENALEX - Publications

Daniel Erro Inma Hernáez Agustín Alonso D. García-Lorenzo Eva Navas and 9 more

10.21437/interspeech.2015-314 article EN Interspeech 2022 2015-09-06

Exploring Fusion Methods and Feature Space for the Classification of Paralinguistic Information

OPENALEX - Publications

David Tavárez Xabier Sarasola Agustín Alonso Jon Sánchez Luís Serrano and 2 more

10.21437/interspeech.2017-1378 article EN Interspeech 2022 2017-08-16

A cross-vocoder study of speaker independent synthetic speech detection using phase information

OPENALEX - Publications

Jon Sánchez Ibon Saratxaga Inma Hernáez Eva Navas Daniel Erro

10.21437/interspeech.2014-393 article EN Interspeech 2022 2014-09-14

Coming Soon ...