NFDI4DS | UHH-SEMS - Publication Details

Martti Vainio

ORCID: 0000-0003-2570-0196

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5040474925

Research Areas

Phonetics and Phonology Research
Speech Recognition and Synthesis
Speech and Audio Processing
Natural Language Processing Techniques
Speech and dialogue systems
Multisensory perception and integration
Hearing Impairment and Communication
Neuroscience and Music Perception
Hearing Loss and Rehabilitation
Action Observation and Synchronization
Language, Metaphor, and Cognition
Music and Audio Processing
Linguistic Variation and Morphology
Voice and Speech Disorders
Linguistics and language evolution
Acoustic Wave Phenomena Research
Neurobiology of Language and Bilingualism
Motor Control and Adaptation
Syntax, Semantics, Linguistic Variation
Linguistic research and analysis
Autism Spectrum Disorder Research
Linguistics, Language Diversity, and Identity
Music Technology and Sound Studies
Categorization, perception, and language
Research in Social Sciences

University of Helsinki
2016-2025

Digital Science (United States)
2019

University of Turku
2007

Stockholm South General Hospital
1994

Language-specific phoneme representations revealed by electric and magnetic brain responses

OPENALEX - Publications

Risto Näätänen Anne Lehtokoski Mietta Lennes Marie Cheour Minna Huotilainen and 8 more

10.1038/385432a0 article EN Nature 1997-01-01

Pre-attentive detection of vowel contrasts utilizes both phonetic and auditory memory representations

OPENALEX - Publications

István Winkler Anne Lehtokoski Paavo Alku Martti Vainio István Czigler and 7 more

10.1016/s0926-6410(98)00039-1 article EN Cognitive Brain Research 1999-01-01

HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering

OPENALEX - Publications

Tuomo Raitio Antti Suni Jun‐ichi Yamagishi Hannu Pulakka Jani Nurminen and 2 more

This paper describes an hidden Markov model (HMM)-based speech synthesizer that utilizes glottal inverse filtering for generating natural sounding synthetic speech. In the proposed method, is first decomposed into source signal and of vocal tract filter through filtering, thus parametrized excitation spectral features. The features are modeled individually in framework HMM generated synthesis stage according to text input. synthesized interpolating concatenating flow pulses, further modified...

10.1109/tasl.2010.2045239 article EN IEEE Transactions on Audio Speech and Language Processing 2010-03-12

Music and speech prosody: a common rhythm

OPENALEX - Publications

Maija Hausen Ritva Torppa Viljami Salmela Martti Vainio Teppo Särkämö

Disorders of music and speech perception, known as amusia aphasia, have traditionally been regarded dissociated deficits based on studies brain damaged patients. This has taken evidence that are perceived by largely separate independent networks in the brain. However, recent congenital broadened this view showing deficit is associated with problems perceiving prosody, especially intonation emotional prosody. In present study association between perception prosody was investigated healthy...

10.3389/fpsyg.2013.00566 article EN cc-by Frontiers in Psychology 2013-01-01

The perception of prosody and associated auditory cues in early-implanted children: The role of auditory working memory and musical activities

OPENALEX - Publications

Ritva Torppa Andrew Faulkner Minna Huotilainen Juhani Järvikivi Jari Lipsanen and 2 more

Objective: To study prosodic perception in early-implanted children relation to auditory discrimination, working memory, and exposure music. Design: Word sentence stress perception, discrimination of fundamental frequency (F0), intensity duration, forward digit span were measured twice over approximately 16 months. Musical activities assessed by questionnaire. Study sample: Twenty-one age-matched normal-hearing (NH) (4–13 years). Results: Children with cochlear implants (CIs) exposed music...

10.3109/14992027.2013.872302 article EN International Journal of Audiology 2014-01-27

Formant frequency estimation of high-pitched vowels using weighted linear prediction

OPENALEX - Publications

Paavo Alku Jouni Pohjalainen Martti Vainio Anne-Maria Laukkanen Brad H. Story

All-pole modeling is a widely used formant estimation method, but its performance known to deteriorate for high-pitched voices. In order address this problem, several all-pole methods robust fundamental frequency have been proposed. This study compares five such previously and introduces technique, Weighted Linear Prediction with Attenuated Main Excitation (WLP-AME). WLP-AME utilizes temporally weighted linear prediction (LP) in which the square of error multiplied by given parametric...

10.1121/1.4812756 article EN The Journal of the Acoustical Society of America 2013-08-01

Rapid and automatic speech-specific learning mechanism in human neocortex

OPENALEX - Publications

Lilli Kimppa Teija Kujala Alina Leminen Martti Vainio Yury Shtyrov

A unique feature of human communication system is our ability to rapidly acquire new words and build large vocabularies. However, its neurobiological foundations remain largely unknown. In an electrophysiological study optimally designed probe this rapid formation word memory circuits, we employed acoustically controlled novel word-forms incorporating native non-native speech sounds, while manipulating the subjects' attention on input. We found a robust index neurolexical memory-trace...

10.1016/j.neuroimage.2015.05.098 article EN cc-by-nc-nd NeuroImage 2015-06-13

Effect of Syllable Articulation on Precision and Power Grip Performance

OPENALEX - Publications

Lari Vainio Mirjam Schulman Kaisa Tiippana Martti Vainio

The present study was motivated by a theory, which proposes that speech includes articulatory gestures are connected to particular hand actions. We hypothesized certain would be more associated with the precision grip than power grip, and vice versa. In study, participants pronounced syllable performed simultaneously or theorized either congruent incongruent syllable. Relatively fast responses were in tip of tongue contacted alveolar ridge ([te]) aperture vocal tract remained small ([hi]),...

10.1371/journal.pone.0053061 article EN cc-by PLoS ONE 2013-01-09

Hierarchical representation and estimation of prosody using continuous wavelet transform

OPENALEX - Publications

Antti Suni Juraj Šimko Daniel Aalto Martti Vainio

10.1016/j.csl.2016.11.001 article EN Computer Speech & Language 2016-11-12

Selective tuning of cortical sound‐feature processing by language experience

OPENALEX - Publications

Mari Tervaniemi Thomas Jacobsen Stefan Röttger Teija Kujala Andreas Widmann and 3 more

Abstract In ‘quantity‐languages’, such as Japanese or Finnish, sound duration is linguistically relevant. We showed that quantity‐language speakers were superior to of a non‐quantity language in discriminating the even non‐speech sounds. contrast, there was no group difference discrimination frequency. This result, obtained both by behavioural and neural indices at attentive automatic levels processing, indicates precise feature‐specific tuning auditory‐cortex functions mother tongue.

10.1111/j.1460-9568.2006.04752.x article EN European Journal of Neuroscience 2006-05-01

Modulation of the mismatch negativity (MMN) to vowel duration changes in native speakers of Finnish and German as a result of language experience

OPENALEX - Publications

Ursula Kirmse Sari Ylinen Mari Tervaniemi Martti Vainio Erich Schröger and 1 more

10.1016/j.ijpsycho.2007.10.012 article EN International Journal of Psychophysiology 2007-11-13

Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis

OPENALEX - Publications

Tuomo Raitio Antti Suni Hannu Pulakka Martti Vainio Paavo Alku

This paper describes a source modeling method for hidden Markov model (HMM) based speech synthesis improved naturalness. A corpus is first decomposed into the glottal signal and of vocal tract filter using inverse filtering, parametrized excitation spectral features. Additionally, library pulses extracted from estimated voice signal. In stage, generated by selecting appropriate according to target cost features concatenation between adjacent pulses. Finally, synthesized filtering filter....

10.1109/icassp.2011.5947370 article EN 2011-05-01

Large scale data acquisition of simultaneous MRI and speech

OPENALEX - Publications

Daniel Aalto Olli Aaltonen Risto-Pekka Happonen Päivi Jääsaari Atle Kivelä and 8 more

10.1016/j.apacoust.2014.03.003 article EN Applied Acoustics 2014-04-12

Pitch-based correspondences related to abstract concepts

OPENALEX - Publications

Lari Vainio Alexandra Wikström Martti Vainio

10.1016/j.actpsy.2025.104754 article EN Acta Psychologica 2025-01-24

Hyperarticulation in Lombard speech: Global coordination of the jaw, lips and the tongue

OPENALEX - Publications

Juraj Šimko Štefan Beňuš Martti Vainio

Over the last century, researchers have collected a considerable amount of data reflecting properties Lombard speech, i.e., speech in noisy environment. The documented phenomena predominately report effects on signal produced ambient noise. In comparison, relatively little is known about underlying articulatory patterns particular for lingual articulation. Here authors present an analysis recordings material babble noise different intensity levels and hypoarticulated quantitative differences...

10.1121/1.4939495 article EN The Journal of the Acoustical Society of America 2016-01-01

Tonal features, intensity, and word order in the perception of prominence

OPENALEX - Publications

Martti Vainio Juhani Järvikivi

10.1016/j.wocn.2005.06.004 article EN Journal of Phonetics 2005-09-20

Analysis of HMM-based lombard speech synthesis

OPENALEX - Publications

Tuomo Raitio Antti Suni Martti Vainio Paavo Alku

Humans modify their voice in interfering noise order to maintain the intelligibility of speech – this is called Lombard effect. This ability, however, has not been extensively modeled synthesis. Here we compare several methods synthesizing using a physiologically based statistical synthesis system (GlottHMM). The results show that realistic street situation synthetic judged by listeners both as appropriate for and intelligible natural speech. Of different types models, one adaptation...

10.21437/interspeech.2011-696 article EN Interspeech 2022 2011-08-27

Interaction in planning movement direction for articulatory gestures and manual actions

OPENALEX - Publications

Lari Vainio Mikko Tiainen Kaisa Tiippana Naeem Komeilipoor Martti Vainio

10.1007/s00221-015-4365-y article EN Experimental Brain Research 2015-06-30

Evaluation of an Artificial Speech Bandwidth Extension Method in Three Languages

OPENALEX - Publications

Hannu Pulakka Laura Laaksonen Martti Vainio Jouni Pohjalainen Paavo Alku

Quality and intelligibility of narrowband telephone speech can be improved by artificial bandwidth extension (ABE), which extends the using only information available in signal. This paper reports a three-language evaluation an ABE method that has recently been launched several Nokia's mobile models. The to frequencies above band first utilizing spectral folding then modifying magnitude spectrum with spline curves. performance was evaluated formal listening tests American English, Russian,...

10.1109/tasl.2008.925149 article EN IEEE Transactions on Audio Speech and Language Processing 2008-07-24

Coming Soon ...