NFDI4DS | UHH-SEMS - Publication Details

Manu Airaksinen

ORCID: 0000-0002-8031-2260

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5013722514

Research Areas

Speech Recognition and Synthesis
Speech and Audio Processing
Voice and Speech Disorders
Phonetics and Phonology Research
Infant Development and Preterm Care
Infant Health and Development
Language Development and Disorders
Music and Audio Processing
Neonatal and fetal brain pathology
Advanced Data Compression Techniques
EEG and Brain-Computer Interfaces
Time Series Analysis and Forecasting
Neuroscience of respiration and sleep
Cerebral Palsy and Movement Disorders
Phonocardiography and Auscultation Techniques
Natural Language Processing Techniques
Neural Networks and Applications
Children's Physical and Motor Development
Obstructive Sleep Apnea Research
Child Nutrition and Feeding Issues
Child and Animal Learning Development
Child Development and Digital Technology
Machine Learning in Healthcare
Context-Aware Activity Recognition Systems
Machine Learning and Data Classification

University of Helsinki
2019-2025

Helsinki University Hospital
2019-2025

Aalto University
2013-2024

Early gross motor performance is associated with concurrent prelinguistic and social development

OPENALEX - Publications

Anastasia Gallen Elisa Taylor Juha Salmi Leena Haataja Sampsa Vanhatalo and 1 more

To study how early gross motor development links to concurrent prelinguistic and social development. We recruited a population-based longitudinal sample of 107 infants between 6 21 months age. Gross performance was quantified using novel wearable technology for at-home recordings infants' spontaneous activity. The assessed in parallel with standardized parental questionnaire (Infant Toddler Checklist). developmental trajectories motor, prelinguistic, were inspected longitudinally at...

10.1038/s41390-025-03832-5 article EN cc-by Pediatric Research 2025-01-17

Quasi Closed Phase Glottal Inverse Filtering Analysis With Weighted Linear Prediction

OPENALEX - Publications

Manu Airaksinen Tuomo Raitio Brad H. Story Paavo Alku

This study presents a new glottal inverse filtering (GIF) technique based on closed phase analysis over multiple fundamental periods. The proposed quasi (QCP) method utilizes weighted linear prediction (WLP) with specific attenuated main excitation (AME) weight function that attenuates the contribution of source in model optimization. enables use autocorrelation criterion contrast to covariance used conventional analysis. QCP was compared previously developed methods by using synthetic...

10.1109/taslp.2013.2294585 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2014-01-31

Automatic Posture and Movement Tracking of Infants with Wearable Movement Sensors

OPENALEX - Publications

Manu Airaksinen Okko Räsänen Elina Ilén Taru Häyrinen Anna Kivi and 7 more

Abstract Infants’ spontaneous and voluntary movements mirror developmental integrity of brain networks since they require coordinated activation multiple sites in the central nervous system. Accordingly, early detection infants with atypical motor development holds promise for recognizing those who are at risk a wide range neurodevelopmental disorders (e.g., cerebral palsy, autism spectrum disorders). Previously, novel wearable technology has shown offering efficient, scalable automated...

10.1038/s41598-019-56862-5 article EN cc-by Scientific Reports 2020-01-13

Intelligent wearable allows out-of-the-lab tracking of developing motor abilities in infants

OPENALEX - Publications

Manu Airaksinen Anastasia Gallen Anna Kivi Pavithra Vijayakrishnan Taru Häyrinen and 4 more

Abstract Background Early neurodevelopmental care needs better, effective and objective solutions for assessing infants’ motor abilities. Novel wearable technology opens possibilities characterizing spontaneous movement behavior. This work seeks to construct validate a generalizable, scalable, method measure abilities across all milestones from lying supine fluent walking. Methods A multi-sensor infant was constructed, 59 infants (age 5–19 months) were recorded during their play. novel gross...

10.1038/s43856-022-00131-6 article EN cc-by Communications Medicine 2022-06-15

Speech Waveform Synthesis from MFCC Sequences with Generative Adversarial Networks

OPENALEX - Publications

Lauri Juvela Bajibabu Bollepalli Xin Wang Hirokazu Kameoka Manu Airaksinen and 2 more

This paper proposes a method for generating speech from filterbank mel frequency cepstral coefficients (MFCC), which are widely used in applications, such as ASR, but generally considered unusable synthesis. First, we predict fundamental and voicing information MFCCs with an autoregressive recurrent neural net. Second, the spectral envelope contained is converted to all-pole filters, pitch-synchronous excitation model matched these filters trained. Finally, introduce generative adversarial...

10.1109/icassp.2018.8461852 article EN 2018-04-01

Automatic assessment of infant carrying and holding using at-home wearable recordings

OPENALEX - Publications

Manu Airaksinen Einari Vaaras Leena Haataja Okko Räsänen Sampsa Vanhatalo

Abstract Assessing infant carrying and holding (C/H), or physical infant-caregiver interaction, is important for a wide range of contexts in development research. An automated detection quantification C/H particularly needed long term at-home studies where infants’ neurobehavior measured using wearable devices. Here, we first developed phenomenological categorization interactions to support five different definitions behaviors. Then, trained assessed deep learning-based classifiers their...

10.1038/s41598-024-54536-5 article EN cc-by Scientific Reports 2024-02-28

A Comparison Between STRAIGHT, Glottal, and Sinusoidal Vocoding in Statistical Parametric Speech Synthesis

OPENALEX - Publications

Manu Airaksinen Lauri Juvela Bajibabu Bollepalli Junichi Yamagishi Paavo Alku

A vocoder is used to express a speech waveform with controllable parametric representation that can be converted back into waveform. Vocoders representing their main categories (mixed excitation, glottal, and sinusoidal vocoders) were compared in this study formal crowd-sourced listening tests. The quality was measured within the context of analysis-synthesis as well text-to-speech (TTS) synthesis modern statistical framework. Furthermore, TTS experiments divided vocoder-specific features...

10.1109/taslp.2018.2835720 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2018-05-18

Speaker-independent Raw Waveform Model for Glottal Excitation

OPENALEX - Publications

Lauri Juvela Vassilis Tsiaras Bajibabu Bollepalli Manu Airaksinen Junichi Yamagishi and 1 more

Recent speech technology research has seen a growing interest in using WaveNets as statistical vocoders, i.e., generating waveforms from acoustic features.These models have been shown to improve the generated quality over classical vocoders many tasks, such text-to-speech synthesis and voice conversion.Furthermore, conditioning with features allows sharing waveform generator model across multiple speakers without additional speaker codes.However, multi-speaker WaveNet require large amounts...

10.21437/interspeech.2018-1635 article EN Interspeech 2022 2018-08-28

An automated bedside measure for monitoring neonatal cortical activity: a supervised deep learning-based electroencephalogram classifier with external cohort validation

OPENALEX - Publications

Saeed Montazeri Moghadam Manu Airaksinen Päivi Nevalainen Viviana Marchi Lena Hellström‐Westas and 2 more

BackgroundElectroencephalogram (EEG) monitoring is recommended as routine in newborn neurocritical care to facilitate early therapeutic decisions and outcome predictions. EEG's larger-scale implementation is, however, hindered by the shortage of expertise needed for interpretation spontaneous cortical activity, EEG background. We developed an automated algorithm that transforms recordings quantified interpretations background provides simple intuitive visualisations patient...

10.1016/s2589-7500(22)00196-0 article EN cc-by-nc-nd The Lancet Digital Health 2022-11-22

Motherese Directed at Prelinguistic Infants at Risk for Neurological Disorders: An Exploratory Study

OPENALEX - Publications

Okko Räsänen Manu Airaksinen Viviana Marchi Olena Chorna Andrea Guzzetta and 1 more

Abstract To investigate how a high risk for infant neurological impairment affects the quality of verbal interactions, and in particular properties infant-directed speech, spontaneous interactions between 14 mothers their 4.5-month-old infants at disorders (7 female) were recorded acoustically compared with those dyads typically developing (8 female). Mothers at-risk had proportionally less voicing, proportion voicing decreased increasing severity infants’ long-term outcome. Follow-up...

10.1017/s0305000924000217 article EN Journal of Child Language 2025-01-10

IAR 2.0: An algorithm for refining inconsistent annotations for time-series data using discriminative classifiers

OPENALEX - Publications

Einari Vaaras Manu Airaksinen Okko Räsänen

10.1109/access.2025.3534637 article EN cc-by IEEE Access 2025-01-01

Assessing motor development with wearables in low-resource settings: feasibility in rural Malawi

OPENALEX - Publications

Elisa Taylor Manu Airaksinen Rikhard Ihamuotila Milja Kivelä Per Ashorn and 3 more

Abstract Background Tracking of early motor development is essential for all neurodevelopmental assessments. A multisensor wearable system, MAIJU (Motor Assessment Infants with a JUmpsuit), was recently developed an objective and scalable measurement developing skills in out-of-hospital settings. Here, we assessed its feasibility remote low-resource Methods We recruited 44 infants repeated at-home measurements (total N = 121) the rural Malawi. (i) technical quality measured data, (ii)...

10.1038/s41390-025-03818-3 article EN cc-by Pediatric Research 2025-03-03

Assessing Infant Gross Motor Performance With an At-Home Wearable

OPENALEX - Publications

Manu Airaksinen Anastasia Gallen Elisa Taylor Sofie de Sena Taru Palsa and 2 more

Early development of gross motor skills is foundational for the upcoming neurocognitive performance. Here, we studied whether at-home wearable measurements performed by parents could be used to quantify and track infants' developing abilities. Unsupervised spontaneous activity were made repeatedly using a multisensor suit (altogether 620 from 134 infants at age 4-22 months). Machine learning-based algorithms developed detect reaching milestones (GMM), measure times spent in key postures,...

10.1542/peds.2024-068647 article EN PEDIATRICS 2025-03-07

PFML: Self-Supervised Learning of Time-Series Data Without Representation Collapse

OPENALEX - Publications

Einari Vaaras Manu Airaksinen Okko Räsänen

10.1109/access.2025.3556957 article EN cc-by IEEE Access 2025-01-01

GlottDNN — A Full-Band Glottal Vocoder for Statistical Parametric Speech Synthesis

OPENALEX - Publications

Manu Airaksinen Bajibabu Bollepalli Lauri Juvela Zhizheng Wu Simon King and 1 more

GlottHMM is a previously developed vocoder that has been successfully used in HMM-based synthesis by parameterizing speech into two parts (glottal flow, vocal tract) according to the functioning of real human voice production mechanism. In this study, new glottal vocoding method, GlottDNN, proposed. The GlottDNN built on principles its predecessor, GlottHMM, but introduces three main improvements: (1) takes advantage new, more accurate inverse filtering (2) uses method deep neural network...

10.21437/interspeech.2016-342 article EN Interspeech 2022 2016-08-29

Charting infants’ motor development at home using a wearable system: validation and comparison to physical growth charts

OPENALEX - Publications

Manu Airaksinen Elisa Taylor Anastasia Gallen Elina Ilén Antti Saari and 4 more

BackgroundEarly neurodevelopmental care and research are in urgent need of practical methods for quantitative assessment early motor development. Here, performance a wearable system was validated compared to developmental tracking physical growth charts.MethodsAltogether 1358 h spontaneous movement during 226 recording sessions 116 infants (age 4–19 months) were analysed using multisensor system. A deep learning-based automatic pipeline quantified categories infants' postures movements at...

10.1016/j.ebiom.2023.104591 article EN cc-by EBioMedicine 2023-05-01

Quantified Assessment of Infant's Gross Motor Abilities Using a Multisensor Wearable

OPENALEX - Publications

Elisa Taylor Manu Airaksinen Anastasia Gallen Tuuli Immonen Elina Ilén and 3 more

Developing objective and quantitative methods of early gross motor assessment is essential to better understand neurodevelopment support therapeutic interventions. Here, we present a method quantify performance using multisensor wearable, MAIJU (Motility Assessment Infants with JUmpsuit), which offers an automated, scalable, quantitative, fully automated cloud-based pipeline. This wearable suit equipped four movement sensors that record synchronized data mobile phone utilizing low-energy...

10.3791/65949 article EN Journal of Visualized Experiments 2024-05-17

High-pitched excitation generation for glottal vocoding in statistical parametric speech synthesis using a deep neural network

OPENALEX - Publications

Lauri Juvela Bajibabu Bollepalli Manu Airaksinen Paavo Alku

Achieving high quality and naturalness in statistical parametric synthesis of female voices remains to be difficult despite recent advances the study area. Vocoding is one such key element all speech synthesizers that known affect naturalness. The present focuses on a special type vocoding, glottal vocoders, which aim parameterize based modelling real excitation (voiced) speech, flow. More specifically, we compare three different vocoders by aiming at improved voices. Two are previously...

10.1109/icassp.2016.7472653 article EN 2016-03-01

OPENGLOT – An open environment for the evaluation of glottal inverse filtering

OPENALEX - Publications

Paavo Alku Tiina Murtola Jarmo Malinen Juha Kuortti Brad H. Story and 4 more

10.1016/j.specom.2019.01.005 article EN Speech Communication 2019-01-31

Building an Open Source Classifier for the Neonatal EEG Background: A Systematic Feature-Based Approach From Expert Scoring to Clinical Visualization

OPENALEX - Publications

Saeed Montazeri Moghadam Elana Pinchefsky Ilse Tse Viviana Marchi Jukka Kohonen and 8 more

Neonatal brain monitoring in the neonatal intensive care units (NICU) requires a continuous review of spontaneous cortical activity, i.e., electroencephalograph (EEG) background activity. This needs development bedside methods for an automated assessment EEG In this paper, we present key components classifier, starting from visual scoring to classifier design, and finally possible visualization results. A dataset with 13,200 5-minute epochs (8–16 channels) 27 infants birth asphyxia was used...

10.3389/fnhum.2021.675154 article EN cc-by Frontiers in Human Neuroscience 2021-05-31

Analysis and synthesis of shouted speech

OPENALEX - Publications

Tuomo Raitio Antti Suni Jouni Pohjalainen Manu Airaksinen Martti Vainio and 1 more

In this study, the acoustic properties of shouted speech are analyzed in relation to normal speech, and various synthesis techniques for shouting investigated. The analysis shows large differences between two styles, which induces difficulties synthesis. Analysis-synthesis experiments show that use spectral estimation methods not biased by sparse harmonics is beneficial. performed through adaptation voice conversion. Subjective evaluation reveals that, despite quality degradation, impression...

10.21437/interspeech.2013-391 article EN Interspeech 2022 2013-08-25

Estimation of the glottal source from coded telephone speech using deep neural networks

OPENALEX - Publications

N. P. Narendra Manu Airaksinen Brad H. Story Paavo Alku

10.1016/j.specom.2018.12.002 article EN Speech Communication 2018-12-07

Analysis of Self-Supervised Learning and Dimensionality Reduction Methods in Clustering-Based Active Learning for Speech Emotion Recognition

OPENALEX - Publications

Einari Vaaras Manu Airaksinen Okko Räsänen

10.21437/interspeech.2022-329 article EN Interspeech 2022 2022-09-16

Quasi-closed phase forward-backward linear prediction analysis of speech for accurate formant detection and estimation

OPENALEX - Publications

Dhananjaya Gowda Manu Airaksinen Paavo Alku

Recently, a quasi-closed phase (QCP) analysis of speech signals for accurate glottal inverse filtering was proposed. However, the QCP which belongs to family temporally weighted linear prediction (WLP) methods uses conventional forward type sample prediction. This may not be best choice especially in computing WLP models with hard-limiting weighting function. A selective minimization error reduces effective number samples available within given window frame. To counter this problem, modified...

10.1121/1.5001512 article EN The Journal of the Acoustical Society of America 2017-09-01

Coming Soon ...