NFDI4DS | UHH-SEMS - Publication Details

Maria Powell

ORCID: 0000-0002-6643-8991

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5003384228

Research Areas

Voice and Speech Disorders
Speech Recognition and Synthesis
Dysphagia Assessment and Management
Music and Audio Processing
Tracheal and airway disorders
Speech and Audio Processing
Artificial Intelligence in Healthcare and Education
Cleft Lip and Palate Research
Speech and dialogue systems
Machine Learning in Healthcare
AI in cancer detection
COVID-19 diagnosis using AI
Neurological disorders and treatments
Radiomics and Machine Learning in Medical Imaging
Craniofacial Disorders and Treatments
Ear and Head Tumors
Anesthesia and Sedative Agents
Vasculitis and related conditions
Digital Communication and Language
Phonetics and Phonology Research
Atherosclerosis and Cardiovascular Diseases
Respiratory and Cough-Related Research
Anesthesia and Pain Management
Risk Perception and Management
Head and Neck Cancer Studies

Vanderbilt University Medical Center
2017-2025

Vanderbilt University
2016-2020

Cincinnati Children's Hospital Medical Center
2014-2019

University of Cincinnati
2014-2019

Experimental investigation on minimum frame rate requirements of high-speed videoendoscopy for clinical voice assessment

OPENALEX - Publications

Dimitar D. Deliyski Maria Powell Stephanie R. C. Zacharias Terri Treman Gerlach Alessandro de Alarcón

10.1016/j.bspc.2014.11.007 article EN Biomedical Signal Processing and Control 2014-12-29

Efficacy of Videostroboscopy and High-Speed Videoendoscopy to Obtain Functional Outcomes From Perioperative Ratings in Patients With Vocal Fold Mass Lesions

OPENALEX - Publications

Maria Powell Dimitar D. Deliyski Steven M. Zeitels James A. Burns Robert E. Hillman and 2 more

10.1016/j.jvoice.2019.03.012 article EN Journal of Voice 2019-04-17

FastAudio: A Learnable Audio Front-End For Spoof Speech Detection

OPENALEX - Publications

Quchen Fu Zhongwei Teng Jules White Maria Powell Douglas C. Schmidt

Spoof speech can be used to try and fool speaker verification systems that determine the identity of based on voice characteristics. This paper compares popular learnable front-ends this task. We categorize by defining two generic architectures then analyze filtering stages both types in terms learning constraints. pro-pose replacing fixed filterbanks with a layer better adapt anti-spoofing tasks. The proposed FastAudio front-end is tested back-ends measure performance Logical Access track...

10.1109/icassp43922.2022.9746722 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022-04-27

Leveraging large language models for automated detection of velopharyngeal dysfunction in patients with cleft palate

OPENALEX - Publications

Myranda Uselton Shirk Catherine Dang J. Silvia Cho Hanlin Chen Lily Hofstetter and 9 more

Hypernasality, a hallmark of velopharyngeal insufficiency (VPI), is speech disorder with significant psychosocial and functional implications. Conventional diagnostic methods rely heavily on specialized expertise equipment, posing challenges in resource-limited settings. This study explores the application OpenAI's Whisper model for automated hypernasality detection, offering scalable efficient alternative to traditional approaches. The was adapted binary classification by replacing its...

10.3389/fdgth.2025.1552746 article EN cc-by Frontiers in Digital Health 2025-03-28

Uncertainty of Spatial Segmentation of High-Speed Videoendoscopy and Its Temporal and Spatial Dependency

OPENALEX - Publications

Hamzeh Ghasemzadeh Maria Powell David S. Ford Dimitar D. Deliyski

10.1016/j.jvoice.2025.03.007 article EN Journal of Voice 2025-03-01

Machine Learning for Automatic Detection of Velopharyngeal Dysfunction: A Preliminary Report

OPENALEX - Publications

Claiborne Lucas Ricardo A. Torres‐Guzman Andrew James Scott Corlew Amy L. Stone and 3 more

Background: Even after palatoplasty, the incidence of velopharyngeal dysfunction (VPD) can reach 30%; however, these estimates arise from high-income countries (HICs) where speech-language pathologists (SLP) are part standardized cleft teams. The VPD burden in low- and middle-income (LMICs) is unknown. This study aims to develop a machine-learning model that detect presence using audio samples alone. Methods: Case control were obtained institutional publicly available sources. A was built...

10.1097/scs.0000000000010147 article EN Journal of Craniofacial Surgery 2024-05-06

Vibratory function and healing outcomes after small intestinal submucosa biomaterial implantation for chronic vocal fold scar

OPENALEX - Publications

Michael J. Pitman Takashi Kurita Maria Powell Emily E. Kimball Masanobu Mizuta and 3 more

Objectives/Hypothesis Vocal fold scar is a major cause of dysphonia, and optimal treatments do not currently exist. Small intestinal submucosa (SIS) biomaterial developed for the treatment variety pathologies. The purpose this study was to investigate effects SIS implantation on tissue remodeling in scarred vocal folds using routine staining, immunohistochemistry, high‐speed videoendoscopy (HSV). Study Design Prospective, blinded group analysis. Methods Thirteen New Zealand White rabbits...

10.1002/lary.26883 article EN The Laryngoscope 2017-11-06

Decoding phonation with artificial intelligence (DeP AI): Proof of concept

OPENALEX - Publications

Maria Powell Marcelino Rodriguez Cancio David Young William Nock Beshoy A. Abdelmessih and 7 more

Acoustic analysis of voice has the potential to expedite detection and diagnosis disorders. Applying an image-based, neural-network approach analyzing acoustic signal may be effective means for detecting differentially diagnosing The purpose this study is provide a proof-of-concept that embedded data within human phonation can accurately efficiently decoded with deep learning neural network differentiate between normal disordered voices.Acoustic recordings from 10 vocally-healthy speakers,...

10.1002/lio2.259 article EN cc-by-nc-nd Laryngoscope Investigative Otolaryngology 2019-03-25

Comparison of Videostroboscopy to Stroboscopy Derived From High-Speed Videoendoscopy for Evaluating Patients With Vocal Fold Mass Lesions

OPENALEX - Publications

Maria Powell Dimitar D. Deliyski Robert E. Hillman Steven M. Zeitels James A. Burns and 1 more

Purpose Videostroboscopy (VS) uses an indirect physiological signal to predict the phase of vocal fold vibratory cycle for sampling. Simulated stroboscopy (SS) extracts glottal directly from changing area in high-speed videoendoscopy (HSV) image sequence. The purpose this study is determine reliability SS relative VS clinical assessment function patients with mass lesions. Methods and recordings were obtained 28 lesions before after phonomicrosurgery 17 controls who vocally healthy. Two...

10.1044/2016_ajslp-15-0050 article EN American Journal of Speech-Language Pathology 2016-10-07

Stakeholder perspectives on ethical and trustworthy voice AI in health care

OPENALEX - Publications

Jean‐Christophe Bélisle‐Pipon Maria Powell Renee English Marie‐Françoise Malo Vardit Ravitsky and 1 more

Objective Voice as a health biomarker using artificial intelligence (AI) is gaining momentum in research. The noninvasiveness of voice data collection through accessible technology (such smartphones, telehealth, and ambient recordings) or within clinical contexts means AI may help address disparities promote the inclusion marginalized communities. However, development AI-ready datasets free from bias discrimination complex task. objective this study to better understand perspectives engaged...

10.1177/20552076241260407 article EN cc-by-nc-nd Digital Health 2024-01-01

Current Practices in Voice Data Collection and Limitations to Voice AI Research: A National Survey

OPENALEX - Publications

Emily Evangelista Rohan Kale Desiree McCutcheon Anaïs Rameau Alexander Gelbard and 10 more

Accuracy and validity of voice AI algorithms rely on substantial quality data. Although commensurable amounts data are captured daily in centers across North America, there is no standardized protocol for acoustic management, which limits the usability these datasets artificial intelligence (AI) research.

10.1002/lary.31052 article EN The Laryngoscope 2023-12-13

SA-SASV: An End-to-End Spoof-Aggregated Spoofing-Aware Speaker Verification System

OPENALEX - Publications

Zhongwei Teng Quchen Fu Jules White Maria Powell Douglas C. Schmidt

Research in the past several years has boosted performance of automatic speaker verification systems and countermeasure to deliver low Equal Error Rates (EERs) on each system. However, research joint optimization both is still limited. The Spoofing-Aware Speaker Verification (SASV) 2022 challenge was proposed encourage development integrated SASV with new metrics evaluate model performance. This paper proposes an ensemble-free end-to-end solution, known as Spoof-Aggregated-SASV (SA-SASV)...

10.21437/interspeech.2022-11029 article EN Interspeech 2022 2022-09-16

Different Vibratory Conditions Elicit Different Structural and Biological Vocal Fold Changes in an In-Vivo Rabbit Model of Phonation

OPENALEX - Publications

Emily E. Kimball Lea Sayce Maria Powell Gary J. Gartling Jennifer Brandley and 1 more

10.1016/j.jvoice.2019.08.023 article EN Journal of Voice 2019-09-18

A multi-stage transfer learning strategy for diagnosing a class of rare laryngeal movement disorders

OPENALEX - Publications

Yu Yao Maria Powell Jules White Jian Feng Quchen Fu and 2 more

10.1016/j.compbiomed.2023.107534 article EN Computers in Biology and Medicine 2023-09-29

Continuous Rate Infusion of Ketamine Hydrochloride and Dexmedetomidine for Maintenance of Anesthesia during Laryngotracheal Surgery in New Zealand White Rabbits (Oryctolagus cuniculus)

OPENALEX - Publications

Lea Sayce Maria Powell Emily E. Kimball Patty Chen Gary J. Gartling and 1 more

New Zealand white rabbits (Oryctolagus cuniculus) are an established in vivo model for the study of structural and functional consequences vocal-fold vibration. Research design requires invasive laryngotracheal procedures, presence laryngospasms or pain responses (or both) hinder phonation-related data collection. Published anesthesia regimens report respiratory depression muscle tone changes have been unsuccessful mitigating autonomic laryngeal our protocol. Infusion ketamine hydrochloride...

10.30802/aalas-jaalas-19-000076 article EN Journal of the American Association for Laboratory Animal Science 2020-01-31

Optimizing Botox regimens in patients with adductor spasmodic dysphonia and essential tremor of voice: A 31‐year experience

OPENALEX - Publications

Amy L. Stone Maria Powell Kaitlyn Hamers Kenneth C. Fletcher David O. Francis and 3 more

The purpose of this study was to quantitatively compare the effectiveness unilateral and bilateral botulinum toxin A (BTX-A) injections for mitigating undesirable weak/breathy voice quality dysphagia patients with adductor spasmodic dysphonia and/or essential tremor (ETV).

10.1002/lio2.915 article EN cc-by-nc-nd Laryngoscope Investigative Otolaryngology 2022-09-19

Developing Multi-Disorder Voice Protocols: A team science approach involving clinical expertise, bioethics, standards, and DEI.

OPENALEX - Publications

Yaël Bensoussan Satrajit Ghosh Anaïs Rameau Micah Boyer Ruth Huntley Bahr and 13 more

The world of voice biomarkers is rapidly evolving thanks to the use artificial intelligence (AI) allowing large-scale analysis voice, speech, and respiratory sound data. Bridge2AI-Voice project aims build a large-scale, ethically sourced, diverse database human voices linked health information help fuel Voice AI research, dubbed Audiomics. current paper describes development protocols data acquisition across 4 different adult cohorts disease (voice, respiratory, neurodegenerative diseases,...

10.21437/interspeech.2024-1926 article EN Interspeech 2022 2024-09-01

Unilateral and Bilateral Laryngeal Pacing for Bilateral Vocal Fold Paralysis

OPENALEX - Publications

Maria Powell David L. Zealear Yike Li C. Gaelyn Garrett Kate Von Wahlde and 1 more

Present the state-of-the-art overview of laryngeal pacing for treatment bilateral vocal fold paralysis. A minimally invasive unilateral system and a fully implantable are currently in clinical trials. The relative advantages disadvantages each discussed.Research functional electrical stimulation reanimation posterior cricoarytenoid muscle has successfully translated from animal models to human trials pacing. Current findings suggest humans significantly improves ventilation but only...

10.1007/s40136-020-00313-7 article EN cc-by Current Otorhinolaryngology Reports 2020-09-03

Machine Learning for Automatic Detection of Velopharyngeal Dysfunction: Proof of Concept

OPENALEX - Publications

Nicholas R. O’Sick Claiborne Lucas Ricardo A. Torres‐Guzman Andrew James Scott Corlew and 4 more

Background: Even after palatoplasty, the incidence of velopharyngeal dysfunction (VPD) can reach 30%; however, these estimates arise from high-income countries (HICs) where speech-language pathologists are part standardized cleft teams. The VPD burden in low- and middle-income (LMICs) is unknown. This study aims to develop a machine learning model that detect presence using audio samples alone. Methods: Case control were obtained by institutional publicly available sources. A was built...

10.1097/01.gox.0001024432.49195.f3 article EN cc-by-nc-nd Plastic & Reconstructive Surgery Global Open 2024-06-01

Experimental Framework for Generating Reliable Ground Truth for Laryngeal Spatial Segmentation Tasks

OPENALEX - Publications

Hamzeh Ghasemzadeh David S. Ford Maria Powell Dimitar D. Deliyski

Objective: The validity of objective measures derived from high-speed videoendoscopy (HSV) depends, among other factors, on the spatial segmentation. Evaluation segmentation requires existence reliable ground truths. This study presents a framework for creating truth with sub-pixel resolution and then evaluates its performance. Method: proposed is three-stage process. First, three laryngeal imaging experts performed task. Second, regions high discrepancies between were determined overlaid...

10.48550/arxiv.2409.02809 preprint EN arXiv (Cornell University) 2024-09-04

Coming Soon ...