- Natural Language Processing Techniques
- Topic Modeling
- Speech Recognition and Synthesis
- Speech and Dialogue Systems
- Multimodal Machine Learning Applications
- Language and Cultural Evolution
- Text Readability and Simplification
- Music and Audio Processing
- Phonetics and Phonology Research
- Semantic Web and Ontologies
- Biomedical Text Mining and Ontologies
- Machine Learning and Data Classification
- Data Quality and Management
- Authorship Attribution and Profiling
- Web Data Mining and Analysis
- Speech and Audio Processing
- Seismology and Earthquake Studies
- Digital Humanities and Scholarship
- Hand Gesture Recognition Systems
- GNSS Positioning and Interference
- Subtitles and Audiovisual Media
- Linguistic Variation and Morphology
- Hearing Impairment and Communication
- Advanced Data Processing Techniques
- Domain Adaptation and Few-Shot Learning
Johns Hopkins University
2020-2023
Microsoft (United States)
2023
University of Copenhagen
2020-2023
Charles University
2023
The University of Melbourne
2020-2022
Carnegie Mellon University
2013-2022
Carleton University
2022
Georgetown University
2022
Yale University
2022
University of Florida
2022
Antonios Anastasopoulos, Loïc Barrault, Luisa Bentivogli, Marcely Zanon Boito, Ondřej Bojar, Roldano Cattoni, Anna Currey, Georgiana Dinu, Kevin Duh, Maha Elbayad, Clara Emmanuel, Yannick Estève, Marcello Federico, Christian Federmann, Souhir Gahbiche, Hongyu Gong, Roman Grundkiewicz, Barry Haddow, Benjamin Hsu, Dávid Javorský, Vĕra Kloudová, Surafel Lakew, Xutai Ma, Prashant Mathur, Paul McNamee, Kenton Murray, Maria Nǎdejde, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, John...
Milind Agarwal, Sweta Agrawal, Antonios Anastasopoulos, Luisa Bentivogli, Ondřej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, Mingda Chen, William Chen, Khalid Choukri, Alexandra Chronopoulou, Anna Currey, Thierry Declerck, Qianqian Dong, Kevin Duh, Yannick Estève, Marcello Federico, Souhir Gahbiche, Barry Haddow, Benjamin Hsu, Phu Mon Htut, Hirofumi Inaguma, Dávid Javorský, John Judge, Yasumasa Kano, Tom Ko, Rishu Kumar, Pengwei Li, Xutai Ma, Prashant Mathur, Evgeny...
Ebrahim Ansari, Amittai Axelrod, Nguyen Bach, Ondřej Bojar, Roldano Cattoni, Fahim Dalvi, Nadir Durrani, Marcello Federico, Christian Federmann, Jiatao Gu, Fei Huang, Kevin Knight, Xutai Ma, Ajay Nagesh, Matteo Negri, Jan Niehues, Juan Pino, Elizabeth Salesky, Xing Shi, Sebastian Stüker, Marco Turchi, Alexander Waibel, Changhan Wang. Proceedings of the 17th International Conference on Spoken Language Translation. 2020.
We present the Multilingual TEDx corpus, built to support speech recognition (ASR) and speech translation (ST) research across many non-English source languages. The corpus is a collection of audio recordings from TEDx talks in 8 source languages. We segment transcripts into sentences and align them with the source-language audio and target-language translations. The corpus is released along with open-sourced code enabling extension to new languages as they become available. Our corpus creation methodology can be applied to more languages than previous work, and creates...
Ekaterina Vylomova, Jennifer White, Elizabeth Salesky, Sabrina J. Mielke, Shijie Wu, Edoardo Maria Ponti, Rowan Hall Maudslay, Ran Zmigrod, Josef Valvoda, Svetlana Toldova, Francis Tyers, Elena Klyachko, Ilya Yegorov, Natalia Krizhanovsky, Paula Czarnowska, Irene Nikkarinen, Andrew Krizhanovsky, Tiago Pimentel, Lucas Torroba Hennigen, Christo Kirov, Garrett Nicolai, Adina Williams, Antonios Anastasopoulos, Hilaria Cruz, Eleanor Chodroff, Ryan Cotterell, Miikka Silfverberg, Mans Hulden. Proceedings of the...
Antonios Anastasopoulos, Ondřej Bojar, Jacob Bremerman, Roldano Cattoni, Maha Elbayad, Marcello Federico, Xutai Ma, Satoshi Nakamura, Matteo Negri, Jan Niehues, Juan Pino, Elizabeth Salesky, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Alexander Waibel, Changhan Wang, Matthew Wiesner. Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021). 2021.
What are the units of text that we want to model? From bytes to multi-word expressions, text can be analyzed and generated at many granularities. Until recently, most natural language processing (NLP) models operated over words, treating those as discrete and atomic tokens, but starting with byte-pair encoding (BPE), subword-based approaches have become dominant in many areas, enabling small vocabularies while still allowing for fast inference. Is the end of the road character-level or byte-level processing? In...
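The BPE procedure this abstract refers to can be sketched in a few lines: repeatedly merge the most frequent adjacent symbol pair in the training corpus. A minimal, illustrative implementation (the toy word list and merge count are assumptions, not from the paper):

```python
# Minimal byte-pair encoding (BPE) merge learning on a toy corpus.
from collections import Counter

def learn_bpe(words, num_merges):
    # Represent each word as a tuple of symbols, starting from characters.
    vocab = Counter(tuple(w) for w in words)
    merges = []
    for _ in range(num_merges):
        # Count all adjacent symbol pairs, weighted by word frequency.
        pairs = Counter()
        for word, freq in vocab.items():
            for a, b in zip(word, word[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break
        best = max(pairs, key=pairs.get)
        merges.append(best)
        # Apply the merge everywhere it occurs.
        merged = {}
        for word, freq in vocab.items():
            out, i = [], 0
            while i < len(word):
                if i + 1 < len(word) and (word[i], word[i + 1]) == best:
                    out.append(word[i] + word[i + 1])
                    i += 2
                else:
                    out.append(word[i])
                    i += 1
            merged[tuple(out)] = freq
        vocab = merged
    return merges

merges = learn_bpe(["low", "lower", "lowest", "low"], num_merges=2)
print(merges)  # → [('l', 'o'), ('lo', 'w')]
```

In practice the merge count (often tens of thousands) controls the vocabulary size, which is the lever behind the "small vocabularies" trade-off mentioned above.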
The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation, and a type-level resource of annotated data in diverse languages realizing that schema. This paper presents the expansions and improvements made on several fronts over the last couple of years (since McCarthy et al. (2020)). Collaborative efforts...
Previous work on end-to-end translation from speech has primarily used frame-level features as speech representations, which creates longer, sparser sequences than text. We show that a naive method to create compressed phoneme-like speech representations is far more effective and efficient for translation than traditional frame-level features. Specifically, we generate phoneme labels for the frames and average consecutive frames with the same label to create shorter, higher-level source sequences for translation. We see improvements of up to 5 BLEU in both our high and low resource...
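The frame-averaging step described above is simple to state concretely: collapse runs of consecutive frames sharing a phoneme label into one averaged vector. A minimal sketch with toy feature vectors and labels (not data from the paper):

```python
# Collapse consecutive frames with the same phoneme label into their mean.
from itertools import groupby

def average_by_phoneme(frames, labels):
    """frames: list of feature vectors; labels: one phoneme label per frame."""
    reduced, reduced_labels = [], []
    idx = 0
    for label, group in groupby(labels):
        n = len(list(group))
        segment = frames[idx:idx + n]
        # Element-wise mean over the segment's frames.
        mean = [sum(col) / n for col in zip(*segment)]
        reduced.append(mean)
        reduced_labels.append(label)
        idx += n
    return reduced, reduced_labels

frames = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]
labels = ["AH", "AH", "T", "T"]
print(average_by_phoneme(frames, labels))
# → ([[2.0, 3.0], [6.0, 7.0]], ['AH', 'T'])
```

The output sequence is as long as the number of phoneme segments rather than the number of frames, which is the source of the shorter, higher-level representations the abstract describes.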
Prior work has explored directly regularizing the output distributions of probabilistic models to alleviate peaky (i.e. over-confident) predictions, a common sign of overfitting. This class of techniques, of which label smoothing is one, has a connection to entropy regularization. Despite consistent success across architectures and data sets in language generation tasks, two problems remain open: (1) there is little understanding of the underlying effects these regularizers have on models, and (2) the full space of entropy regularization...
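Label smoothing, the representative regularizer named above, replaces the one-hot training target with a mixture of the one-hot label and the uniform distribution, penalizing peaked outputs. A minimal sketch; the vocabulary, probabilities, and epsilon=0.1 are illustrative assumptions:

```python
# Cross-entropy against a label-smoothed target distribution
# q = (1 - eps) * one_hot(target) + eps * uniform.
import math

def label_smoothed_nll(log_probs, target, epsilon=0.1):
    V = len(log_probs)
    uniform = epsilon / V
    loss = 0.0
    for i, lp in enumerate(log_probs):
        q = (1.0 - epsilon) + uniform if i == target else uniform
        loss -= q * lp
    return loss

# A peaked model prediction over a toy 4-word vocabulary.
probs = [0.85, 0.05, 0.05, 0.05]
log_probs = [math.log(p) for p in probs]
loss = label_smoothed_nll(log_probs, target=0)
print(round(loss, 3))
```

Because some probability mass is assigned to every class, the loss stays bounded away from zero even for a confident correct prediction, which is what discourages over-confident (low-entropy) output distributions.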
This paper reports on the shared tasks organized by the 21st IWSLT Conference. The shared tasks address 7 scientific challenges in spoken language translation: simultaneous and offline translation, automatic subtitling and dubbing, speech-to-speech translation, dialect and low-resource speech translation, and Indic languages. The tasks attracted 18 teams whose submissions are documented in 26 system papers. The growing interest towards spoken language translation is also witnessed by the constantly increasing number of task organizers and contributors to the overview paper, almost evenly...
Machine translation models have discrete vocabularies and commonly use subword segmentation techniques to achieve an ‘open vocabulary.’ This approach relies on consistent and correct underlying unicode sequences, and makes models susceptible to degradation from common types of noise and variation. Motivated by the robustness of human language processing, we propose the use of visual text representations, which dispense with a finite set of text embeddings in favor of continuous vocabularies created by processing visually rendered text with sliding windows. We...
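The sliding-window idea above replaces discrete token embeddings with fixed-width slices of a rendered image of the text. A minimal sketch of the windowing step only, using a toy bitmap in place of real font rendering (the window width and stride here are illustrative assumptions):

```python
# Slice a "rendered text" image into overlapping fixed-width windows;
# each window plays the role of one continuous input token.
def sliding_windows(image, width, stride):
    """image: list of pixel rows; returns width-column slices of the image."""
    n_cols = len(image[0])
    windows = []
    for start in range(0, n_cols - width + 1, stride):
        windows.append([row[start:start + width] for row in image])
    return windows

# Toy 2x12 checkerboard standing in for rasterized text.
image = [[(r + c) % 2 for c in range(12)] for r in range(2)]
wins = sliding_windows(image, width=4, stride=2)
print(len(wins), len(wins[0][0]))  # → 5 4
```

Since every unicode string can be rendered, there is no out-of-vocabulary symbol: noisy or unseen characters still produce pixels, which is the robustness argument the abstract makes.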
As large language models (LLMs) become more and more capable in languages other than English, it is important to collect benchmark datasets in order to evaluate their multilingual performance, including on tasks like machine translation (MT). In this work, we extend the WMT24 dataset to cover 55 languages by collecting new human-written references and post-edits for 46 new languages and dialects, in addition to 8 out of the 9 languages in the original dataset. The dataset covers four domains: literary, news, social, and speech. We benchmark a variety of MT providers and LLMs on the collected dataset using...
End-to-end models for speech translation (ST) more tightly couple speech recognition (ASR) and machine translation (MT) than a traditional cascade of separate ASR and MT models, with simpler model architectures and the potential for reduced error propagation. Their performance is often assumed to be superior, though in many conditions this is not yet the case. We compare cascaded and end-to-end models across high, medium, and low-resource conditions, and show that cascades remain stronger baselines. Further, we introduce two methods to incorporate...
Transformer models are powerful sequence-to-sequence architectures that are capable of directly mapping speech inputs to transcriptions or translations. However, the mechanism for modeling positions in this model was tailored for text modeling, and thus is less ideal for acoustic inputs. In this work, we adapt the relative position encoding scheme to the Speech Transformer, where the key addition is relative distance between input states in the self-attention network. As a result, the network can better adapt to the variable distributions present in speech data. Our...
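The core of relative position encoding is that each attention score between positions i and j looks up an embedding indexed by the clipped offset j - i, rather than by absolute position. A minimal sketch of that index computation (sequence length and clipping distance are illustrative):

```python
# Build the matrix of clipped, shifted relative offsets used to index a
# relative-position embedding table of size 2 * max_distance + 1.
def relative_position_index(seq_len, max_distance):
    matrix = []
    for i in range(seq_len):
        row = []
        for j in range(seq_len):
            # Clip the offset so distant positions share an embedding.
            offset = max(-max_distance, min(max_distance, j - i))
            # Shift to be non-negative for table indexing.
            row.append(offset + max_distance)
        matrix.append(row)
    return matrix

for row in relative_position_index(seq_len=4, max_distance=2):
    print(row)
```

Because the lookup depends only on the distance j - i, the same pattern applies at any absolute position, which is what lets the model generalize across the widely varying sequence lengths of acoustic input.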
Language models are defined over a finite set of inputs, which creates a vocabulary bottleneck when we attempt to scale the number of supported languages. Tackling this bottleneck results in a trade-off between what can be represented in the embedding matrix and computational issues in the output layer. This paper introduces PIXEL, the Pixel-based Encoder of Language, which suffers from neither of these issues. PIXEL is a pretrained language model that renders text as images, making it possible to transfer representations across languages...
Elizabeth Salesky, Matthias Sperber, Alexander Waibel. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 2019.
When translating from speech, special consideration for conversational speech phenomena such as disfluencies is necessary. Most machine translation training data consists of well-formed written texts, causing issues when translating spontaneous speech. Previous work has introduced an intermediate step between speech recognition (ASR) and machine translation (MT) to remove disfluencies, making the data better-matched to typical written text and significantly improving performance. However, with the rise of end-to-end systems, this step must be incorporated into...
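To make the intermediate disfluency-removal step concrete, here is a toy rule-based filter over an ASR transcript. The filler list and the repeated-word rule are illustrative assumptions; the work described above addresses this with learned models, not hard-coded rules:

```python
# Toy disfluency filter: drop filled pauses and immediate word repetitions.
FILLERS = {"uh", "um", "uhm", "er"}

def remove_disfluencies(tokens):
    cleaned = []
    for tok in tokens:
        if tok.lower() in FILLERS:
            continue  # drop filled pauses ("uh", "um", ...)
        if cleaned and tok.lower() == cleaned[-1].lower():
            continue  # collapse immediate repetitions ("I I want")
        cleaned.append(tok)
    return cleaned

print(remove_disfluencies("I I uh want um to to go".split()))
# → ['I', 'want', 'to', 'go']
```

Even this crude filter shows why the step matters: the cleaned output looks much more like the well-formed written text MT systems are trained on than the raw transcript does.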
While there exist scores of natural languages, each with its unique features and idiosyncrasies, they all share a unifying theme: enabling human communication. We may thus reasonably predict that human cognition shapes how these languages evolve and are used. Assuming that the capacity to process information is roughly constant across human populations, we expect a surprisal–duration trade-off to arise both across and within languages. We analyse this trade-off using a corpus of 600 languages and, after controlling for several potential confounds, find...
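Surprisal, one side of the trade-off above, is the information content -log2 p(unit | context). A minimal sketch using toy unigram estimates (the token list is illustrative, not data from the 600-language corpus):

```python
# Estimate per-type surprisal (in bits) from unigram relative frequencies.
import math
from collections import Counter

def surprisals(tokens):
    counts = Counter(tokens)
    total = len(tokens)
    return {t: -math.log2(c / total) for t, c in counts.items()}

tokens = ["a", "a", "a", "a", "b", "b", "c", "d"]
print(surprisals(tokens))
# → {'a': 1.0, 'b': 2.0, 'c': 3.0, 'd': 3.0}
```

Under the trade-off hypothesis, higher-surprisal units should tend to be produced with longer durations, so that the information transmitted per unit time stays roughly constant.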
Johannes Bjerva, Elizabeth Salesky, Sabrina J. Mielke, Aditi Chaudhary, Giuseppe G. A. Celano, Edoardo Maria Ponti, Ekaterina Vylomova, Ryan Cotterell, Isabelle Augenstein. Proceedings of the Second Workshop on Computational Research in Linguistic Typology. 2020.
Elizabeth Salesky, Eleanor Chodroff, Tiago Pimentel, Matthew Wiesner, Ryan Cotterell, Alan W Black, Jason Eisner. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020.