- Natural Language Processing Techniques
- Topic Modeling
- Text Readability and Simplification
- Semantic Web and Ontologies
- Software Testing and Debugging Techniques
- Speech and dialogue systems
- Mathematics, Computing, and Information Processing
- Biomedical Text Mining and Ontologies
- Software Engineering Research
- Artificial Intelligence in Healthcare and Education
- Speech Recognition and Synthesis
- Handwritten Text Recognition Techniques
- Explainable Artificial Intelligence (XAI)
- Multimodal Machine Learning Applications
- Translation Studies and Practices
German Research Centre for Artificial Intelligence
2016-2023
Uppsala University
2017
Daniel Zeman, Martin Popel, Milan Straka, Jan Hajič, Joakim Nivre, Filip Ginter, Juhani Luotolahti, Sampo Pyysalo, Slav Petrov, Martin Potthast, Francis Tyers, Elena Badmaeva, Memduh Gokirmak, Anna Nedoluzhko, Silvie Cinková, Hajič jr., Jaroslava Hlaváčová, Václava Kettnerová, Zdeňka Urešová, Jenna Kanerva, Stina Ojala, Missilä, Christopher D. Manning, Sebastian Schuster, Siva Reddy, Dima Taji, Nizar Habash, Herman Leung, Marie-Catherine de Marneffe, Manuela Sanguinetti, Maria Simi, Hiroshi...
Abstract In this paper, we report an analysis of the strengths and weaknesses of several Machine Translation (MT) engines implementing the three most widely used paradigms. The analysis is based on a manually built test suite that comprises a large range of linguistic phenomena. Two main observations are, on the one hand, the striking improvement of a commercial online system when turning from a phrase-based to a neural engine, and, on the other hand, that successful translations of MT systems sometimes bear resemblance with those of a rule-based system.
Abstract In this article we present a novel linguistically driven evaluation method and apply it to the main approaches of Machine Translation (Rule-based, Phrase-based, Neural) to gain insights into their strengths and weaknesses in much more detail than is provided by current evaluation schemes. Translating between two languages requires substantial modelling of knowledge about the two languages, about translation, and about the world. Using English-German IT-domain translation as a case study, we also enhance a Phrase-based system by exploiting...
This paper offers a fine-grained analysis of the machine translation outputs in the context of the Shared Task at the 8th Conference on Machine Translation (WMT23). Building on the foundation of previous test suite efforts, our analysis includes Large Language Models and an updated test set featuring new linguistic phenomena. To our knowledge, this is the first such analysis for GPT-4 outputs. Our evaluation spans the German-English, English-German, and English-Russian language directions. Some of the phenomena with the lowest accuracies for German-English are idioms...
We present an analysis of 16 state-of-the-art MT systems on German-English based on a linguistically motivated test suite. The test suite has been devised manually by a team of language professionals in order to cover a broad variety of linguistic phenomena that MT often fails to translate properly. It contains 5,000 sentences covering 106 phenomena in 14 categories, with an increased focus on verb tenses, aspects and moods. The outputs are evaluated in a semi-automatic way through regular expressions that match only the part of the sentence that is relevant to each...
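The semi-automatic evaluation described above can be illustrated with a minimal sketch: each test-suite item pairs a source sentence with a regular expression that matches only the part of the MT output relevant to the phenomenon under test. All sentences, patterns, and outputs below are invented for illustration and are not taken from the actual test suite.

```python
import re

# Hypothetical test-suite items (illustrative only): each pairs a source
# sentence with a regex that targets just the phenomenon-relevant span.
suite = [
    {
        "phenomenon": "verb tense (present perfect)",
        "source": "Er hat das Buch gelesen.",
        "pass_pattern": r"\bhas read\b",      # accepted English rendering
    },
    {
        "phenomenon": "modal verb",
        "source": "Sie muss gehen.",
        "pass_pattern": r"\b(must|has to)\b",
    },
]

def evaluate(outputs):
    """Return the fraction of test items whose MT output matches its pattern."""
    correct = 0
    for item, hyp in zip(suite, outputs):
        if re.search(item["pass_pattern"], hyp, flags=re.IGNORECASE):
            correct += 1
    return correct / len(suite)

# The second output mistranslates the modal, so only 1 of 2 items passes.
print(f"accuracy: {evaluate(['He has read the book.', 'She should go.']):.0%}")
```

Because the regex inspects only the relevant span, the rest of the sentence can vary freely without affecting the verdict; in practice, borderline matches are flagged for manual checking, which is what makes the approach semi-automatic.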
We present the results of the application of a grammatical test suite for German-to-English MT on the systems submitted at WMT19, with a detailed analysis of 107 phenomena organized in 14 categories. The systems still translate wrongly one out of four test items on average. Low performance is indicated for idioms, modals, pseudo-clefts, multi-word expressions and verb valency. When compared to last year, there has been an improvement on function words, non-verbal agreement and punctuation. More conclusions about particular phenomena are also presented.
We present an alternative method of evaluating Quality Estimation systems, which is based on a linguistically motivated Test Suite. We create a test set consisting of 14 linguistic error categories and we gather for each of them a set of samples with both correct and erroneous translations. Then, we measure the performance of 5 systems by checking their ability to distinguish between the two. The detailed results are much more informative about the abilities of each system. The fact that different systems perform differently at various phenomena confirms the usefulness of the Test Suite.
We are presenting a hybrid MT approach for the WMT2016 Shared Translation Task for the IT-Domain. Our work consists of several translation components based on rule-based and statistical approaches that feed into an informed selection mechanism. Additions to last year's submission include a WSD component, a syntactically-enhanced component and improvements relevant to the particular domain. We also present a detailed human evaluation of the output of all components, focusing on systematic errors.
Robert Schwarzenberg, David Harbecke, Vivien Macketanz, Eleftherios Avramidis, Sebastian Möller. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (Demonstrations). 2019.
This paper describes a test suite submission providing detailed statistics of linguistic performance for the state-of-the-art German-English systems at the Fifth Conference on Machine Translation (WMT20). The analysis covers 107 phenomena organized in 14 categories, based on about 5,500 test items, including a manual annotation effort of 45 person hours. Two systems (Tohoku and Huoshan) appear to have significantly better accuracy than the others, although the best system of WMT20 is not significantly better than the one from WMT19 in a macro-average. Additionally,...
We employ a linguistically motivated challenge set in order to evaluate the state-of-the-art machine translation metrics submitted to the Metrics Shared Task of the 8th Conference for Machine Translation. The challenge set includes about 21,000 items extracted from 155 systems in three language directions, covering more than 100 linguistically motivated phenomena organized in 14 categories. The metrics that have the best performance with regard to our analysis are Cometoid22-wmt23 (a trained metric based on distillation) for German-English and...
The quality of machine-generated text is a complex construct consisting of various aspects and dimensions. We present a study that aims to uncover relevant perceptual dimensions for one type of machine-generated text, that is, Machine Translation. We conducted a crowdsourcing survey in the style of a Semantic Differential to collect attribute ratings of German MT outputs. An Exploratory Factor Analysis revealed the underlying dimensions. As a result, we extracted four factors that operate as Quality of Experience dimensions for MT outputs: precision, complexity, grammaticality,...
Evaluating translation models is a trade-off between effort and detail. On the one end of the spectrum there are automatic count-based methods such as BLEU, on the other end linguistic evaluations by humans, which are arguably more informative but also require disproportionately high effort. To narrow the spectrum, we propose a general approach for automatically exposing systematic differences between human and machine translations to human experts. Inspired by adversarial settings, we train a neural text classifier to distinguish human from...
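The classifier-based idea sketched in this abstract can be illustrated with a toy stand-in: a pure-Python perceptron over bag-of-words features that learns to separate "human" from "machine" texts, whose learned weights then point experts at systematically over- or under-used words. The paper trains a neural classifier; this simplified sketch and all example sentences are assumptions for illustration, not the authors' method or data.

```python
from collections import Counter

# Tiny illustrative corpus (invented): fluent human phrasings vs. stilted
# machine-like paraphrases of the same content.
human = ["the report was finished ahead of schedule",
         "she shrugged and walked away"]
machine = ["the report was being finished before the plan",
           "she made a shrug and went away"]

def features(text):
    return Counter(text.split())   # bag-of-words counts

weights = Counter()
for _ in range(20):                # a few perceptron epochs
    for text, label in [(t, 1) for t in human] + [(t, -1) for t in machine]:
        score = sum(weights[w] * c for w, c in features(text).items())
        if score * label <= 0:     # misclassified: nudge weights toward label
            for w, c in features(text).items():
                weights[w] += label * c

def predict(text):
    score = sum(weights[w] * c for w, c in features(text).items())
    return "human" if score > 0 else "machine"

# High-magnitude weights (e.g. "shrugged" vs. "made a shrug") are the
# systematic differences that would be surfaced to human experts.
print(predict("she shrugged"))   # classified as human-like phrasing
```

The interesting output here is not the prediction itself but the weight vector: features the classifier relies on correspond to phrasings that systematically distinguish the two text sources.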
Patrick Stadler, Vivien Macketanz, Eleftherios Avramidis. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: Student Research Workshop. 2021.