- Natural Language Processing Techniques
- Topic Modeling
- Software Engineering Research
- Advanced Combinatorial Mathematics
- Text Readability and Simplification
- Adversarial Robustness in Machine Learning
- Advanced Text Analysis Techniques
- Language and Cultural Evolution
- Algorithms and Data Compression
- Advanced Mathematical Identities
- Opinion Dynamics and Social Influence
- Multimodal Machine Learning Applications
- Semigroups and Automata Theory
- Text and Document Classification Technologies
- Explainable Artificial Intelligence (XAI)
- Biomedical Text Mining and Ontologies
- Hate Speech and Cyberbullying Detection
- Misinformation and Its Impacts
- Lexicography and Language Studies
- Sentiment Analysis and Opinion Mining
- Authorship Attribution and Profiling
- Complex Network Analysis Techniques
- Artificial Intelligence in Healthcare and Education
- Artificial Intelligence in Games
- Handwritten Text Recognition Techniques
University of Mannheim
2023-2024
Bielefeld University
2022-2024
IT University of Copenhagen
2023
Tokyo Institute of Technology
2023
Administration for Community Living
2023
American Jewish Committee
2023
Heidelberg University
2023
University of Haifa
2023
National Research University Higher School of Economics
2023
Technical University of Darmstadt
2016-2022
Wei Zhao, Maxime Peyrard, Fei Liu, Yang Gao, Christian M. Meyer, Steffen Eger. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 2019.
ChatGPT, a chatbot developed by OpenAI, has gained widespread popularity and media attention since its release in November 2022. However, little hard evidence is available regarding its perception in various sources. In this paper, we analyze over 300,000 tweets and more than 150 scientific papers to investigate how ChatGPT is perceived and discussed. Our findings show that ChatGPT is generally viewed as being of high quality, with positive sentiment and emotions of joy dominating social media. Its perception has slightly decreased since its debut, however,...
We investigate neural techniques for end-to-end computational argumentation mining (AM). We frame AM both as a token-based dependency parsing and as a token-based sequence tagging problem, including a multi-task learning setup. Contrary to models that operate on the argument component level, we find that framing AM as dependency parsing leads to subpar performance results. In contrast, less complex (local) tagging models based on BiLSTMs perform robustly across classification scenarios, being able to catch long-range dependencies inherent to the problem. Moreover, jointly...
Obstacles hindering the development of capsule networks for challenging NLP applications include poor scalability to large output spaces and less reliable routing processes. In this paper, we introduce: (i) an agreement score to evaluate the performance of routing processes at instance level; (ii) an adaptive optimizer to enhance the reliability of routing; (iii) capsule compression and partial routing to improve the scalability of capsule networks. We validate our approach on two NLP tasks, namely: multi-label text classification and question answering. Experimental results...
We study unsupervised multi-document summarization evaluation metrics, which require neither human-written reference summaries nor human annotations (e.g. preferences, ratings, etc.). We propose SUPERT, which rates the quality of a summary by measuring its semantic similarity with a pseudo reference summary, i.e. selected salient sentences from the source documents, using contextualized embeddings and soft token alignment techniques. Compared to state-of-the-art unsupervised metrics, SUPERT correlates better with human ratings by 18-39%. Furthermore,...
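The soft token alignment idea behind this kind of unsupervised scoring can be illustrated in a few lines: each summary token embedding is matched to its most similar pseudo-reference token (by cosine similarity) and the matches are averaged. This is only a toy sketch with random vectors standing in for real contextualized embeddings; the paper's actual model and pooling choices differ.

```python
import numpy as np

def soft_align_score(summary_emb, pseudo_ref_emb):
    """Score a summary against a pseudo reference summary:
    each summary token embedding is aligned to its most similar
    reference token embedding (cosine), and the matches are averaged."""
    def normalize(m):
        return m / np.linalg.norm(m, axis=1, keepdims=True)
    s, r = normalize(summary_emb), normalize(pseudo_ref_emb)
    sim = s @ r.T                       # pairwise cosine similarities
    return float(sim.max(axis=1).mean())

# Toy stand-ins for contextualized token embeddings
rng = np.random.default_rng(0)
ref = rng.normal(size=(6, 8))           # pseudo reference: 6 tokens, dim 8
good = ref[:4] + 0.01 * rng.normal(size=(4, 8))  # near-paraphrase summary
bad = rng.normal(size=(4, 8))           # unrelated summary

assert soft_align_score(good, ref) > soft_align_score(bad, ref)
```

A faithful summary's tokens align closely with the pseudo reference, so it scores higher than unrelated text, without any human reference or annotation.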
Christian Stab, Johannes Daxenberger, Chris Stahlhut, Tristan Miller, Benjamin Schiller, Christopher Tauchmann, Steffen Eger, Iryna Gurevych. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations. 2018.
Steffen Eger, Gözde Gül Şahin, Andreas Rücklé, Ji-Ung Lee, Claudia Schulz, Mohsen Mesgar, Krishnkant Swarnkar, Edwin Simpson, Iryna Gurevych. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 2019.
Argument mining has become a popular research area in NLP. It typically includes the identification of argumentative components, e.g. claims, as the central component of an argument. We perform a qualitative analysis across six different datasets and show that these appear to conceptualize claims quite differently. To learn about the consequences of such different conceptualizations of claim for practical applications, we carried out extensive experiments using state-of-the-art feature-rich and deep learning systems,...
Activation functions play a crucial role in neural networks because they are the nonlinearities which have been attributed to the success story of deep learning. One of the currently most popular activation functions is ReLU, but several competitors have recently been proposed or 'discovered', including LReLU and swish. While most works compare newly proposed activation functions on few tasks (usually from image classification) and against few competitors (usually ReLU), we perform the first large-scale comparison of 21 activation functions across eight different NLP tasks. We find that a largely unknown activation function...
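For reference, the three activation functions named in the abstract have compact definitions; a minimal sketch (swish as x * sigmoid(x), LReLU with a common default slope of 0.01 assumed here):

```python
import numpy as np

def relu(x):
    """ReLU: zero for negative inputs, identity otherwise."""
    return np.maximum(0.0, x)

def lrelu(x, alpha=0.01):
    """Leaky ReLU: small linear slope alpha for negative inputs."""
    return np.where(x > 0, x, alpha * x)

def swish(x):
    """Swish: x * sigmoid(x), smooth and non-monotonic near zero."""
    return x / (1.0 + np.exp(-x))

x = np.array([-2.0, 0.0, 2.0])
assert np.allclose(relu(x), [0.0, 0.0, 2.0])
assert np.allclose(lrelu(x), [-0.02, 0.0, 2.0])
```

Unlike ReLU, swish passes a small amount of negative signal and is differentiable everywhere, which is one reason such variants are compared against it.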
Claudia Schulz, Steffen Eger, Johannes Daxenberger, Tobias Kahse, Iryna Gurevych. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers). 2018.
Evaluation of cross-lingual encoders is usually performed either via zero-shot transfer in supervised downstream tasks or via unsupervised cross-lingual textual similarity. In this paper, we concern ourselves with reference-free machine translation (MT) evaluation, where we directly compare source texts to (sometimes low-quality) system translations, which represents a natural adversarial setup for multilingual encoders. Reference-free evaluation holds the promise of web-scale comparison of MT systems. We systematically...
Cross-lingual representations have the potential to make NLP techniques available to the vast majority of languages in the world. However, they currently require large pretraining corpora or access to typologically similar languages. In this work, we address these obstacles by removing language identity signals from multilingual embeddings. We examine three approaches for this: (i) re-aligning the vector spaces of target languages (all together) to a pivot source language; (ii) removing language-specific means and variances, which...
Recently, there has been a growing interest in designing text generation systems from a discourse coherence perspective, e.g., modeling the interdependence between sentences. Still, recent BERT-based evaluation metrics are weak in recognizing coherence, and thus are not a reliable way to spot discourse-level improvements of those systems. In this work, we introduce DiscoScore, a parametrized discourse metric, which uses BERT to model coherence from different perspectives, driven by Centering theory. Our experiments encompass 16...
We consider two graph models of semantic change. The first is a time-series model that relates embedding vectors from one time period to those of previous time periods. In the second, we construct a graph for each word: nodes in this graph correspond to time points, and edge weights to the similarity of the word's meaning across two time points. We apply our models to corpora in three different languages. We find that semantic change is linear in two senses. Firstly, today's embedding vectors (= meaning) of words can be derived as linear combinations of those of their neighbors in previous time periods. Secondly, self-similarity decays linearly...
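The first finding (today's embedding as a linear combination of previous-period embeddings) can be sketched with a least-squares fit; the toy vectors below merely stand in for diachronic embeddings, and the setup assumes more dimensions than neighbors so the fit is well-determined:

```python
import numpy as np

rng = np.random.default_rng(1)
prev = rng.normal(size=(5, 8))     # 5 neighbor embeddings at period t-1 (dim 8)
coeffs = np.array([0.5, 0.2, 0.1, 0.1, 0.1])
now = coeffs @ prev                # the word's embedding at period t

# Recover the linear combination with ordinary least squares
est, *_ = np.linalg.lstsq(prev.T, now, rcond=None)
assert np.allclose(est, coeffs)
```

With real diachronic embeddings the fit is of course only approximate; the paper's claim is that the residual of such linear models stays small.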
Visual modifications to text are often used to obfuscate offensive comments in social media (e.g., "!d10t") or as a writing style ("1337" in "leet speak"), among other scenarios. We consider this as a new type of adversarial attack in NLP, a setting to which humans are very robust, as our experiments with both simple and more difficult visual input perturbations demonstrate. We then investigate the impact of visual adversarial attacks on current NLP systems on character-, word-, and sentence-level tasks, showing that both neural and non-neural models are,...
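A visual perturbation of this kind can be illustrated with a hand-picked homoglyph map; the paper's actual attack samples visually similar characters from a character-embedding space, so this fixed table is only a toy stand-in:

```python
import random

# Hand-picked look-alike substitutions (illustrative only; the real attack
# draws neighbors from a visual character-embedding space)
HOMOGLYPHS = {"a": "@", "e": "3", "i": "1", "o": "0", "l": "1", "t": "7"}

def perturb(text, prob=0.4, seed=42):
    """Replace each mappable character with its look-alike with
    probability `prob`, keeping the text human-readable."""
    rng = random.Random(seed)
    return "".join(
        HOMOGLYPHS[c] if c in HOMOGLYPHS and rng.random() < prob else c
        for c in text
    )

print(perturb("idiot"))   # a visually similar but differently tokenized string
```

Humans read such strings with little effort, while character-level models see entirely different input symbols, which is what makes the attack effective.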
Recently proposed BERT-based evaluation metrics for text generation perform well on standard benchmarks but are vulnerable to adversarial attacks, e.g., relating to information correctness. We argue that this stems (in part) from the fact that they are models of semantic similarity. In contrast, we develop evaluation metrics based on Natural Language Inference (NLI), which we deem a more appropriate modeling. We design a preference-based adversarial attack framework and show that our NLI-based metrics are much more robust to the attacks than recent BERT-based metrics. On...
The vast majority of evaluation metrics for machine translation are supervised, i.e., (i) they are trained on human scores, (ii) they assume the existence of reference translations, or (iii) they leverage parallel data. This hinders their applicability to cases where such supervision signals are not available. In this work, we develop fully unsupervised evaluation metrics. To do so, we leverage similarities and synergies between evaluation metric induction, parallel corpus mining, and MT systems. In particular, we use an unsupervised metric to mine pseudo-parallel data, which we use to remap deficient...
With the advent of large multimodal language models, science is now at a threshold of an AI-based technological transformation. Recently, a plethora of new AI models and tools has been proposed, promising to empower researchers and academics worldwide to conduct their research more effectively and efficiently. This includes all aspects of the research cycle, especially (1) searching for relevant literature; (2) generating research ideas and conducting experimentation; (3) text-based and (4) multimodal content generation (e.g., scientific figures and diagrams); (5)...
Average word embeddings are a common baseline for more sophisticated sentence embedding techniques. However, they typically fall short of the performances of complex models such as InferSent. Here, we generalize the concept of average word embeddings to power mean word embeddings. We show that the concatenation of different types of power mean word embeddings considerably closes the gap to state-of-the-art methods monolingually and substantially outperforms these techniques cross-lingually. In addition, our proposed method outperforms recently proposed baselines such as SIF and Sent2Vec by a solid...
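The power mean generalization is simple to sketch: the plain average is the p=1 case, and the element-wise min and max are the p=-inf and p=+inf limits; concatenating several p values gives the sentence embedding. A minimal sketch with toy 2-dimensional word vectors:

```python
import numpy as np

def power_mean(word_vecs, p):
    """Element-wise power mean over word vectors.
    p=1 is the average; p=+inf / p=-inf are element-wise max / min;
    odd integer p keeps signs well-defined for real-valued embeddings."""
    v = np.asarray(word_vecs, dtype=float)
    if p == np.inf:
        return v.max(axis=0)
    if p == -np.inf:
        return v.min(axis=0)
    if p == 1:
        return v.mean(axis=0)
    m = (v ** p).mean(axis=0)
    return np.sign(m) * np.abs(m) ** (1.0 / p)

def sentence_embedding(word_vecs, ps=(-np.inf, 1, np.inf)):
    """Concatenate power means for several p values."""
    return np.concatenate([power_mean(word_vecs, p) for p in ps])

vecs = [[1, 2], [3, 4], [5, 0]]          # three toy word vectors
emb = sentence_embedding(vecs)           # min | mean | max
assert np.allclose(emb, [1, 0, 3, 2, 5, 4])
```

Each extra p value triples nothing but the dimensionality, yet the concatenation retains strictly more information about the word vector distribution than the mean alone.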
Yang Gao, Steffen Eger, Ilia Kuznetsov, Iryna Gurevych, Yusuke Miyao. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 2019.
Generative large language models (LLMs) have seen many breakthroughs over the last year. With an increasing number of parameters and pre-training data, they have shown remarkable capabilities to solve tasks with minimal or no task-related examples. Notably, LLMs have been successfully employed as evaluation metrics in text generation tasks. Strategies employed in this context differ in the choice of input prompts, the selection of samples for demonstration, and the methodology used to construct scores grading the generations. Approaches often...