NFDI4DS | UHH-SEMS - Publication Details

Laura Kallmeyer

ORCID: 0000-0001-9691-5990

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5004268590

Research Areas

Natural Language Processing Techniques
Topic Modeling
semigroups and automata theory
Speech and dialogue systems
Syntax, Semantics, Linguistic Variation
Algorithms and Data Compression
Semantic Web and Ontologies
Text Readability and Simplification
Logic, programming, and type systems
Speech Recognition and Synthesis
linguistics and terminology studies
Linguistic research and analysis
Multimodal Machine Learning Applications
Logic, Reasoning, and Knowledge
Multi-Agent Systems and Negotiation
Language, Metaphor, and Cognition
Advanced Text Analysis Techniques
Model-Driven Software Engineering Techniques
DNA and Biological Computing
Constraint Satisfaction and Optimization
AI-based Problem Solving and Planning
Sentiment Analysis and Opinion Mining
Categorization, perception, and language
Authorship Attribution and Profiling
Gender Studies in Language

Heinrich Heine University Düsseldorf
2014-2023

University of Pavia
2023

Hochschule Düsseldorf University of Applied Sciences
2016-2021

Deutsche Nationalbibliothek
2021

Association for Computational Linguistics
2021

Chitose Institute of Science and Technology
2020

McGill University
2018

University of Tübingen
2000-2010

Société Française d'Allergologie
2008

Langues, Textes, Traitements Informatiques, Cognition
2003-2005

Multilingual Code-switching Identification via LSTM Recurrent Neural Networks

OPENALEX - Publications

Younes Samih Suraj Maharjan Mohammed Attia Laura Kallmeyer Thamar Solorio

This paper describes the HHU-UH-G system submitted to EMNLP 2016 Second Workshop on Computational Approaches Code Switching.Our ranked first place for Arabic (MSA-Egyptian) with an F1-score of 0.83 and second Spanish-English 0.90.The introduces a novel unified neural network architecture language identification in code-switched tweets both MSA-Egyptian dialect.The makes use word character level representations identify code-switching.For dialect does not rely any kind language-specific...

10.18653/v1/w16-5806 article EN cc-by 2016-01-01

Syntax-driven semantic frame composition in Lexicalized Tree Adjoining Grammars

OPENALEX - Publications

Laura Kallmeyer Rainer Osswald

The grammar framework presented in this paper combines Lexicalized Tree Adjoining Grammar (LTAG) with a (de)compositional frame semantics. We introduce elementary constructions as pairs of LTAG trees and decompositional frames. linking between syntax semantics can largely be captured by such since LTAG, represent full argument projections. Substitution adjunction the then trigger unification associated semantic frames, which are formally defined base-labelled feature structures. Moreover,...

10.15398/jlm.v1i2.61 article EN cc-by Journal of Language Modelling 2014-01-28

A Neural Architecture for Dialectal Arabic Segmentation

OPENALEX - Publications

Younes Samih Mohammed Attia Mohamed I. Eldesouki Ahmed Abdelalí Hamdy Mubarak and 2 more

The automated processing of Arabic Dialects is challenging due to the lack spelling standards and scarcity annotated data resources in general. Segmentation words into its constituent parts an important building block. In this paper, we show how a segmenter can be trained using only 350 tweets neural networks without any normalization or use lexical features resources. We deal with segmentation as sequence labeling problem at character level. experimentally that our model rival...

10.18653/v1/w17-1306 article EN cc-by 2017-01-01

OPENALEX - Publications

Laura Kallmeyer Aravind K. Joshi

10.1023/a:1024564228892 article EN Research on Language and Computation 2003-01-01

Data-Driven Parsing using Probabilistic Linear Context-Free Rewriting Systems

OPENALEX - Publications

Laura Kallmeyer Wolfgang Maier

This paper presents the first efficient implementation of a weighted deductive CYK parser for Probabilistic Linear Context-Free Rewriting Systems (PLCFRSs). LCFRS, an extension CFG, can describe discontinuities in straightforward way and is therefore natural candidate to be used data-driven parsing. To speed up parsing, we use different context-summary estimates parse items, some them allowing A* We evaluate our with grammars extracted from German NeGra treebank. Our experiments show that...

10.1162/coli_a_00136 article EN cc-by Computational Linguistics 2012-11-13

Semantic construction in feature-based TAG

OPENALEX - Publications

Claire Gardent Laura Kallmeyer

We propose a semantic construction method for Feature-Based Tree Adjoining Grammar which is based on the derived tree, compare it with related proposals and briefly discuss some implementation possibilities,

10.3115/1067807.1067825 article EN 2003-01-01

Scope and Situation Binding in LTAG Using Semantic Unification

OPENALEX - Publications

Laura Kallmeyer Maribel Romero

10.1007/s11168-008-9046-6 article EN Research on Language and Computation 2008-03-01

SemEval-2019 Task 2: Unsupervised Lexical Frame Induction

OPENALEX - Publications

Behrang QasemiZadeh Miriam R. L. Petruck Regina Stodden Laura Kallmeyer Marie Candito

This paper presents Unsupervised Lexical Frame Induction, Task 2 of the International Workshop on Semantic Evaluation in 2019. Given a set prespecified syntactic forms context, task requires that verbs and their arguments be clustered to resemble semantic frame structures. Results are useful identifying polysemous words, i.e., those whose structures not easily distinguished, as well discerning relations arguments. unsupervised induction methods fell into two tracks: A) Verb Clustering based...

10.18653/v1/s19-2003 article EN cc-by 2019-01-01

Learning from Relatives: Unified Dialectal Arabic Segmentation

OPENALEX - Publications

Younes Samih Mohamed I. Eldesouki Mohammed Attia Kareem Darwish Ahmed Abdelalí and 2 more

Younes Samih, Mohamed Eldesouki, Mohammed Attia, Kareem Darwish, Ahmed Abdelali, Hamdy Mubarak, Laura Kallmeyer. Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017). 2017.

10.18653/v1/k17-1043 article EN cc-by 2017-01-01

Decoding Prompt Syntax: Analysing its Impact on Knowledge Retrieval in Large Language Models

OPENALEX - Publications

Stephan Linzbach Tim Tressel Laura Kallmeyer Stefan Dietze Hajira Jabeen

Large Language Models (LLMs), with their advanced architectures and training on massive language datasets, contain unexplored knowledge. One method to infer this knowledge is through the use of cloze-style prompts. Typically, these prompts are manually designed because phrasing impacts retrieval performance, even if LLM encodes desired information. In paper, we study impact prompt syntax capacity LLMs. We a template-based approach paraphrase simple into more complex grammatical structure....

10.1145/3543873.3587655 article EN 2023-04-28

Tree-Local Multicomponent Tree-Adjoining Grammars with Shared Nodes

OPENALEX - Publications

Laura Kallmeyer

This article addresses the problem that expressive power of tree-adjoining grammars (TAGs) is too limited to deal with certain syntactic phenomena, in particular, scrambling free-word-order languages. The TAG variants proposed so far order account for are not entirely satisfying. Therefore, introduces an alternative extension based on notion node sharing, so-called (restricted) tree-local multicomponent shared nodes (RSN-MCTAG). analysis some German data sketched show this can scrambling....

10.1162/0891201054223968 article EN cc-by-nc-nd Computational Linguistics 2005-06-01

SAWT: Sequence Annotation Web Tool

OPENALEX - Publications

Younes Samih Wolfgang Maier Laura Kallmeyer

We present SAWT, a web-based tool for the annotation of token sequences with an arbitrary set labels.The key property is simplicity and ease use both annotators administrators.SAWT runs in any modern browser, including browsers on mobile devices, only has minimal server-side requirements.

10.18653/v1/w16-5808 article EN cc-by 2016-01-01

TS-ANNO: An Annotation Tool to Build, Annotate and Evaluate Text Simplification Corpora

OPENALEX - Publications

Regina Stodden Laura Kallmeyer

We introduce TS-ANNO, an open-source web application for manual creation and evaluation of parallel corpora text simplification. TS-ANNO can be used i) sentence–wise alignment, ii) rating alignment pairs (e.g., w.r.t. grammaticality, meaning preservation, ...), iii) annotating simplification transformations lexical substitution, sentence splitting, iv) complex documents. For evaluation, calculates inter-annotator agreement alignments (i) annotations (ii).

10.18653/v1/2022.acl-demo.14 article EN cc-by 2022-01-01

Probing for Constituency Structure in Neural Language Models

OPENALEX - Publications

David Arps Younes Samih Laura Kallmeyer Hassan Sajjad

In this paper, we investigate to which extent contextual neural language models (LMs) implicitly learn syntactic structure. More concretely, focus on constituent structure as represented in the Penn Treebank (PTB). Using standard probing techniques based diagnostic classifiers, assess accuracy of representing constituents different categories within neuron activations a LM such RoBERTa. order make sure that our probe focuses knowledge and not implicit semantic generalizations, also...

10.18653/v1/2022.findings-emnlp.502 article EN cc-by 2022-01-01

TuLiPA

OPENALEX - Publications

Laura Kallmeyer Timm Lichte Wolfgang Maier Yannick Parmentier Johannes Dellert and 1 more

In this paper, we present an open-source parsing environment (Tübingen Linguistic Parsing Architecture, TuLiPA) which uses Range Concatenation Grammar (RCG) as a pivot formalism, thus opening the way to of several mildly context-sensitive formalisms. This currently supports tree-based grammars (namely Tree-Adjoining Grammars (TAG) and Multi-Component with Tree Tuples (TT-MCTAG)) allows computation not only syntactic structures, but also corresponding semantic representations. It is used for...

10.3115/1611546.1611547 article EN 2008-01-01

A Declarative Characterization of Different Types of Multicomponent Tree Adjoining Grammars

OPENALEX - Publications

Laura Kallmeyer

10.1007/s11168-009-9064-z article EN Research on Language and Computation 2009-03-01

Coming Soon ...