- Natural Language Processing Techniques
- Language and Culture
- Lexicography and Language Studies
- Topic Modeling
- Syntax, Semantics, Linguistic Variation
- Literature, Language, and Rhetoric Studies
- Semantic Web and Ontologies
- Speech and dialogue systems
- Text Readability and Simplification
- linguistics and terminology studies
- Authorship Attribution and Profiling
- Linguistic research and analysis
- Linguistics, Language Diversity, and Identity
- Data Mining Algorithms and Applications
- Historical Linguistics and Language Studies
- Logic, Reasoning, and Knowledge
- Linguistics and Discourse Analysis
- Language, Discourse, Communication Strategies
- Linguistic Studies and Language Acquisition
- Translation Studies and Practices
- Algorithms and Data Compression
- Language, Metaphor, and Cognition
- Advanced Algebra and Logic
- Gender Studies in Language
- Epistemology, Ethics, and Metaphysics
University of Warsaw
2015-2024
Polish Academy of Sciences
2015-2024
Institute of Computer Science
2014-2024
Charles University
2023
Czech Academy of Sciences, Institute of Philosophy
2019-2021
University of Oxford
2019-2021
Czech Academy of Sciences, Institute of Computer Science
2011-2021
TH Bingen University of Applied Sciences
1999-2015
University of Warmia and Mazury in Olsztyn
2010
Institute of Informatics of the Slovak Academy of Sciences
2008
This paper reports on the first shared task statistical parsing of morphologically rich languages (MRLs). The features data sets from nine languages, each available both in constituency and dependency annotation. We report preparation sets, proposed scenarios, evaluation metrics for MRLs given different representation types. present analyze results obtained by participants, then provide an analysis comparison parsers across frameworks, reported gold input as well more realistic scenarios.
–
The paper briefly reexamines arguments for the argument–adjunct dichotomy, commonly assumed in contemporary linguistics, showing that they do not stand up to scrutiny. It demonstrates – perhaps surprisingly LFG currently only assumes this dichotomy its f-structure feature geometry, and does rely on it any crucial way. Building observation, presents a way of getting rid altogether.
The article notes certain weaknesses of current efforts aiming at the standardization POS tagsets for morphologically rich languages and argues that, in order to achieve clear mappings between tagsets, it is necessary have formal rules delimiting POSs grammatical categories within any given tagset. An attempt constructing such a tagset Polish presented.
This paper presents the results of preliminary experiments in automatic extraction definitions (for semi-automatic glossary construction) from usually unstructured or only weakly structured e-learning texts Bulgarian, Czech and Polish. The is performed by regular grammars over XML-encoded morphosyntactically-annotated documents. are less than satisfying we claim that reason for intrinsic difficulty task, as measured low interannotator agreement, which calls more sophisticated deeper...
The aim of this paper is to reexamine the rich repertoire grammatical functions assumed in LFG and provide novel arguments for claim, voiced earlier example Alsina et al. 2005, that most them are redundant. We also demonstrate a textbook test sameness different predicates fails on closer scrutiny. Constructively, we propose more constrained approach functions, which, however, has advantage formalising function hierarchy, analyses diverse phenomena but apparently not previously formalised.
This paper presents recent extensions to Poliqarp, an open source tool for indexing and searching morphosyntactically annotated corpora, which turn it into a certain kinds of treebanks, complementary existing treebank search engines. In particular, the discusses motivation such new tool, extended query syntax Poliqarp implementation efficiency issues.
Abstract The aim of this squib is to question the popular belief that metaphor valency was introduced linguistics by Lucien Tesnière in middle 20th century. Rather, we show it first used Charles Peirce half a century earlier, leading apparently independent – but probably mediated Roman Jakobson ‘discoveries’ linguists Soviet Union, Holland, USA and indeed France.
Abstract The aim of this paper is to critically examine the tests used distinguish arguments from adjuncts in Functional Generative Description (Sgall et al., 1986) and question general usefulness distinction. In particular, we demonstrate that neither two FGD inner participants free adverbials (i.e. based on iterability specificity) stands up scrutiny, also point out practical problems with application dialogue test, semantically obligatory optional dependents. Since these are among most...
Information Extraction (IE) often involves some amount of partial syntactic processing. This is clear in cases interesting high-level IE tasks, such as finding information about who did what to whom (when, where, how and why), but it also true case simpler company names texts. The aim this paper give an overview Slavonic phenomena which pose particular problems for parsing, seem easier treat than Germanic or Romance; I mention various tools have been used the processing Slavonic.