- Natural Language Processing Techniques
- Linguistics and terminology studies
- Topic Modeling
- Lexicography and Language Studies
- Semantic Web and Ontologies
- Speech and dialogue systems
- Language, Metaphor, and Cognition
- Literature, Language, and Rhetoric Studies
- Syntax, Semantics, Linguistic Variation
- Categorization, perception, and language
Plovdiv University
2023-2024
Institute for Bulgarian Language
2012-2014
Bulgarian Academy of Sciences
2012
Abstract: The extent to which languages share properties reflecting the non-linguistic constraints of the speakers who speak them is key to the debate regarding the relationship between language and cognition. A critical case is spatial communication, where it has been argued that semantic universals should exist, if anywhere. Here, using an experimental paradigm able to separate variation within a language from variation between languages, we tested the use of demonstratives, the most fundamental and frequent terms across languages. In n = 874 29...
The paper discusses several key concepts related to the development of corpora and reconsiders them in light of recent developments in NLP. On the basis of an overview of present-day corpora, we conclude that the dominant practices of corpus design do not adequately utilise these technologies and, as a result, fail to meet the demands of corpus linguistics, computational lexicology and computational linguistics alike. We proceed to lay out a data-driven approach to corpus design, which integrates the best of traditional practices with the potential of the latest technologies, allowing fast collection, automatic...
The paper presents Anaphora, an OS- and language-independent tool for clause annotation and alignment, developed at the Department of Computational Linguistics, Institute for Bulgarian Language, Bulgarian Academy of Sciences. The tool supports automated sentence splitting and alignment, as well as modes for manual monolingual annotation and multilingual alignment of sentences and clauses. Anaphora has been successfully applied to the Bulgarian-English Sentence- and Clause-Aligned Corpus (Koeva et al. 2012a) and to a number of other languages, including French and Spanish.