NFDI4DS | UHH-SEMS - Publication Details

Maciej Ogrodniczuk

ORCID: 0000-0002-3467-9424

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5067499396

Research Areas

Natural Language Processing Techniques
Language and Culture
Topic Modeling
Semantic Web and Ontologies
Literature, Language, and Rhetoric Studies
Speech and dialogue systems
Mathematics, Computing, and Information Processing
linguistics and terminology studies
Linguistics, Language Diversity, and Identity
Lexicography and Language Studies
Text Readability and Simplification
Biomedical Text Mining and Ontologies
Library Science and Information Systems
European and International Law Studies
Advanced Text Analysis Techniques
Linguistic research and analysis
Digital Humanities and Scholarship
Service-Oriented Architecture and Web Services
Legal Language and Interpretation
Language, Metaphor, and Cognition
Image Processing and 3D Reconstruction
Authorship Attribution and Profiling
Digital Rights Management and Security
Speech Recognition and Synthesis
Algorithms and Data Compression

Polish Academy of Sciences
2014-2024

Institute of Computer Science
2014-2024

The Institute of the Polish Language of the Polish Academy of Sciences
2022

Czech Academy of Sciences, Institute of Computer Science
2014-2019

Université de Tours
2014

University of Warsaw
2004

The ParlaMint corpora of parliamentary proceedings

OPENALEX - Publications

Tomaž Erjavec Maciej Ogrodniczuk Petya Osenova Nikola Ljubešić Kiril Simov and 23 more

This paper presents the ParlaMint corpora containing transcriptions of sessions 17 European national parliaments with half a billion words. The are uniformly encoded, contain rich meta-data about 11 thousand speakers, and linguistically annotated following Universal Dependencies formalism named entities. Samples conversion scripts available from project's GitHub repository, complete openly via CLARIN.SI repository for download, as well through NoSketch Engine KonText concordancers Parlameter...

10.1007/s10579-021-09574-0 article EN cc-by Language Resources and Evaluation 2022-02-02

TED Multilingual Discourse Bank (TED-MDB): a parallel corpus annotated in the PDTB style

OPENALEX - Publications

Deniz Zeyrek Amália Mendes Yulia Grishina Murathan Kurfalı Samuel Gibbon and 1 more

10.1007/s10579-019-09445-9 article EN Language Resources and Evaluation 2019-04-06

Korpusomat – a Tool for Creating Searchable Morphosyntactically Tagged Corpora

OPENALEX - Publications

Witold Kieraś Łukasz Kobyliński Maciej Ogrodniczuk

The paper presents Korpusomat, a web application aimed at building annotated corpora for the purpose of corpus linguistic studies.Korpusomat combines existing tools, such as morphological analyser, tagger and search engine, provides an easy-to-use environment technically compatible with National Corpus Polish from almost any text, including texts in binary formats.In we present current state project, its features functionalities, well some future plans developments tasks.A usage example is...

10.12921/cmst.2018.0000005 article EN Computational Methods in Science and Technology 2018-03-31

Co słychać w Jasnopisie? Minęło 10 lat od startu aplikacji

OPENALEX - Publications

Ewa Kozioł-Chrzanowska Włodzimierz Gruszczyński Anna Niepytalska-Osiecka Maciej Ogrodniczuk Monika Buraczyńska and 1 more

10.33896/porj.2025.2.7 article PL cc-by Poradnik Językowy 2025-03-28

The European Language Technology Landscape in 2020: Language-Centric and Human-Centric AI for Cross-Cultural Communication in Multilingual Europe

OPENALEX - Publications

Georg Rehm Katrin Marheinecke Stefanie Hegele Stelios Piperidis Kalina Bontcheva and 42 more

Multilingualism is a cultural cornerstone of Europe and firmly anchored in the European treaties including full language equality. However, barriers impacting business, cross-lingual cross-cultural communication are still omnipresent. Language Technologies (LTs) powerful means to break down these barriers. While last decade has seen various initiatives that created multitude approaches technologies tailored Europe's specific needs, there an immense level fragmentation. At same time, AI...

10.48550/arxiv.2003.13833 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Findings of the Second Shared Task on Multilingual Coreference Resolution

OPENALEX - Publications

Zdeněk Žabokrtský Miloslav Konopík Anna Nedoluzhko Michal Novák Maciej Ogrodniczuk and 4 more

Zdeněk Žabokrtský, Miloslav Konopik, Anna Nedoluzhko, Michal Novák, Maciej Ogrodniczuk, Martin Popel, Ondrej Prazak, Jakub Sido, Daniel Zeman. Proceedings of the CRAC 2023 Shared Task on Multilingual Coreference Resolution. 2023.

10.18653/v1/2023.crac-sharedtask.1 article EN cc-by 2023-01-01

ParlaMint II: advancing comparable parliamentary corpora across Europe

OPENALEX - Publications

Tomaž Erjavec Matyáš Kopp Nikola Ljubešić Taja Kuzman Paul Rayson and 32 more

Abstract The paper presents the results of ParlaMint II project, which comprise comparable corpora parliamentary debates 29 European countries and autonomous regions, covering at least period from 2015 to 2022, containing over 1 billion words. are uniformly encoded, contain rich metadata about their 24 thousand speakers, linguistically annotated up level Universal Dependencies syntax named entities. focuses on enhancement made since I project compilation corpora, including encoding...

10.1007/s10579-024-09798-w article EN cc-by Language Resources and Evaluation 2024-12-28

Findings of the Shared Task on Multilingual Coreference Resolution

OPENALEX - Publications

Zdeněk Žabokrtský Miloslav Konopík Anna Nedoluzhko Michal Novák Maciej Ogrodniczuk and 5 more

This paper presents an overview of the shared task on multilingual coreference resolution associated with CRAC 2022 workshop. Shared participants were supposed to develop trainable systems capable identifying mentions and clustering them according identity coreference. The public edition CorefUD 1.0, which contains 13 datasets for 10 languages, was used as source training evaluation data. CoNLL score in previous coreference-oriented tasks main metric. There 8 prediction submitted by 5...

10.48550/arxiv.2209.07841 preprint EN other-oa arXiv (Cornell University) 2022-01-01

The strategic impact of META-NET on the regional, national and international level

OPENALEX - Publications

Georg Rehm Hans Uszkoreit Sophia Ananiadou Núria Bel Audronė Bielevičienė and 39 more

10.1007/s10579-015-9333-4 article EN Language Resources and Evaluation 2016-01-09

Coming Soon ...