NFDI4DS | UHH-SEMS - Publication Details

Inter-sentence Relation Extraction with Document-level Graph Convolutional Neural Network

OPENALEX - Publications

Sunil Kumar Sahu Fenia Christopoulou Makoto Miwa Sophia Ananiadou

Inter-sentence relation extraction deals with a number of complex semantic relationships in documents, which require local, non-local, syntactic and semantic dependencies. Existing methods do not fully exploit such dependencies. We present a novel inter-sentence relation extraction model that builds a labelled edge graph convolutional neural network model on a document-level graph. The graph is constructed using various inter- and intra-sentence dependencies to capture local and non-local dependency information. In order to predict the relation of an entity pair, we utilise...

10.18653/v1/p19-1423 article EN cc-by 2019-01-01

Connecting the Dots: Document-level Neural Relation Extraction with Edge-oriented Graphs

Fenia Christopoulou Makoto Miwa Sophia Ananiadou

Fenia Christopoulou, Makoto Miwa, Sophia Ananiadou. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 2019.

10.18653/v1/d19-1498 article EN cc-by 2019-01-01

A Walk-based Model on Entity Graphs for Relation Extraction

Fenia Christopoulou Makoto Miwa Sophia Ananiadou

We present a novel graph-based neural network model for relation extraction. Our model treats multiple pairs in a sentence simultaneously and considers interactions among them. All the entities in a sentence are placed as nodes in a fully-connected graph structure. The edges are represented with position-aware contexts around the entity pairs. In order to consider different relation paths between two entities, we construct up to l-length walks between each pair. The resulting walks are merged and iteratively used to update the edge representations into longer...

10.18653/v1/p18-2014 article EN cc-by 2018-01-01

Adverse drug events and medication relation extraction in electronic health records with ensemble deep learning methods

Fenia Christopoulou Thy Thy Tran Sunil Kumar Sahu Makoto Miwa Sophia Ananiadou

Identification of drugs, associated medication entities, and interactions among them are crucial to prevent unwanted effects of drug therapy, known as adverse drug events. This article describes our participation in the n2c2 shared-task in extracting relations between medication-related entities in electronic health records. We proposed an ensemble approach for relation extraction and classification between drugs and medication-related entities. We incorporated state-of-the-art named-entity recognition (NER) models based on bidirectional...

10.1093/jamia/ocz101 article EN cc-by Journal of the American Medical Informatics Association 2019-05-24

PanGu-Coder: Program Synthesis with Function-Level Language Modeling

Fenia Christopoulou Γεράσιμος Λάμπουρας Milan Gritta Guchun Zhang Yinpeng Guo and 17 more

We present PanGu-Coder, a pretrained decoder-only language model adopting the PanGu-Alpha architecture for text-to-code generation, i.e. the synthesis of programming language solutions given a natural language problem description. We train PanGu-Coder using a two-stage strategy: the first stage employs Causal Language Modelling (CLM) to pre-train on raw programming language data, while the second stage uses a combination of Causal Language Modelling and Masked Language Modelling (MLM) training objectives that focus on the downstream task of text-to-code generation and train on loosely curated pairs of natural language program definitions and code functions....

10.48550/arxiv.2207.11280 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Tweester at SemEval-2016 Task 4: Sentiment Analysis in Twitter Using Semantic-Affective Model Adaptation

Elisavet Palogiannidi Athanasia Kolovou Fenia Christopoulou Filippos Kokkinos Elias Iosif and 4 more

Elisavet Palogiannidi, Athanasia Kolovou, Fenia Christopoulou, Filippos Kokkinos, Elias Iosif, Nikolaos Malandrakis, Haris Papageorgiou, Shrikanth Narayanan, Alexandros Potamianos. Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016). 2016.

10.18653/v1/s16-1023 article EN cc-by Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016) 2016-01-01

Distantly Supervised Relation Extraction with Sentence Reconstruction and Knowledge Base Priors

Fenia Christopoulou Makoto Miwa Sophia Ananiadou

Fenia Christopoulou, Makoto Miwa, Sophia Ananiadou. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2021.

10.18653/v1/2021.naacl-main.2 article EN cc-by Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2021-01-01

NERO: a biomedical named-entity (recognition) ontology with a large, annotated corpus reveals meaningful associations through text embedding

Kanix Wang Robert Stevens Halima Alachram Yu Li Larisa Soldatova and 20 more

Machine reading (MR) is essential for unlocking valuable knowledge contained in millions of existing biomedical documents. Over the last two decades 1,2, the most dramatic advances in MR have followed in the wake of critical corpus development 3. Large, well-annotated corpora have been associated with punctuated advances in MR methodology and automated knowledge extraction systems in the same way that ImageNet 4 was fundamental for developing machine vision techniques. This study contributes six components to an advanced, named entity analysis tool...

10.1038/s41540-021-00200-x article EN cc-by npj Systems Biology and Applications 2021-10-20

Comparing neural models for nested and overlapping biomedical event detection

Kurt Junshean Espinosa Panagiotis Georgiadis Fenia Christopoulou Meizhi Ju Makoto Miwa and 1 more

Nested and overlapping events are particularly frequent and informative structures in biomedical event extraction. However, state-of-the-art neural models either neglect those structures during learning or use syntactic features and external tools to detect them. To overcome these limitations, this paper presents and compares two neural models: a novel EXhaustive Neural Network (EXNN) and a Search-Based Neural Network (SBNN) for detection of nested and overlapping events. We evaluate the proposed models as an event detection component in isolation and within a pipeline setting. Evaluation...

10.1186/s12859-022-04746-3 article EN cc-by BMC Bioinformatics 2022-06-02

EntityCS: Improving Zero-Shot Cross-lingual Transfer with Entity-Centric Code Switching

Chenxi Whitehouse Fenia Christopoulou Ignacio Iacobacci

Accurate alignment between languages is fundamental for improving cross-lingual pre-trained language models (XLMs). Motivated by the natural phenomenon of code-switching (CS) in multilingual speakers, CS has been used as an effective data augmentation method that offers language alignment at the word- or phrase-level, in contrast to sentence-level via parallel instances. Existing approaches either use dictionaries or parallel sentences with word-alignment to generate CS data by randomly switching words in a sentence. However, such methods can be...

10.18653/v1/2022.findings-emnlp.499 article EN cc-by 2022-01-01

Mixture of Topic-Based Distributional Semantic and Affective Models

Fenia Christopoulou Eleftheria Briakou Elias Iosif Alexandros Potamianos

Typically, Distributional Semantic Models (DSMs) estimate semantic similarity between words using a single-model, where the multiple senses of polysemous words are conflated in a single representation. Similarly, in textual affective analysis tasks, ambiguous words are usually not treated differently when estimating word affective scores. In this work, a semantic mixture model is proposed enabling the combination of word similarity scores estimated across topic-specific DSMs (TDSMs). Based on the assumption that semantic similarity implies affective similarity, we extend this model to perform...

10.1109/icsc.2018.00036 article EN 2018-01-01

Text-to-Code Generation with Modality-relative Pre-training

Fenia Christopoulou Guchun Zhang Γεράσιμος Λάμπουρας

Large pre-trained language models have recently been expanded and applied to programming language tasks with great success, often through further pre-training of a strictly-natural language model, where training sequences typically contain both natural and (linearised) programming language. Such approaches effectively map both modalities of the sequence into the same embedding space. However, programming language keywords (e.g. "while") often have very strictly defined semantics. As such, transfer learning from their natural language usage may not necessarily be beneficial to their code...

10.48550/arxiv.2402.05783 preprint EN arXiv (Cornell University) 2024-02-08

Human-like Episodic Memory for Infinite Context LLMs

Zafeirios Fountas Martin A Benfeghoul Adnan Oomerjee Fenia Christopoulou Γεράσιμος Λάμπουρας and 2 more

Large language models (LLMs) have shown remarkable capabilities, but still struggle with processing extensive contexts, limiting their ability to maintain coherence and accuracy over long sequences. In contrast, the human brain excels at organising and retrieving episodic experiences across vast temporal scales, spanning a lifetime. In this work, we introduce EM-LLM, a novel approach that integrates key aspects of human episodic memory and event cognition into LLMs, enabling them to effectively handle practically infinite...

10.48550/arxiv.2407.09450 preprint EN arXiv (Cornell University) 2024-07-12

SparsePO: Controlling Preference Alignment of LLMs via Sparse Token Masks

Fenia Christopoulou Ronald Cardenas Γεράσιμος Λάμπουρας Haitham Bou-Ammar Jun Wang

Preference Optimization (PO) has proven an effective step for aligning language models to human-desired behaviors. Current variants, following the offline Direct Preference Optimization objective, have focused on a strict setting where all tokens are contributing signals of KL divergence and rewards to the loss function. However, human preference is not affected by each word in a sequence equally but is often dependent on specific words or phrases, e.g. existence of toxic terms leads to non-preferred responses. Based on this observation,...

10.48550/arxiv.2410.05102 preprint EN arXiv (Cornell University) 2024-10-07

Training Dynamics for Curriculum Learning: A Study on Monolingual and Cross-lingual NLU

Fenia Christopoulou Γεράσιμος Λάμπουρας Ignacio Iacobacci

Curriculum Learning (CL) is a technique of training models via ranking examples in a typically increasing difficulty trend with the aim of accelerating convergence and improving generalisability. Current approaches for Natural Language Understanding (NLU) tasks use CL to improve in-distribution data performance often via heuristic-oriented or task-agnostic difficulties. In this work, instead, we employ CL for NLU by taking advantage of training dynamics as difficulty metrics, i.e., statistics that measure the behavior of the model at hand...

10.18653/v1/2022.emnlp-main.167 article EN cc-by Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing 2022-01-01

Connecting the Dots: Document-level Neural Relation Extraction with Edge-oriented Graphs

Fenia Christopoulou Makoto Miwa Sophia Ananiadou

Document-level relation extraction is a complex human process that requires logical inference to extract relationships between named entities in text. Existing approaches use graph-based neural models with words as nodes and edges as relations between them, to encode relations across sentences. These models are node-based, i.e., they form pair representations based solely on the two target node representations. However, entity relations can be better expressed through unique edge representations formed as paths between nodes. We thus propose an edge-oriented...

10.48550/arxiv.1909.00228 preprint EN other-oa arXiv (Cornell University) 2019-01-01

NERO: A Biomedical Named-entity (Recognition) Ontology with a Large, Annotated Corpus Reveals Meaningful Associations Through Text Embedding

Kanix Wang Robert Stevens Halima Alachram Yu Li Larisa Soldatova and 17 more

Machine reading is essential for unlocking valuable knowledge contained in the millions of existing biomedical documents. Over the last two decades 1,2, the most dramatic advances in machine-reading have followed in the wake of critical corpus development 3. Large, well-annotated corpora have been associated with punctuated advances in machine reading methodology and automated knowledge extraction systems in the same way that ImageNet 4 was fundamental for developing machine vision techniques. This study contributes six components to an advanced, named-entity...

10.1101/2020.11.05.368969 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2020-11-06

Inter-sentence Relation Extraction with Document-level Graph Convolutional Neural Network

Sunil Kumar Sahu Fenia Christopoulou Makoto Miwa Sophia Ananiadou

Inter-sentence relation extraction deals with a number of complex semantic relationships in documents, which require local, non-local, syntactic and semantic dependencies. Existing methods do not fully exploit such dependencies. We present a novel inter-sentence relation extraction model that builds a labelled edge graph convolutional neural network model on a document-level graph. The graph is constructed using various inter- and intra-sentence dependencies to capture local and non-local dependency information. In order to predict the relation of an entity pair, we utilise...

10.48550/arxiv.1906.04684 preprint EN other-oa arXiv (Cornell University) 2019-01-01

A Walk-based Model on Entity Graphs for Relation Extraction

Fenia Christopoulou Makoto Miwa Sophia Ananiadou

We present a novel graph-based neural network model for relation extraction. Our model treats multiple pairs in a sentence simultaneously and considers interactions among them. All the entities in a sentence are placed as nodes in a fully-connected graph structure. The edges are represented with position-aware contexts around the entity pairs. In order to consider different relation paths between two entities, we construct up to l-length walks between each pair. The resulting walks are merged and iteratively used to update the edge representations into longer...

10.48550/arxiv.1902.07023 preprint EN other-oa arXiv (Cornell University) 2019-01-01