NFDI4DS | UHH-SEMS - Publication Details

Thomas Demeester

ORCID: 0000-0002-9901-5768

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5075509168

Research Areas

Topic Modeling
Natural Language Processing Techniques
Web Data Mining and Analysis
Advanced Text Analysis Techniques
Information Retrieval and Search Behavior
Text Readability and Simplification
Esophageal and GI Pathology
Esophageal Cancer Research and Treatment
Sentiment Analysis and Opinion Mining
Biomedical Text Mining and Ontologies
Electromagnetic Compatibility and Noise Suppression
Semantic Web and Ontologies
Data Quality and Management
Electromagnetic Simulation and Numerical Methods
Complex Network Analysis Techniques
Neural Networks and Applications
Multimodal Machine Learning Applications
Logic, Reasoning, and Knowledge
Speech and dialogue systems
Advanced Graph Neural Networks
Gastroesophageal reflux and treatments
Bayesian Modeling and Causal Inference
Expert finding and Q&A systems
Electromagnetic Scattering and Analysis
Adversarial Robustness in Machine Learning

Ghent University
2015-2024

Ghent University Hospital
2008-2024

Hong Kong Polytechnic University
2023

Bangalore University
2023

University of the Basque Country
2023

Nokia (United Kingdom)
2023

Imec the Netherlands
2023

iMinds
2013-2018

University of Southern California
1995-2010

Creative Commons
2009

Barrettʼs Esophagus

OPENALEX - Publications

David B. Skinner BRUNO C. WALTHER Robert H. Riddell HELMUT SCHMIDT CLEMENT IASCONE and 1 more

Using strict criteria for diagnosis, 23 patients having benign Barrett's esophagus, and 20 with adenocarcinoma arising in this epithelium have been analyzed. Evidence supports severe gastroesophageal reflux as a cause of esophagus. Successful antireflux surgery leads to stabilization possibly regression the dysplasia epithelium, can be followed by squamous epithelial regeneration some. Antireflux is advocated all esophagus demonstrated abnormal regardless symptoms. The malignant potential...

10.1097/00000658-198310000-00016 article EN Annals of Surgery 1983-10-01

Joint entity recognition and relation extraction as a multi-head selection problem

OPENALEX - Publications

Giannis Bekoulis Johannes Deleu Thomas Demeester Chris Develder

10.1016/j.eswa.2018.07.032 article EN Expert Systems with Applications 2018-07-17

Representation learning for very short texts using weighted word embedding aggregation

OPENALEX - Publications

Cedric De Boom Steven Van Canneyt Thomas Demeester Bart Dhoedt

10.1016/j.patrec.2016.06.012 article EN Pattern Recognition Letters 2016-06-28

Adversarial training for multi-context joint entity and relation extraction

OPENALEX - Publications

Giannis Bekoulis Johannes Deleu Thomas Demeester Chris Develder

Adversarial training (AT) is a regularization method that can be used to improve the robustness of neural network methods by adding small perturbations in data. We show how use AT for tasks entity recognition and relation extraction. In particular, we demonstrate applying general purpose baseline model jointly extracting entities relations, allows improving state-of-the-art effectiveness on several datasets different contexts (i.e., news, biomedical, real estate data) languages (English Dutch).

10.18653/v1/d18-1307 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2018-01-01

BioLORD-2023: semantic textual representations fusing large language models and clinical knowledge graph insights

OPENALEX - Publications

François Remy Kris Demuynck Thomas Demeester

Abstract Objective In this study, we investigate the potential of large language models (LLMs) to complement biomedical knowledge graphs in training semantic for and clinical domains. Materials Methods Drawing on wealth Unified Medical Language System graph harnessing cutting-edge LLMs, propose a new state-of-the-art approach obtaining high-fidelity representations concepts sentences, consisting 3 steps: an improved contrastive learning phase, novel self-distillation weight averaging phase....

10.1093/jamia/ocae029 article EN cc-by Journal of the American Medical Informatics Association 2024-02-27

Inflammation and Specialized Intestinal Metaplasia of Cardiac Mucosa Is a Manifestation of Gastroesophageal Reflux Disease

OPENALEX - Publications

Stefan Öberg J. H. Peters Thomas Demeester Para Chandrasoma J. A. Hagen and 5 more

Objective The purpose of the study was to test hypothesis that cardiac mucosa, carditis, and specialized intestinal metaplasia at an endoscopically normal-appearing cardia are manifestations gastroesophageal reflux disease. Summary Background Data In absence esophageal mucosal injury, diagnosis disease currently rests on 24-hour pH monitoring. Histologic examination esophagus is not useful. recent identification cardia, along with observation it occurs in inflamed led authors focus type...

10.1097/00000658-199710000-00013 article EN Annals of Surgery 1997-10-01

DeepProbLog: Neural Probabilistic Logic Programming

OPENALEX - Publications

Robin Manhaeve Sebastijan Dumančić Angelika Kimmig Thomas Demeester Luc De Raedt

We introduce DeepProbLog, a probabilistic logic programming language that incorporates deep learning by means of neural predicates. show how existing inference and techniques can be adapted for the new language. Our experiments demonstrate DeepProbLog supports both symbolic subsymbolic representations inference, 1) program induction, 2) (logic) programming, 3) (deep) from examples. To best our knowledge, this work is first to propose framework where general-purpose networks expressive...

10.48550/arxiv.1805.10872 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Lifted Rule Injection for Relation Embeddings

OPENALEX - Publications

Thomas Demeester Tim Rocktäschel Sebastian Riedel

Methods based on representation learning currently hold the state-of-the-art in many natural language processing and knowledge base inference tasks.Yet, a major challenge is how to efficiently incorporate commonsense into such models.A recent approach regularizes relation entity representations by propositionalization of first-order logic rules.However, does not scale beyond domains with only few entities rules.In this paper we present highly efficient method for incorporating implication...

10.18653/v1/d16-1146 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2016-01-01

A Self-Training Approach for Short Text Clustering

OPENALEX - Publications

Amir Hadifar Lucas Sterckx Thomas Demeester Chris Develder

Short text clustering is a challenging problem when adopting traditional bag-of-words or TF-IDF representations, since these lead to sparse vector representations of the short texts. Low-dimensional continuous embeddings can counter that sparseness problem: their high representational power exploited in deep algorithms. While has been studied extensively computer vision, relatively little work focused on NLP. The method we propose, learns discriminative features from both an autoencoder and...

10.18653/v1/w19-4322 article EN cc-by 2019-01-01

Neural probabilistic logic programming in DeepProbLog

OPENALEX - Publications

Robin Manhaeve Sebastijan Dumančić Angelika Kimmig Thomas Demeester Luc De Raedt

10.1016/j.artint.2021.103504 article EN Artificial Intelligence 2021-04-15

Learning Semantic Similarity for Very Short Texts

OPENALEX - Publications

Cedric De Boom Steven Van Canneyt Steven Bohez Thomas Demeester Bart Dhoedt

Levering data on social media, such as Twitter and Facebook, requires information retrieval algorithms to become able relate very short text fragments each other. Traditional similarity methods tf-idf cosine-similarity, based word overlap, mostly fail produce good results in this case, since overlap is little or non-existent. Recently, distributed representations, embeddings, have been shown successfully allow words match the semantic level. In order pair -- a concatenation of separate an...

10.1109/icdmw.2015.86 preprint EN 2015-11-01

Adversarial Sets for Regularising Neural Link Predictors

OPENALEX - Publications

Pasquale Minervini Thomas Demeester Tim Rocktäschel Sebastian Riedel

In adversarial training, a set of models learn together by pursuing competing goals, usually defined on single data instances. However, in relational learning and other non-i.i.d domains, goals can also be over sets For example, link predictor for the is-a relation needs to consistent with transitivity property: if is-a(x_1, x_2) is-a(x_2, x_3) hold, hold as well. Here we use such assumptions deriving an inconsistency loss, measuring degree which model violates adversarially-generated...

10.48550/arxiv.1707.07596 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Topical Word Importance for Fast Keyphrase Extraction

OPENALEX - Publications

Lucas Sterckx Thomas Demeester Johannes Deleu Chris Develder

We propose an improvement on a state-of-the-art keyphrase extraction algorithm, Topical PageRank (TPR), incorporating topical information from topic models. While the original algorithm requires random walk for each in model being used, ours is independent of model, computing but single text regardless amount topics model. This increases speed drastically and enables it use large collections using vast models, while not altering performance algorithm.

10.1145/2740908.2742730 article EN 2015-05-18

Overly optimistic prediction results on imbalanced data: a case study of flaws and benefits when applying over-sampling

OPENALEX - Publications

Gilles Vandewiele Isabelle Dehaene György Kovács Lucas Sterckx Olivier Janssens and 7 more

10.1016/j.artmed.2020.101987 article EN Artificial Intelligence in Medicine 2020-11-20

DWIE: An entity-centric dataset for multi-task document-level information extraction

OPENALEX - Publications

Klim Zaporojets Johannes Deleu Chris Develder Thomas Demeester

10.1016/j.ipm.2021.102563 article EN Information Processing & Management 2021-03-20

Predictive Factors of Barrett Esophagus

OPENALEX - Publications

G.M Campos S Demeester J. H. Peters Stefan Öberg Peter F. Crookes and 5 more

Risk factors for the presence and extent of Barrett esophagus (BE) can be identified in patients with gastroesophageal reflux disease (GERD).Case-comparison study.University tertiary referral center.Five hundred two consecutive GERD documented by 24-hour esophageal pH monitoring complete demographic, endoscopic, physiological evaluation, divided groups according to BE (328 without 174 [67 short-segment 107 long-segment BE]).Clinical, data, studied multivariate analysis, identify independent...

10.1001/archsurg.136.11.1267 article EN Archives of Surgery 2001-11-01

Supervised Keyphrase Extraction as Positive Unlabeled Learning

OPENALEX - Publications

Lucas Sterckx Cornelia Caragea Thomas Demeester Chris Develder

The problem of noisy and unbalanced training data for supervised keyphrase extraction results from the subjectivity assignment, which we quantify by crowdsourcing keyphrases news fashion magazine articles with many annotators per document.We show that exhibit substantial disagreement, meaning single annotator could lead to very different sets extractors.Thus, annotations authors or readers poor performance resulting extractor.We provide a simple but effective solution still work such...

10.18653/v1/d16-1198 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2016-01-01

EduQG: A Multi-Format Multiple-Choice Dataset for the Educational Domain

OPENALEX - Publications

Amir Hadifar Semere Kiros Bitew Johannes Deleu Chris Develder Thomas Demeester

Natural language processing technology has made significant progress in recent years, fuelled by increasingly powerful general models. This also inspired a sizeable body of work targeted specifically towards the educational domain, where creation questions (both for assessment and practice) is laborious/expensive effort. Thus, automatic Question-Generation (QG) solutions have been proposed studied. Yet, according to survey QG community's progress, common baseline dataset unifying multiple...

10.1109/access.2023.3248790 article EN cc-by-nc-nd IEEE Access 2023-01-01

Prior Knowledge Injection into Deep Learning Models Predicting Gene Expression from Whole Slide Images

OPENALEX - Publications

Max Hallemeesch Marija Pizurica Paloma Rabaey Olivier Gevaert Thomas Demeester and 1 more

Cancer diagnosis and prognosis primarily depend on clinical parameters such as age tumor grade, are increasingly complemented by molecular data, gene expression, from sequencing. However, sequencing is costly delays oncology workflows. Recent advances in Deep Learning allow to predict information morphological features within Whole Slide Images (WSIs), offering a cost-effective proxy of the markers. While promising, current methods lack robustness fully replace direct Here we aim improve...

10.48550/arxiv.2501.14056 preprint EN arXiv (Cornell University) 2025-01-23

Quasi-TM Transmission Line Parameters of Coupled Lossy Lines Based on the Dirichlet to Neumann Boundary Operator

OPENALEX - Publications

Thomas Demeester D. De Zutter

<para xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> This paper presents a new multiconductor transmission line model for general 2-D lossy configurations based on mode reciprocity. Particular attention is devoted to elucidate the validity of quasi-TM and approximations that have be invoked obtain this model. A derivation complex capacitance matrix given, especially taking into account presence semiconductors. automatically leads nonclassical...

10.1109/tmtt.2008.925215 article EN IEEE Transactions on Microwave Theory and Techniques 2008-07-01

Taily

OPENALEX - Publications

Robin Aly Djoerd Hiemstra Thomas Demeester

Search engines can improve their efficiency by selecting only few promising shards for each query. State-of-the-art shard selection algorithms first query a central index of sampled documents, and effectiveness is similar to searching all shards. However, the search in also hurts efficiency. Additionally, we show that these approaches varies substantially with documents. This paper proposes Taily, novel algorithm models query's score distribution as Gamma selects highly scored documents tail...

10.1145/2484028.2484033 article EN Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval 2013-07-28

Cdx-2 expression in squamous and metaplastic columnar epithelia of the esophagus

OPENALEX - Publications

Daniel Vallböhmer Steven R. DeMeester Jeffrey H. Peters Daniel Oh Hidekazu Kuramochi and 6 more

The molecular pathogenesis of Barrett's esophagus is poorly understood. Evidence suggests that at a phenotypic level, the metaplastic process begins with transformation squamous epithelium in distal to cardiac mucosa, which subsequently becomes intestinalized. homeobox gene Cdx-2 has been shown be an important transcriptional regulator embryonic differentiation and maintenance adult intestinal type epithelium. We hypothesized expression levels increase normal mucosa intestinalized columnar...

10.1111/j.1442-2050.2006.00586.x article EN Diseases of the Esophagus 2006-07-07

Esophagectomy for cancer in octogenarians

OPENALEX - Publications

Jörg Zehetner John C. Lipham Shahin Ayazi Farzaneh Banki Arzu Oezcelik and 3 more

Because of changes in life expectancy, there is an increasing number elderly patients with esophageal cancer. The aim this study was to assess the outcome esophagectomy for cancer 80 years or older. A retrospective review performed records all who underwent from 1992 2007. cardiac and pulmonary evaluation obtained on individual basis younger octogenarians. Among 560 cancer, 47 (8%) were median age group (n= 513) 63 (interquartile range 56-71). Octogenarians had significantly more stage III...

10.1111/j.1442-2050.2010.01081.x article EN Diseases of the Esophagus 2010-06-10

Federated search in the wild

OPENALEX - Publications

Dong Nguyen Thomas Demeester Dolf Trieschnigg Djoerd Hiemstra

Federated search has the potential of improving web search: user becomes less dependent on a single provider and parts deep become available through unified interface, leading to wider variety in retrieved results. However, publicly dataset for federated reflecting an actual environment been absent. As result, it difficult assess whether proposed systems are suitable setting. We introduce new test collection containing results from more than hundred engines, ranging large general engines...

10.1145/2396761.2398535 article EN 2012-10-29

Coming Soon ...