- Topic Modeling
- Natural Language Processing Techniques
- Web Data Mining and Analysis
- Advanced Text Analysis Techniques
- Information Retrieval and Search Behavior
- Text Readability and Simplification
- Esophageal and GI Pathology
- Esophageal Cancer Research and Treatment
- Sentiment Analysis and Opinion Mining
- Biomedical Text Mining and Ontologies
- Electromagnetic Compatibility and Noise Suppression
- Semantic Web and Ontologies
- Data Quality and Management
- Electromagnetic Simulation and Numerical Methods
- Complex Network Analysis Techniques
- Neural Networks and Applications
- Multimodal Machine Learning Applications
- Logic, Reasoning, and Knowledge
- Speech and dialogue systems
- Advanced Graph Neural Networks
- Gastroesophageal reflux and treatments
- Bayesian Modeling and Causal Inference
- Expert finding and Q&A systems
- Electromagnetic Scattering and Analysis
- Adversarial Robustness in Machine Learning
Ghent University
2015-2024
Ghent University Hospital
2008-2024
Hong Kong Polytechnic University
2023
Bangalore University
2023
University of the Basque Country
2023
Nokia (United Kingdom)
2023
Imec the Netherlands
2023
iMinds
2013-2018
University of Southern California
1995-2010
Creative Commons
2009
Using strict criteria for diagnosis, 23 patients having benign Barrett's esophagus, and 20 with adenocarcinoma arising in this epithelium have been analyzed. Evidence supports severe gastroesophageal reflux as a cause of esophagus. Successful antireflux surgery leads to stabilization possibly regression the dysplasia epithelium, can be followed by squamous epithelial regeneration some. Antireflux is advocated all esophagus demonstrated abnormal regardless symptoms. The malignant potential...
Adversarial training (AT) is a regularization method that can be used to improve the robustness of neural network methods by adding small perturbations in data. We show how use AT for tasks entity recognition and relation extraction. In particular, we demonstrate applying general purpose baseline model jointly extracting entities relations, allows improving state-of-the-art effectiveness on several datasets different contexts (i.e., news, biomedical, real estate data) languages (English Dutch).
Abstract Objective In this study, we investigate the potential of large language models (LLMs) to complement biomedical knowledge graphs in training semantic for and clinical domains. Materials Methods Drawing on wealth Unified Medical Language System graph harnessing cutting-edge LLMs, propose a new state-of-the-art approach obtaining high-fidelity representations concepts sentences, consisting 3 steps: an improved contrastive learning phase, novel self-distillation weight averaging phase....
Objective The purpose of the study was to test hypothesis that cardiac mucosa, carditis, and specialized intestinal metaplasia at an endoscopically normal-appearing cardia are manifestations gastroesophageal reflux disease. Summary Background Data In absence esophageal mucosal injury, diagnosis disease currently rests on 24-hour pH monitoring. Histologic examination esophagus is not useful. recent identification cardia, along with observation it occurs in inflamed led authors focus type...
We introduce DeepProbLog, a probabilistic logic programming language that incorporates deep learning by means of neural predicates. show how existing inference and techniques can be adapted for the new language. Our experiments demonstrate DeepProbLog supports both symbolic subsymbolic representations inference, 1) program induction, 2) (logic) programming, 3) (deep) from examples. To best our knowledge, this work is first to propose framework where general-purpose networks expressive...
Methods based on representation learning currently hold the state-of-the-art in many natural language processing and knowledge base inference tasks.Yet, a major challenge is how to efficiently incorporate commonsense into such models.A recent approach regularizes relation entity representations by propositionalization of first-order logic rules.However, does not scale beyond domains with only few entities rules.In this paper we present highly efficient method for incorporating implication...
Short text clustering is a challenging problem when adopting traditional bag-of-words or TF-IDF representations, since these lead to sparse vector representations of the short texts. Low-dimensional continuous embeddings can counter that sparseness problem: their high representational power exploited in deep algorithms. While has been studied extensively computer vision, relatively little work focused on NLP. The method we propose, learns discriminative features from both an autoencoder and...
Levering data on social media, such as Twitter and Facebook, requires information retrieval algorithms to become able relate very short text fragments each other. Traditional similarity methods tf-idf cosine-similarity, based word overlap, mostly fail produce good results in this case, since overlap is little or non-existent. Recently, distributed representations, embeddings, have been shown successfully allow words match the semantic level. In order pair -- a concatenation of separate an...
In adversarial training, a set of models learn together by pursuing competing goals, usually defined on single data instances. However, in relational learning and other non-i.i.d domains, goals can also be over sets For example, link predictor for the is-a relation needs to consistent with transitivity property: if is-a(x_1, x_2) is-a(x_2, x_3) hold, hold as well. Here we use such assumptions deriving an inconsistency loss, measuring degree which model violates adversarially-generated...
We propose an improvement on a state-of-the-art keyphrase extraction algorithm, Topical PageRank (TPR), incorporating topical information from topic models. While the original algorithm requires random walk for each in model being used, ours is independent of model, computing but single text regardless amount topics model. This increases speed drastically and enables it use large collections using vast models, while not altering performance algorithm.
Risk factors for the presence and extent of Barrett esophagus (BE) can be identified in patients with gastroesophageal reflux disease (GERD).Case-comparison study.University tertiary referral center.Five hundred two consecutive GERD documented by 24-hour esophageal pH monitoring complete demographic, endoscopic, physiological evaluation, divided groups according to BE (328 without 174 [67 short-segment 107 long-segment BE]).Clinical, data, studied multivariate analysis, identify independent...
The problem of noisy and unbalanced training data for supervised keyphrase extraction results from the subjectivity assignment, which we quantify by crowdsourcing keyphrases news fashion magazine articles with many annotators per document.We show that exhibit substantial disagreement, meaning single annotator could lead to very different sets extractors.Thus, annotations authors or readers poor performance resulting extractor.We provide a simple but effective solution still work such...
Natural language processing technology has made significant progress in recent years, fuelled by increasingly powerful general models. This also inspired a sizeable body of work targeted specifically towards the educational domain, where creation questions (both for assessment and practice) is laborious/expensive effort. Thus, automatic Question-Generation (QG) solutions have been proposed studied. Yet, according to survey QG community's progress, common baseline dataset unifying multiple...
Cancer diagnosis and prognosis primarily depend on clinical parameters such as age tumor grade, are increasingly complemented by molecular data, gene expression, from sequencing. However, sequencing is costly delays oncology workflows. Recent advances in Deep Learning allow to predict information morphological features within Whole Slide Images (WSIs), offering a cost-effective proxy of the markers. While promising, current methods lack robustness fully replace direct Here we aim improve...
<para xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> This paper presents a new multiconductor transmission line model for general 2-D lossy configurations based on mode reciprocity. Particular attention is devoted to elucidate the validity of quasi-TM and approximations that have be invoked obtain this model. A derivation complex capacitance matrix given, especially taking into account presence semiconductors. automatically leads nonclassical...
Search engines can improve their efficiency by selecting only few promising shards for each query. State-of-the-art shard selection algorithms first query a central index of sampled documents, and effectiveness is similar to searching all shards. However, the search in also hurts efficiency. Additionally, we show that these approaches varies substantially with documents. This paper proposes Taily, novel algorithm models query's score distribution as Gamma selects highly scored documents tail...
The molecular pathogenesis of Barrett's esophagus is poorly understood. Evidence suggests that at a phenotypic level, the metaplastic process begins with transformation squamous epithelium in distal to cardiac mucosa, which subsequently becomes intestinalized. homeobox gene Cdx-2 has been shown be an important transcriptional regulator embryonic differentiation and maintenance adult intestinal type epithelium. We hypothesized expression levels increase normal mucosa intestinalized columnar...
Because of changes in life expectancy, there is an increasing number elderly patients with esophageal cancer. The aim this study was to assess the outcome esophagectomy for cancer 80 years or older. A retrospective review performed records all who underwent from 1992 2007. cardiac and pulmonary evaluation obtained on individual basis younger octogenarians. Among 560 cancer, 47 (8%) were median age group (n= 513) 63 (interquartile range 56-71). Octogenarians had significantly more stage III...
Federated search has the potential of improving web search: user becomes less dependent on a single provider and parts deep become available through unified interface, leading to wider variety in retrieved results. However, publicly dataset for federated reflecting an actual environment been absent. As result, it difficult assess whether proposed systems are suitable setting. We introduce new test collection containing results from more than hundred engines, ranging large general engines...