- Particle physics theoretical and experimental studies
- High-Energy Particle Collisions Research
- Quantum Chromodynamics and Particle Interactions
- Semantic Web and Ontologies
- Logic, Reasoning, and Knowledge
- Particle Detector Development and Performance
- Topic Modeling
- Logic, programming, and type systems
- Advanced Database Systems and Queries
- Natural Language Processing Techniques
- Advanced Algebra and Logic
- Dark Matter and Cosmic Phenomena
- Advanced Text Analysis Techniques
- Data Management and Algorithms
- Data Quality and Management
- Web Data Mining and Analysis
- Formal Methods in Verification
- Text and Document Classification Technologies
- Computational and Text Analysis Methods
- Information Retrieval and Search Behavior
- Superconducting Materials and Applications
- Particle Accelerators and Free-Electron Lasers
- Multi-Agent Systems and Negotiation
- Algorithms and Data Compression
- Cosmology and Gravitation Theories
University of Amsterdam
2015-2024
Vrije Universiteit Amsterdam
1999-2022
Amsterdam University of the Arts
2009-2022
Aixtron (Germany)
2020
University of Washington
2014-2017
Ludwig-Maximilians-Universität München
2011-2016
The University of Adelaide
2014-2016
Humboldt-Universität zu Berlin
2016
Seattle University
2014-2015
University of Duisburg-Essen
2014
We study the effect on click-through rates of applying textual and stylistic features often related to clickbait headlines newspaper articles which can be bought in a digital environment. Having dataset consisting triples—original headline, rewritten CTR, where CTR is rate headline newsletter from online kiosk Blendle—we directly measure whether these "clickbait features" do what they are believed do: entice readers click them. The main findings as follows. First, data shows that editors...
Abstract Hybrid languages are expansions of propositional modal which can refer to (or even quantify over) worlds. The use strong hybrid dates back at least [Pri67], but recent work (for example [BS98, BT98a, BT99]) has focussed on a more constrained system called H (↓, @). We show in detail that @) is modally natural. begin by studying its expressivity, and provide model theoretic characterizations (via restricted notion Ehrenfeucht-Fraïssé game, an enriched bisimulation) syntactic...
This paper presents the ParlaMint corpora containing transcriptions of sessions 17 European national parliaments with half a billion words. The are uniformly encoded, contain rich meta-data about 11 thousand speakers, and linguistically annotated following Universal Dependencies formalism named entities. Samples conversion scripts available from project's GitHub repository, complete openly via CLARIN.SI repository for download, as well through NoSketch Engine KonText concordancers Parlameter...
XPath 1.0 is a variable free language designed to specify paths between nodes in XML documents. Such can alternatively be specified first-order logic. The logical abstraction of 1.0, usually called Navigational or Core XPath, not powerful enough express every definable path. In this article, we show that there exists natural expansion which path document trees expressible. This Conditional XPath. It contains additional axis relations the form (child::n[F])+, denoting transitive closure...
Access control for XML documents is a non-trivial topic, as can be witnessed from the number of approaches presented in literature. Trying to compare these, we discovered need simple, clearand unambiguous language state declarative semantics an access policy. All current natural language, which has none above properties. This makes it hard assess whether proposed algorithms are correct (i.e., really implement described semantics). It also policy on its merits, and others (for file systems...
Abstract This paper describes the digitization and enrichment of Canadian House Commons English Debates from 1901 to present. We start by laying out general framework in which this project took place then present structure database provide guidelines prospective users. The concludes with introduction www.lipad.ca , an online platform designed as a hub for archiving political data, parliamentary proceedings at centre its architecture.
We give semantic characterizations of the expressive power navigational XPath (a.k.a. Core XPath) in terms first order logic. can be used to specify sets nodes and paths an XML document tree. consider both uses. For nodes, is equally as logic two variables. paths, defined using four simple connectives, which together yield class definable relations are safe for bisimulation. Furthermore, we a characterization expressible conjunctive queries.
Recently, researchers started to pay attention the detection of temporal shifts in meaning words. However, most (if not all) these approaches restricted their efforts uncovering change over time, thus neglecting other valuable dimensions such as social or political variability. We propose an approach for detecting semantic between different viewpoints---broadly defined a set texts that share specific metadata feature, which can be time-period, but also entity party. For each viewpoint, we...
XPath is the W3C -- standard node addressing language for XML documents. still under development and its technical aspects are intensively studied. What missing at present a clear characterization of expressive power XPath, be it either semantical or with reference to some well established existing (logical) formalism. Core (the logical core 1.0 defined by Gottlob et al.) cannot express queries conditional paths as exemplified "do child step, while test true resulting node." In first-order...
This paper is about a special version of PDL, proposed by Marcus Kracht, for reasoning sibling ordered trees. It has four basic programs corresponding to the child, parent, left- and right-sibling relations in such The original motivation this language rooted field model-theoretic syntax. Motivated recent developments area semi-structured data, and, especially, query languages XML (eXtensible Markup Language) documents, we revisit language. renewed interest comes with focus on complexity...
Several on-line daily newspapers offer readers the opportunity to directly comment on articles.In Netherlands this feature is used quite often and quality (grammatically content-wise) surprisingly high.We develop techniques collect, store, enrich analyze these comments.After giving a high-level overview of Dutch 'commentosphere' we zoom in extracting discussion structure found flat threads; people not only news article, they also heavily other comments, resembling fora.We show how from...
Identifying authors of short texts on Internet or social media based communication systems is an important tool against fraud and cybercrimes. Besides the challenges raised by limited length these messages, evolving language writing styles makes authorship attribution difficult. Most current text approaches only address challenge length. However, neglecting second may lead to poor performance for who change their styles.
In this article, an optimized carbon-doped AlGaN/AlN super-lattice (SL) buffer structure for GaN-based high electron mobility transistors, grown on 200-mm Si wafers is demonstrated. The resulting transistor features: 1) maximum vertical breakdown strength as 2.72 MV/cm, 2) voltages (BVs) above 1.2 kV, 3) lateral BVs 2.2 4) reduction in traps, which expected to result low-dynamic RON, and 5) more than 50 years of extrapolated lifetime at 150 °C under 650-V bias. These were achieved by...
Abstract Efficiently exploiting all sources of information such as labeled instances, classes’ representation, and relations them has a high impact on the performance Multi-Label Text Classification (MLTC) systems. Most current approaches use documents primary source for MLTC. We investigate effectiveness different information— training data, textual labels classes, taxonomy classes— More specifically, first, each document–class pair, features are extracted using information. The reflect...