Paul Groth

ORCID: 0000-0003-0183-6910
Research Areas
  • Scientific Computing and Data Management
  • Semantic Web and Ontologies
  • Research Data Management Practices
  • Data Quality and Management
  • Distributed and Parallel Computing Systems
  • Topic Modeling
  • Advanced Graph Neural Networks
  • Biomedical Text Mining and Ontologies
  • Natural Language Processing Techniques
  • Service-Oriented Architecture and Web Services
  • Complex Network Analysis Techniques
  • Advanced Database Systems and Queries
  • Advanced Data Storage Technologies
  • Bioinformatics and Genomic Networks
  • Web Data Mining and Analysis
  • Data Mining Algorithms and Applications
  • Big Data and Business Intelligence
  • Distributed systems and fault tolerance
  • Access Control and Trust
  • Advanced Text Analysis Techniques
  • Software Engineering Research
  • Data Management and Algorithms
  • History and advancements in chemistry
  • Computational Drug Discovery Methods
  • Machine Learning and Data Classification

University of Amsterdam
2013-2025

Amsterdam University of the Arts
2021-2025

RELX Group (Netherlands)
2007-2022

Vrije Universiteit Amsterdam
2010-2022

University of Manchester
2022

The University of Queensland
2022

ZB MED - Information Centre for Life Sciences
2022

Vlaams Instituut voor Biotechnologie
2022

VIB-UGent Center for Plant Systems Biology
2022

Universidad Politécnica de Madrid
2022

There is an urgent need to improve the infrastructure supporting the reuse of scholarly data. A diverse set of stakeholders, representing academia, industry, funding agencies, and publishers, have come together to design and jointly endorse a concise and measurable set of principles that we refer to as the FAIR Data Principles. The intent is that these may act as a guideline for those wishing to enhance the reusability of their data holdings. Distinct from peer initiatives that focus on the human scholar, the FAIR Principles put specific emphasis on enhancing the ability...

10.1038/sdata.2016.18 article EN cc-by Scientific Data 2016-03-15

As the amount of scholarly communication increases, it is increasingly difficult for specific core scientific statements to be found, connected and curated. Additionally, the redundancy of these statements in multiple fora makes it difficult to determine attribution, quality and provenance. To tackle these challenges, the Concept Web Alliance has promoted the notion of nanopublications (core scientific statements with associated context). In this document, we present a model of nanopublications along with a Named Graph/RDF serialization of the model. Importantly, the model is defined completely using already existing...

10.3233/isu-2010-0613 article EN other-oa Information Services & Use 2010-09-21

What paper should I read next? Who should I talk to at a conference? Which research group should get this grant? Researchers and funders alike must make daily judgments on how best to spend their limited time and money, judgments that are becoming increasingly difficult as the volume of scholarly communication increases. Not only does the number of scholarly papers continue to grow, it is joined by new forms of communication, from data publications to microblog posts. To deal with incoming information, scholars have always relied upon filters. At first...

10.1371/journal.pone.0048753 article EN cc-by PLoS ONE 2012-11-01

Generating value from data requires the ability to find, access and make sense of datasets. There are many efforts underway to encourage data sharing and reuse, from scientific publishers asking authors to submit data alongside manuscripts to data marketplaces, open data portals and data communities. Google recently beta-released a search service for datasets, which allows users to discover data stored in various online repositories via keyword queries. These developments foreshadow an emerging research field around dataset search or retrieval that...

10.1007/s00778-019-00564-x article EN cc-by The VLDB Journal 2019-08-24

An increasing number of researchers support reproducibility by including pointers to and descriptions of datasets, software and methods in their publications. However, scientific articles may be ambiguous, incomplete and difficult to process by automated systems. In this paper we introduce RO-Crate, an open, community-driven, and lightweight approach to packaging research artefacts along with their metadata in a machine readable manner. RO-Crate is based on Schema.org annotations in JSON-LD, aiming to establish best practices...

10.3233/ds-210053 article EN cc-by-nc Data Science 2022-01-04

It would include details of the processes that produced electronic data as far back as the beginning of time, or at least the epoch of provenance awareness.

10.1145/1330311.1330323 article EN Communications of the ACM 2008-04-01

Describes the Wings intelligent workflow system that assists scientists with designing computational experiments by automatically tracking constraints and ruling out invalid designs, letting them focus on their goals.

10.1109/mis.2010.9 article EN IEEE Intelligent Systems 2010-01-26

The World Wide Web is now deeply intertwined with our lives, and has become a catalyst for a data deluge, making vast amounts of data available online, at the click of a button. With Web 2.0, users are...

10.2200/s00528ed1v01y201308wbe007 article EN Synthesis lectures on the semantic web 2013-09-15

The PROV family of documents is the final output of the World Wide Web Consortium Provenance Working Group, chartered to specify a representation of provenance to facilitate its exchange over the Web. This article reflects upon the key requirements, guiding principles, and design decisions that influenced the PROV family of documents. A broad range of requirements were found, relating to the key concepts necessary for describing provenance, such as resources, activities, agents and events, and to balancing PROV's ease of use with the facility to check its validity. By...

10.1016/j.websem.2015.04.001 article EN cc-by Journal of Web Semantics 2015-04-20

Knowledge Graphs (KG) are of vital importance for multiple applications on the web, including information retrieval, recommender systems, and metadata annotation.

10.1145/3442381.3450141 preprint EN 2021-04-19

Entity alignment (EA) is the task of identifying the entities that refer to the same real-world object but are located in different knowledge graphs (KGs). For entities to be aligned, existing EA solutions treat them separately and generate alignment results as ranked lists of entities on the other side. Nevertheless, this decision-making paradigm fails to take into account the interdependence among entities. Although some recent efforts mitigate this issue by imposing the 1-to-1 constraint on the alignment process, they still cannot adequately model the underlying interdependence, and the results tend...

10.1145/3446428 article EN ACM transactions on office information systems 2021-05-05

Digital Twins (DT) facilitate monitoring and reasoning processes in cyber–physical systems. They have progressively gained popularity over the past years because of intense research activity and industrial advancements. Cognitive Twins is a novel concept, recently coined to refer to the involvement of Semantic Web technology in DTs. Recent studies address the relevance of ontologies and knowledge graphs in the context of DTs, in terms of knowledge representation, interoperability and automatic reasoning. However, there is no comprehensive analysis of how...

10.1016/j.future.2023.12.013 article EN cc-by Future Generation Computer Systems 2023-12-19