Jürgen Umbrich

ORCID: 0000-0002-3178-6910
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Semantic Web and Ontologies
  • Advanced Database Systems and Queries
  • Data Quality and Management
  • Web Data Mining and Analysis
  • Service-Oriented Architecture and Web Services
  • Data Management and Algorithms
  • Biomedical Text Mining and Ontologies
  • Advanced Graph Neural Networks
  • E-Government and Public Services
  • Natural Language Processing Techniques
  • Digital Rights Management and Security
  • Scientific Computing and Data Management
  • Peer-to-Peer Network Technologies
  • Privacy, Security, and Data Protection
  • Caching and Content Delivery
  • Library Science and Information Systems
  • Topic Modeling
  • Data Mining Algorithms and Applications
  • Big Data and Business Intelligence
  • Advanced Data Storage Technologies
  • Algorithms and Data Compression
  • Distributed systems and fault tolerance
  • Optimization and Search Problems
  • Research Data Management Practices
  • Technology Adoption and User Behaviour

Vienna University of Economics and Business
2014-2018

Ollscoil na Gaillimhe – University of Galway
2006-2013

Enterprise Ireland
2007-2008

Karlsruhe Institute of Technology
2007

Typical approaches for querying structured Web Data collect (crawl) and pre-process (index) large amounts of data in a central repository before allowing query answering. However, this time-consuming pre-processing phase however leverages the benefits Linked -- where is accessible live up-to-date at distributed resources that may change constantly only to limited degree, as results can never be current. An ideal answering system should return current answers reasonable amount time, even on...

10.1145/1772690.1772733 article EN 2010-04-26

The Open Data movement has become a driver for publicly available data on the Web. More and more data—from governments public institutions but also from private sector—are made online are mainly published in so-called portals. However, with increasing number of resources, there is concerns regards to quality sources corresponding metadata, which compromise searchability, discoverability, usability resources. In order get complete picture severity these issues, present work aims at developing...

10.1145/2964909 article EN Journal of Data and Information Quality 2016-10-25

Despite the enthusiasm caused by availability of a steadily increasing amount openly available, structured data, first critical voices appear addressing emerging issue low quality in meta data and source Open Data portals which is serious risk that could disrupt project. However, there exist no comprehensive reports about actual portals. In this work, we present our efforts to monitor assess 82 active portals, powered organisations across 35 different countries. We discuss metrics report...

10.1109/ficloud.2015.82 article EN 2015-08-01

We describe SPARQLES: an online system that monitors the health of public SPARQL endpoints on Web by probing them with custom-designed queries at regular intervals. present architecture SPARQLES and variety analytics it runs over endpoints, categorised ava ilability, discoverability, performance interoperability. also detail interfaces provides for human software agents to learn more about recent history current state individual endpoint or overall trends concerning maturity all monitored...

10.3233/sw-170254 article EN Semantic Web 2017-01-20

In recent years, Open Data has gained considerable attention: a steady growth in the number of openly published datasets - mainly by governments and public administrations can be observed as demand for rises. However, many potential providers are still hesitant to open their at same time users often face difficulties when attempting use this data practice. This indicates that there various barriers present both regarding usage publishing Data, but studies systematically collect assess these...

10.1109/cedem.2017.22 article EN 2017-05-01

This work analyzes an Open Data corpus containing 200K tabular resources with a total file size of 413 GB from data consumer perspective. Our study shows that ~10% the in portals are labelled as which only 50% can be considered CSV files. The inspects general shape these data, reports on column and row distribution, analyses availability (multiple) header rows if contains multiple tables. In addition, we inspect analyze table types, detect missing values report about distribution values.

10.1109/obd.2016.18 article EN 2016-08-01
Coming Soon ...