Uta Störl

ORCID: 0000-0003-2771-142X
Research Areas
  • Advanced Database Systems and Queries
  • Cloud Computing and Resource Management
  • Distributed systems and fault tolerance
  • Data Quality and Management
  • Semantic Web and Ontologies
  • Software System Performance and Reliability
  • Data Management and Algorithms
  • Advanced Data Storage Technologies
  • Big Data and Business Intelligence
  • Algorithms and Data Compression
  • Service-Oriented Architecture and Web Services
  • Digital Rights Management and Security
  • Business Process Modeling and Analysis
  • Linguistic research and analysis
  • Cloud Data Security Solutions
  • Video Analysis and Summarization
  • Scientific Computing and Data Management
  • Peer-to-Peer Network Technologies
  • Libraries and Information Services
  • Digital Humanities and Scholarship
  • Quantum Computing Algorithms and Architecture
  • Data Visualization and Analytics
  • Machine Learning and Data Classification
  • Network Security and Intrusion Detection
  • Corporate Governance and Management

University of Hagen
2021-2025

Darmstadt University of Applied Sciences
2005-2021

Systems, Applications & Products in Data Processing (United Kingdom)
2010

Hochschule Bremen
2005

University of Bremen
2005

Friedrich Schiller University Jena
1995-2000

NoSQL data stores are commonly schema-less, providing no means for globally defining or managing the schema. While this offers great flexibility in early stages of application development, developers can soon experience the heavy burden of dealing with increasingly heterogeneous data. This paper targets schema evolution for NoSQL data stores, the complex task of adapting and changing the implicit structure of the data stored. We discuss the recommendations of the developer community on handling schema changes, and introduce a simple, declarative schema evolution language....

10.48550/arxiv.1308.0514 preprint EN other-oa arXiv (Cornell University) 2013-01-01

Abstract: In an era of rapid technological progress, traditional methods of research information management are proving inadequate. We present a project outline that introduces the concept of "Data Management 4.0", which uses artificial intelligence (AI) to significantly improve research information systems. By integrating AI-supported solutions, institutions could achieve increased efficiency, accuracy, and optimal resource allocation when handling large...

10.1515/iwp-2024-2048 article DE Information - Wissenschaft & Praxis 2025-01-14

Data accumulating in data lakes can become inaccessible in the long run when its semantics are not available. The heterogeneity of data formats and the sheer volumes of data collections prohibit cleaning and unifying the data manually. Thus, tools for automated data lake analysis are of great interest. In this paper, we target the particular problem of reconstructing the schema evolution history from data lakes. Knowing how the data is structured, and how the structure has evolved over time, enables programmatic access to the lake. By deriving a sequence of schema versions, rather than...

10.1109/bigdata.2017.8258204 article EN 2017 IEEE International Conference on Big Data (Big Data) 2017-12-01

This paper explores scalable implementation strategies for carrying out lazy schema evolution in NoSQL data stores. For decades, schema evolution has been an evergreen in database research. Yet new challenges arise in the context of cloud-hosted data backends: With all database reads and writes charged by the provider, migrating the entire data instance eagerly into a new schema can be prohibitively expensive. Thus, lazy migration may be more cost-efficient, as legacy entities are only migrated in case they are actually accessed by the application. Related work has shown that...

10.1109/bigdata.2016.7840924 article EN 2016 IEEE International Conference on Big Data (Big Data) 2016-12-01

The use of Elastic Stack (ELK) solutions and Knowledge Graphs (KGs) has attracted a lot of attention lately, with promises of vastly improving business performance based on new insights and better decisions. This allows organizations not only to reap the ultimate benefits of data governance but also to consider the widest possible range of relevant information when deciding their next steps. In this paper, we examine how data management and data visualization are used in organizations that use ELK solutions to collect integrated data from different sources in one...

10.3390/fi15060190 article EN cc-by Future Internet 2023-05-25

Building applications for processing data lakes is a software engineering challenge. We present Darwin, a middleware for applications that operate on variational data. This concerns data with heterogeneous structure, usually stored within a schema-flexible NoSQL database. Darwin assists application developers in essential data and schema curation tasks: Upon request, Darwin extracts a schema description, discovers the history of schema versions, and proposes mappings between these versions. Users may interactively choose which mappings are most realistic....

10.1109/icde.2018.00187 article EN 2018 IEEE 34th International Conference on Data Engineering (ICDE) 2018-04-01

We demonstrate MigCast, a tool-based advisor for exploring data migration strategies in the context of developing NoSQL-backed applications. Users of MigCast can consider their options for evolving their data model along with legacy data already persisted in the cloud-hosted production database. They can explore alternative actions as the financial costs are predicted respective to the cloud provider chosen. Thereby they are better equipped to assess the potential consequences of imminent data migration decisions. To this end, MigCast maintains an internal cost model,...

10.1145/3299869.3320223 article EN Proceedings of the 2019 International Conference on Management of Data 2019-06-18

When NoSQL database systems are used in an agile software development setting, data model changes occur frequently and thus, data is routinely stored in different versions. The management of versioned data leads to an overhead potentially impeding the software development. Several data migration strategies exist that handle legacy data differently during data accesses, each of which can be characterized by certain advantages and disadvantages. Depending on the requirements for the application, we evaluate and compare the migration strategies through metrics like...

10.1007/s10619-021-07334-1 article EN cc-by Distributed and Parallel Databases 2021-04-30

When NoSQL database systems are used in an agile software development setting, data model changes occur frequently and thus, data is routinely stored in different versions. This leads to an overhead affecting the software development and, in particular, the management of data accesses. In this context, different data migration strategies exist, which are characterized by certain advantages and disadvantages. Using exactly that strategy whose characteristics match according to the scenario depends on the query workload, the changes caused by schema evolution, and the requirements for the application...

10.1109/icdew49219.2020.00013 article EN 2020-04-01

Data-driven methods and data science are important scientific methods in many research fields. All data science approaches require professional data engineering components. At the moment, computer science experts are needed for solving these data engineering tasks. Simultaneously, scientists from many fields (like natural sciences, medicine, environmental sciences, and engineering) want to analyse their data autonomously. The arising task for data engineering is the development of tools that can support an automated data curation and are utilisable for domain experts. In this article, we will...

10.1007/s13222-021-00399-3 article EN cc-by Datenbank-Spektrum 2021-12-22

To provide good results and decisions in data-driven systems, data quality must be ensured as a primary consideration. An important aspect of this is data cleaning. Although many different algorithms and tools already exist for data cleaning, an end-to-end solution is still needed. In this paper, we present our vision of a well-founded end-to-end data cleaning optimizer. In contrast to studies that consider data cleaning in the context of machine learning, our approach focuses on various scenarios, such as when preprocessing and downstream analysis are separated. Our...

10.1109/icdew61823.2024.00039 article EN 2024-05-13

10.1007/s13222-014-0156-z article DE Datenbank-Spektrum 2014-06-18

We address a practical challenge in agile web development against NoSQL data stores: Upon a new release of the web application, entities already persisted in production no longer match the application code. Rather than migrating all legacy entities eagerly (prior to the release) and at the cost of application downtime, lazy data migration is a popular alternative: When a legacy entity is loaded by the application, all pending structural changes are applied. Yet correctly migrating legacy entities from several releases back, involving more than one entity at-a-time, is not trivial. In this paper, we propose a holistic...

10.1145/2815072.2815078 article EN 2015-10-27