NFDI4DS | UHH-SEMS - Publication Details

Stefanie Scherzinger

ORCID: 0000-0002-1960-6171

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5053676208

Research Areas

Advanced Database Systems and Queries
Cloud Computing and Resource Management
Scientific Computing and Data Management
Semantic Web and Ontologies
Distributed systems and fault tolerance
Software Engineering Research
Software System Performance and Reliability
Data Quality and Management
Advanced Data Storage Technologies
Data Management and Algorithms
Peer-to-Peer Network Technologies
Natural Language Processing Techniques
Algorithms and Data Compression
Data Visualization and Analytics
Logic, programming, and type systems
Digitalization, Law, and Regulation
Quantum Computing Algorithms and Architecture
Quantum Information and Cryptography
Web Data Mining and Analysis
Distributed and Parallel Computing Systems
Service-Oriented Architecture and Web Services
Digital Innovation in Industries
Evolutionary Algorithms and Applications
Data Stream Mining Techniques
Advanced Software Engineering Methodologies

University of Passau
2004-2024

Regensburg University of Applied Sciences
2013-2020

University of Regensburg
2018

Saarland University
2006-2008

Humboldt-Universität zu Berlin
2004

TU Wien
2004

Technical University of Munich
2004

Ready to Leap (by Co-Design)? Join Order Optimisation on Quantum Hardware

OPENALEX - Publications

Manuel Schönberger Stefanie Scherzinger Wolfgang Mauerer

The prospect of achieving computational speedups by exploiting quantum phenomena makes the use processing units (QPUs) attractive for many algorithmic database problems. Query optimisation, which concerns problems that typically need to explore large search spaces, seems like an ideal match algorithms. We present first implementation join ordering, one most investigated and fundamental query optimisation problems, based on a reformulation quadratic binary unconstrained empirically...

10.1145/3588946 article EN cc-by-nc-sa Proceedings of the ACM on Management of Data 2023-05-26

Validation of Modern JSON Schema: Formalization and Complexity

OPENALEX - Publications

Lyes Attouche Mohamed-Amine Baazizi Dario Colazzo Giorgio Ghelli Carlo Sartiani and 1 more

JSON Schema is the de-facto standard schema language for data. The went through many minor revisions, but most recent versions of language, starting from Draft 2019-09, added two novel features, dynamic references and annotation-dependent validation, that change evaluation model. Modern name used to indicate all which are characterized by these new while Classical previous versions. These “modern” features make quite difficult understand have generated discussions about correct...

10.1145/3632891 article EN Proceedings of the ACM on Programming Languages 2024-01-05

Managing Schema Evolution in NoSQL Data Stores

OPENALEX - Publications

Stefanie Scherzinger Meike Klettke Uta Störl

NoSQL data stores are commonly schema-less, providing no means for globally defining or managing the schema. While this offers great flexibility in early stages of application development, developers soon can experience heavy burden dealing with increasingly heterogeneous data. This paper targets schema evolution stores, complex task adapting and changing implicit structure stored. We discuss recommendations developer community on handling changes, introduce a simple, declarative language....

10.48550/arxiv.1308.0514 preprint EN other-oa arXiv (Cornell University) 2013-01-01

Uncovering the evolution history of data lakes

OPENALEX - Publications

Meike Klettke Hannes Awolin Uta Störl D. Müller Stefanie Scherzinger

Data accumulating in data lakes can become inaccessible the long run when its semantics are not available. The heterogeneity of formats and sheer volumes collections prohibit cleaning unifying manually. Thus, tools for automated lake analysis great interest. In this paper, we target particular problem reconstructing schema evolution history from lakes. Knowing how is structured, structure has evolved over time, enables programmatic access to lake. By deriving a sequence versions, rather than...

10.1109/bigdata.2017.8258204 article EN 2021 IEEE International Conference on Big Data (Big Data) 2017-12-01

NoSQL schema evolution and big data migration at scale

OPENALEX - Publications

Meike Klettke Uta Störl Manuel Shenavai Stefanie Scherzinger

This paper explores scalable implementation strategies for carrying out lazy schema evolution in NoSQL data stores. For decades, has been an evergreen database research. Yet new challenges arise the context of cloud-hosted backends: With all reads and writes charged by provider, migrating entire instance eagerly into a can be prohibitively expensive. Thus, migration may more cost-efficient, as legacy entities are only migrated case they actually accessed application. Related work shown that...

10.1109/bigdata.2016.7840924 article EN 2021 IEEE International Conference on Big Data (Big Data) 2016-12-01

1-2-3 Reproducibility for Quantum Software Experiments

OPENALEX - Publications

Wolfgang Mauerer Stefanie Scherzinger

Various fields of science face a reproducibility crisis. For quantum software engineering as an emerging field, it is therefore imminent to focus on proper from the start. Yet provision reproduction packages almost universally lacking. Actionable advice how build such rare, particularly unfortunate in field with many contributions researchers backgrounds outside computer science. In this article, we argue rectify deficiency by proposing 1-2-3 approach for experiments: Using meta-generation...

10.1109/saner53432.2022.00148 article EN 2022 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER) 2022-03-01

The Case for DBMS Live Patching [Extended Version]

OPENALEX - Publications

Michael Fruth Stefanie Scherzinger

Traditionally, when the code of a database management system (DBMS) needs to be updated, is restarted and clients suffer downtime, or provider instantiates hot-standby instances rolls over workload. We investigate third option, live patching DBMS binary. For certain changes, allows modify application in memory, without restart. The memory state all client connections can maintained. Although has been explored operating systems research community, it remains blind spot research. In this...

10.48550/arxiv.2410.09925 preprint EN arXiv (Cornell University) 2024-10-13

Attribute grammars for scalable query processing on XML streams

OPENALEX - Publications

Christoph Koch Stefanie Scherzinger

10.1007/s00778-005-0169-1 article EN The VLDB Journal 2006-01-30

ControVol: A framework for controlled schema evolution in NoSQL application development

OPENALEX - Publications

Stefanie Scherzinger Thomas Cerqueus Eduardo Cunha de Almeida

Building scalable web applications on top of NoSQL data stores is becoming common practice. Many these can easily be accessed programmatically, and do not enforce a schema. Software engineers design the model go, flexibility that crucial in agile software development. The typical tasks database schema management are now handled within application code, usually involving object mapper libraries. However, today's Integrated Development Environments (IDEs) lack proper tool support when it comes...

10.1109/icde.2015.7113402 preprint EN 2015-04-01

Peel | Pile? Cross-Framework Portability of Quantum Software

OPENALEX - Publications

Manuel Schönberger Maja Franz Stefanie Scherzinger Wolfgang Mauerer

In recent years, various vendors have made quantum software frameworks available. Yet with vendor-specific frameworks, code portability seems at risk, especially in a field where hardware and libraries not yet reached consolidated state, even foundational aspects of the technologies are still flux. Accordingly, development vendor-independent programming languages is often suggested. This follows established architectural pattern introducing additional levels abstraction into stacks, thereby...

10.1109/icsa-c54293.2022.00039 article EN 2022-03-01

Combined Static and Dynamic Analysis for Effective Buffer Minimization in Streaming XQuery Evaluation

OPENALEX - Publications

Michael Schmidt Stefanie Scherzinger Christoph Koch

Effective buffer management is crucial for efficient in-memory and streaming XQuery processing. We propose a scheme which combines static dynamic analysis to keep main memory consumption low. Our approach relies on technique that we call active garbage collection actively purges buffers at runtime based the current status of query evaluation. have built prototype system practical fragment employs our scheme. The experimental results demonstrate significant impact combined reducing running time.

10.1109/icde.2007.367869 article EN 2007-04-01

Curating Variational Data in Application Development

OPENALEX - Publications

Uta Störl D. Müller Alexander Tekleab Stephane Tolale Julian Stenzel and 2 more

Building applications for processing data lakes is a software engineering challenge. We present Darwin, middleware that operate on variational data. This concerns with heterogeneous structure, usually stored within schema-flexible NoSQL database. Darwin assists application developers in essential and schema curation tasks: Upon request, extracts description, discovers the history of versions, proposes mappings between these versions. Users may interactively choose which are most realistic....

10.1109/icde.2018.00187 article EN 2022 IEEE 38th International Conference on Data Engineering (ICDE) 2018-04-01

MigCast

OPENALEX - Publications

Andrea Hillenbrand Maksym Levchenko Uta Störl Stefanie Scherzinger Meike Klettke

We demonstrate MigCast, a tool-based advisor for exploring data migration strategies in the context of developing NoSQL-backed applications. Users MigCast can consider their options evolving model along with legacy already persisted cloud-hosted production database. They explore alternative actions as financial costs are predicted respective to cloud provider chosen. Thereby they better equipped assess potential consequences imminent decisions. To this end, maintains an internal cost model,...

10.1145/3299869.3320223 article EN Proceedings of the 2022 International Conference on Management of Data 2019-06-18

Live Patching for Distributed In-Memory Key-Value Stores

OPENALEX - Publications

Michael Fruth Stefanie Scherzinger

Providers of high-availability data stores need to roll out software updates without causing noticeable downtimes. For distributed like Redis Cluster, the state-of-the-art is a rolling update, where nodes are restarted in sequence. This requires preserving, restoring, and resynchronizing database state, which can significantly prolong for larger memory states, thus delay critical security fixes. In this article, we propose applying directly restarting any nodes. We present first fully...

10.1145/3698816 article EN Proceedings of the ACM on Management of Data 2024-12-18

XML Prefiltering as a String Matching Problem

OPENALEX - Publications

Christoph Koch Stefanie Scherzinger Michael Schmidt

We propose a new technique for the efficient search and navigation in XML documents streams. This takes string matching algorithms designed keyword flat strings into second dimension, to navigate tree structured data. consider important data management task of prefiltering (also called projection) as an application our approach. Different from existing schemes, we usually process only fractions input get by with very economical consumption both main memory processing time. Our experiments...

10.1109/icde.2008.4497471 article EN 2008-04-01

Towards Generating Consumer Labels for Machine Learning Models

OPENALEX - Publications

Christin Seifert Stefanie Scherzinger Lena Wiese

Machine learning (ML) based decision making is becoming commonplace. For persons affected by ML-based decisions, a certain level of transparency regarding the properties underlying ML model can be fundamental. In this vision paper, we propose to issue consumer labels for trained and published models. These primarily target machine lay persons, such as operators an system, executors subjects themselves. Provided that comprehensively capture characteristics model, consumers are enabled...

10.1109/cogmi48466.2019.00033 article EN 2019-12-01

On the necessity of model checking NoSQL database schemas when building SaaS applications

OPENALEX - Publications

Stefanie Scherzinger Eduardo Cunha de Almeida Felipe Ickert Marcos Didonet Del Fabro

The design of the NoSQL schema has a direct impact on scalability web applications. Especially for developers with little experience in stores, risks inherent poor can be incalculable. Worse yet, issues will only manifest once application been deployed, and growing user base causes highly concurrent writes. In this paper, we present model checking approach to reveal bottlenecks schemas. Our draws formal methods from tree automata theory perform conservative static analysis both expected...

10.1145/2489295.2489297 article EN 2013-07-15

Safely Managing Data Variety in Big Data Software Development

OPENALEX - Publications

Thomas Cerqueus Eduardo Cunha de Almeida Stefanie Scherzinger

We consider the task of building Big Data software systems, offered as software-as-a-service. These applications are commonly backed by NoSQL data stores that address proverbial Vs processing: can handle large volumes and many systems do not enforce a global schema, to account for structural variety in data. Thus, engineers design model on go, flexibility is particularly crucial agile development. However, yet veracity changes when it comes structure persisted Yet this an inevitable...

10.1109/bigdse.2015.9 preprint EN 2015-05-01

Coming Soon ...