NFDI4DS | UHH-SEMS - Publication Details

Cristina Sarasua

ORCID: 0000-0002-2076-9584

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5023061106

Research Areas

Scientific Computing and Data Management
Semantic Web and Ontologies
Mobile Crowdsensing and Crowdsourcing
Distributed and Parallel Computing Systems
Research Data Management Practices
Video Analysis and Summarization
Software Engineering Research
Wikis in Education and Collaboration
Data Quality and Management
Music and Audio Processing
Open Source Software Innovations
Auction Theory and Applications
Privacy-Preserving Technologies in Data
Service-Oriented Architecture and Web Services
Natural Language Processing Techniques
Biomedical Text Mining and Ontologies
Advanced Database Systems and Queries
Opinion Dynamics and Social Influence
Topic Modeling
Educational Assessment and Pedagogy
Software Testing and Debugging Techniques
Advanced Text Analysis Techniques
Animal Behavior and Welfare Studies
Complex Network Analysis Techniques
Image Retrieval and Classification Techniques

University of Zurich
2018-2024

Universität Koblenz
2013-2017

University of Koblenz and Landau
2013-2017

Karlsruhe Institute of Technology
2009-2012

Vicomtech
2008-2011

Instituto de Ciencias del Patrimonio
2011

The Impact of Task Abandonment in Crowdsourcing

OPENALEX - Publications

Lei Han Kevin Roitero Ujwal Gadiraju Cristina Sarasua Alessandro Checco and 2 more

Crowdsourcing has become a standard methodology to collect manually annotated data such as relevance judgments at scale. On crowdsourcing platforms like Amazon MTurk or FigureEight, crowd workers select tasks work on based different dimensions task reward and requester reputation. Requesters then receive the of who self-selected into completed them successfully. Several workers, however, preview tasks, begin working them, reaching varying stages completion without finally submitting their...

10.1109/tkde.2019.2948168 article EN IEEE Transactions on Knowledge and Data Engineering 2019-01-01

All Those Wasted Hours

OPENALEX - Publications

Lei Han Kevin Roitero Ujwal Gadiraju Cristina Sarasua Alessandro Checco and 2 more

10.1145/3289600.3291035 article EN 2019-01-30

The pig transport network in Switzerland: Structure, patterns, and implications for the transmission of infectious diseases between animal holdings

OPENALEX - Publications

Martin Sterchi Céline Faverjon Cristina Sarasua Maria Elena Vargas John Berezowski and 3 more

The topology of animal transport networks contributes substantially to how fast and what extent a disease can transmit between holdings. Therefore, public authorities in many countries mandate livestock holdings report all movements animals. However, the reported data often does not contain information about exact sequence transports, making it impossible assess effect truck sharing contamination on transmission. aim this study was analyze Swiss pig network by means social analysis...

10.1371/journal.pone.0217974 article EN cc-by PLoS ONE 2019-05-31

Crowd Worker Strategies in Relevance Judgment Tasks

OPENALEX - Publications

Lei Han Eddy Maddalena Alessandro Checco Cristina Sarasua Ujwal Gadiraju and 2 more

Crowdsourcing is a popular technique to collect large amounts of human-generated labels, such as relevance judgments used create information retrieval (IR) evaluation collections. Previous research has shown how collecting high quality labels from crowdsourcing platform can be challenging. Existing assurance techniques focus on answer aggregation or the use gold questions where ground-truth data allows check for responses.

10.1145/3336191.3371857 article EN 2020-01-20

Crowdsourcing and the Semantic Web: A Research Manifesto

OPENALEX - Publications

Cristina Sarasua Elena Simperl Natasha Noy Abraham Bernstein Jan Marco Leimeister

Our goal with this research manifesto is to define a roadmap guide the evolution of new field that emerging at intersection between crowdsourcing and Semantic Web. We analyze confluence these two disciplines by exploring their relationship. First, we focus on how application techniques can enhance machine-driven execution Web tasks. Second, look ways in which machine-processable semantics benefit design management projects. As result, are able describe list successful or promising scenarios...

10.15346/hc.v2i1.2 article EN Human Computation 2015-08-10

The Evolution of Power and Standard Wikidata Editors: Comparing Editing Behavior over Time to Predict Lifespan and Volume of Edits

OPENALEX - Publications

Cristina Sarasua Alessandro Checco Gianluca Demartini Djellel Difallah Michael D. Feldman and 1 more

10.1007/s10606-018-9344-y article EN Computer Supported Cooperative Work (CSCW) 2018-12-15

A Transdisciplinary Approach Supporting the Implementation of a Big Data Project in Livestock Production: An Example From the Swiss Pig Production Industry

OPENALEX - Publications

Céline Faverjon Abraham Bernstein Rolf Grütter Christina Nathues Heiko Nathues and 4 more

Big Data approaches offer potential benefits for improving animal health, but they have not been broadly implemented in livestock production systems. Privacy issues, the large number of stakeholders, and competitive environment all make data sharing integration a challenge The Swiss pig industry illustrates these other issues. It is highly decentralized fragmented complex network made up small independent actors collecting amount heterogeneous data. Transdisciplinary hold promise overcoming...

10.3389/fvets.2019.00215 article EN cc-by Frontiers in Veterinary Science 2019-07-04

Visualising data science workflows to support third-party notebook comprehension: an empirical study

OPENALEX - Publications

Dhivyabharathi Ramasamy Cristina Sarasua Alberto Bacchelli Abraham Bernstein

Data science is an exploratory and iterative process that often leads to complex unstructured code. This code usually poorly documented and, consequently, hard understand by a third party. In this paper, we first collect empirical evidence for the non-linearity of data from real-world Jupyter notebooks, confirming need new approaches aid in interaction comprehension. Second, propose visualisation method elucidates implicit workflow information assists scientists navigating so-called garden...

10.1007/s10664-023-10289-9 article EN cc-by Empirical Software Engineering 2023-03-23

Microtask Available, Send us your CV!

OPENALEX - Publications

Cristina Sarasua Matthias Thimm

Most current micro task crowd sourcing platforms do not exploit the individual expertise of workers, which becomes extremely relevant for knowledge-intensive tasks in human computation scenarios. In this paper, we discuss work progress on worker profiling within to increase both quality and satisfaction users. We analyse issue workers propose introduction a CV as comprehensive means describe worker's interests. several important dimensions that should be included such their benefits.

10.1109/cgc.2013.87 article EN International Conference on Cloud and Green Computing 2013-09-01

Ontology metadata for ontology reuse

OPENALEX - Publications

Elena Simperl Cristina Sarasua Rachanee Ungrangsi Tobias Bürger

Ontologies are often poorly documented, thus being hardly accessible to users and ontology reuse services. A first step towards a better documentation is an explicit schema for their systematic description. To offer informed background such we surveyed engineering technology, the types of ontology-related descriptive means they use. The result part OMV standard, was evaluated through professional reviews. second provision automatic techniques acquire documentation, which devised Ontology...

10.1504/ijmso.2011.046579 article EN International Journal of Metadata Semantics and Ontologies 2011-01-01

Retrieving Film Heritage Content Using an MPEG-7 Compliant Ontology

OPENALEX - Publications

Yolanda Cobos Cristina Sarasua María Teresa Linaza Ivan Jimenez Ander García

This paper presents the design and implementation of an MPEG-7 based Multimedia Retrieval System for Film Heritage. The multimedia content has been indexed using Annotation Tool on standard. Moreover, Compliant Ontology in OWL DL, which is briefly explained this paper, specially developed to fulfil requirements CINeSPACE project. ontology instantiated so that retrieval process can be handled. system assessed during validation

10.1109/smap.2008.16 article EN 2008-12-01

An architecture for fast semantic retrieval in the film heritage domain

OPENALEX - Publications

Yolanda Cobosi Carlos Toro Cristina Sarasua Javier Vaquero María Teresa Linaza and 1 more

How to best perform a search over dataset is common field of research in the scientific community. Query engines are used when information handled by an ontology order obtain structured semantically. A issue that arises same query performed several times, as engine must check domain every time retrieve information. In this paper, we propose architecture takes advantage concept Reflexive Ontologies (RO) achieve timely semantic retrieval. The proposed illustrated case study Film Heritage...

10.1109/cbmi.2008.4564957 article EN 2008-01-01

Workflow analysis of data science code in public GitHub repositories

OPENALEX - Publications

Dhivyabharathi Ramasamy Cristina Sarasua Alberto Bacchelli Abraham Bernstein

Despite the ubiquity of data science, we are far from rigorously understanding how coding in science is performed. Even though scientific literature has hinted at iterative and explorative nature coding, need further empirical evidence to understand this practice its workflows detail. Such critical recognise needs scientists and, for instance, inform tooling support. To obtain a deeper analysed 470 Jupyter notebooks publicly available GitHub repositories. We focused on extent which...

10.1007/s10664-022-10229-z article EN cc-by Empirical Software Engineering 2022-11-19

Crowdsourcing and the Semantic Web (Dagstuhl Seminar 14282)

OPENALEX - Publications

Abraham Bernstein Jan Marco Leimeister Natasha Noy Cristina Sarasua Elena Simperl

Semantic technologies provide flexible and scalable solutions to master make sense of an increasingly vast complex data landscape. However, while this potential has been acknowledged for various application scenarios domains, a number success stories exist, it is equally clear that the development deployment semantic will always remain reliant human input intervention. This due very nature some tasks associated with management life cycle, which are famous their knowledge-intensive and/or...

10.4230/dagrep.4.7.25 article EN Dagstuhl reports 2014-01-01

Fast and Adaptive Questionnaires for Voting Advice Applications

OPENALEX - Publications

Fynn Bachmann Cristina Sarasua Abraham Bernstein

The effectiveness of Voting Advice Applications (VAA) is often compromised by the length their questionnaires. To address user fatigue and incomplete responses, some applications (such as Swiss Smartvote) offer a condensed version questionnaire. However, these versions can not ensure accuracy recommended parties or candidates, which we show to remain below 40%. tackle limitations, this work introduces an adaptive questionnaire approach that selects subsequent questions based on users'...

10.48550/arxiv.2404.01872 preprint EN arXiv (Cornell University) 2024-04-02

Estimating the Semantic Density of Visual Media

OPENALEX - Publications

Luca Rossetto Cristina Sarasua Abraham Bernstein

10.1145/3664647.3681594 article EN 2024-10-26

Coming Soon ...