Cristina Sarasua

ORCID: 0000-0002-2076-9584
Research Areas
  • Scientific Computing and Data Management
  • Semantic Web and Ontologies
  • Mobile Crowdsensing and Crowdsourcing
  • Distributed and Parallel Computing Systems
  • Research Data Management Practices
  • Video Analysis and Summarization
  • Software Engineering Research
  • Wikis in Education and Collaboration
  • Data Quality and Management
  • Music and Audio Processing
  • Open Source Software Innovations
  • Auction Theory and Applications
  • Privacy-Preserving Technologies in Data
  • Service-Oriented Architecture and Web Services
  • Natural Language Processing Techniques
  • Biomedical Text Mining and Ontologies
  • Advanced Database Systems and Queries
  • Opinion Dynamics and Social Influence
  • Topic Modeling
  • Educational Assessment and Pedagogy
  • Software Testing and Debugging Techniques
  • Advanced Text Analysis Techniques
  • Animal Behavior and Welfare Studies
  • Complex Network Analysis Techniques
  • Image Retrieval and Classification Techniques

University of Zurich
2018-2024

Universität Koblenz
2013-2017

University of Koblenz and Landau
2013-2017

Karlsruhe Institute of Technology
2009-2012

Vicomtech
2008-2011

Instituto de Ciencias del Patrimonio
2011

Crowdsourcing has become a standard methodology to collect manually annotated data such as relevance judgments at scale. On crowdsourcing platforms like Amazon MTurk or FigureEight, crowd workers select tasks to work on based on different dimensions such as task reward and requester reputation. Requesters then receive the judgments of workers who self-selected into the tasks and completed them successfully. Several workers, however, preview tasks, begin working on them, reaching varying stages of task completion without finally submitting their...

10.1109/tkde.2019.2948168 article EN IEEE Transactions on Knowledge and Data Engineering 2019-01-01

Crowdsourcing has become a standard methodology to collect manually annotated data such as relevance judgments at scale. On crowdsourcing platforms like Amazon MTurk or FigureEight, crowd workers select tasks to work on based on different dimensions such as task reward and requester reputation. Requesters then receive the judgments of workers who self-selected into the tasks and completed them successfully. Several workers, however, preview tasks, begin working on them, reaching varying stages of task completion without finally submitting their...

10.1145/3289600.3291035 article EN 2019-01-30

The topology of animal transport networks contributes substantially to how fast and to what extent a disease can transmit between holdings. Therefore, public authorities in many countries mandate livestock holdings to report all movements of animals. However, the reported data often does not contain information about the exact sequence of transports, making it impossible to assess the effect of truck sharing and contamination on transmission. The aim of this study was to analyze the Swiss pig network by means of social network analysis...

10.1371/journal.pone.0217974 article EN cc-by PLoS ONE 2019-05-31

Crowdsourcing is a popular technique to collect large amounts of human-generated labels, such as relevance judgments used to create information retrieval (IR) evaluation collections. Previous research has shown how collecting high quality labels from a crowdsourcing platform can be challenging. Existing quality assurance techniques focus on answer aggregation or on the use of gold questions, where ground-truth data allows checking the quality of responses.

10.1145/3336191.3371857 article EN 2020-01-20

Our goal with this research manifesto is to define a roadmap to guide the evolution of the new research field that is emerging at the intersection between crowdsourcing and the Semantic Web. We analyze the confluence of these two disciplines by exploring their relationship. First, we focus on how the application of crowdsourcing techniques can enhance the machine-driven execution of Semantic Web tasks. Second, we look at ways in which machine-processable semantics can benefit the design and management of crowdsourcing projects. As a result, we are able to describe a list of successful or promising scenarios...

10.15346/hc.v2i1.2 article EN Human Computation 2015-08-10

Big Data approaches offer potential benefits for improving animal health, but they have not been broadly implemented in livestock production systems. Privacy issues, the large number of stakeholders, and the competitive environment all make data sharing and integration a challenge. The Swiss pig industry illustrates these and other issues. It is a highly decentralized and fragmented complex network made up of small independent actors collecting a large amount of heterogeneous data. Transdisciplinary approaches hold promise for overcoming...

10.3389/fvets.2019.00215 article EN cc-by Frontiers in Veterinary Science 2019-07-04

Data science is an exploratory and iterative process that often leads to complex and unstructured code. This code is usually poorly documented and, consequently, hard to understand by a third party. In this paper, we first collect empirical evidence for the non-linearity of data science code from real-world Jupyter notebooks, confirming the need for new approaches that aid in interaction and comprehension. Second, we propose a visualisation method that elucidates implicit workflow information and assists data scientists in navigating the so-called garden...

10.1007/s10664-023-10289-9 article EN cc-by Empirical Software Engineering 2023-03-23

Most current micro task crowd sourcing platforms do not exploit the individual expertise of workers, which becomes extremely relevant for knowledge-intensive tasks in human computation scenarios. In this paper, we discuss work in progress on worker profiling within crowd sourcing platforms to increase both the quality of work and the satisfaction of users. We analyse the issue of profiling workers and propose the introduction of a worker CV as a comprehensive means to describe a worker's expertise and interests. We discuss several important dimensions that should be included in such a CV and their benefits.

10.1109/cgc.2013.87 article EN International Conference on Cloud and Green Computing 2013-09-01

Ontologies are often poorly documented, thus being hardly accessible to users and ontology reuse services. A first step towards a better documentation of ontologies is an explicit schema for their systematic description. To offer an informed background for such a schema, we surveyed ontology engineering technology and the types of ontology-related descriptive means they use. The result, which is part of the OMV standard, was evaluated through professional reviews. A second step is the provision of automatic techniques to acquire ontology documentation, for which we devised the Ontology...

10.1504/ijmso.2011.046579 article EN International Journal of Metadata Semantics and Ontologies 2011-01-01

This paper presents the design and implementation of an MPEG-7 based Multimedia Retrieval System for Film Heritage. The multimedia content has been indexed using an Annotation Tool based on the MPEG-7 standard. Moreover, an MPEG-7 Compliant Ontology in OWL DL, which is briefly explained in this paper, has been specially developed to fulfil the requirements of the CINeSPACE project. The ontology has been instantiated so that the retrieval process can be handled. The system has been assessed during the validation

10.1109/smap.2008.16 article EN 2008-12-01

How to best perform a search over a dataset is a common field of research in the scientific community. Query engines are used when the information is handled by an ontology in order to obtain information structured semantically. A common issue arises when the same query is performed several times, as the engine must check the domain ontology every time to retrieve the information. In this paper, we propose an architecture that takes advantage of the concept of Reflexive Ontologies (RO) to achieve timely semantic retrieval. The proposed architecture is illustrated by a case study in Film Heritage...

10.1109/cbmi.2008.4564957 article EN 2008-01-01

Despite the ubiquity of data science, we are far from rigorously understanding how coding in data science is performed. Even though the scientific literature has hinted at the iterative and explorative nature of data science coding, we need further empirical evidence to understand this practice and its workflows in detail. Such understanding is critical to recognise the needs of data scientists and, for instance, inform tooling support. To obtain a deeper understanding, we analysed 470 Jupyter notebooks publicly available in GitHub repositories. We focused on the extent to which...

10.1007/s10664-022-10229-z article EN cc-by Empirical Software Engineering 2022-11-19

Semantic technologies provide flexible and scalable solutions to master and make sense of an increasingly vast and complex data landscape. However, while this potential has been acknowledged for various application scenarios and domains, and a number of success stories exist, it is equally clear that the development and deployment of semantic technologies will always remain reliant on human input and intervention. This is due to the very nature of some of the tasks associated with the semantic data management life cycle, which are famous for their knowledge-intensive and/or...

10.4230/dagrep.4.7.25 article EN Dagstuhl reports 2014-01-01

The effectiveness of Voting Advice Applications (VAA) is often compromised by the length of their questionnaires. To address user fatigue and incomplete responses, some applications (such as the Swiss Smartvote) offer a condensed version of their questionnaire. However, these condensed versions cannot ensure the accuracy of recommended parties or candidates, which we show to remain below 40%. To tackle these limitations, this work introduces an adaptive questionnaire approach that selects subsequent questions based on users'...

10.48550/arxiv.2404.01872 preprint EN arXiv (Cornell University) 2024-04-02