- Semantic Web and Ontologies
- Data Quality and Management
- Service-Oriented Architecture and Web Services
- Topic Modeling
- Mobile Crowdsensing and Crowdsourcing
- Wikis in Education and Collaboration
- Natural Language Processing Techniques
- Business Process Modeling and Analysis
- Scientific Computing and Data Management
- Open Source Software Innovations
- Big Data and Business Intelligence
- Biomedical Text Mining and Ontologies
- Advanced Database Systems and Queries
- Species Distribution and Climate Change
- Research Data Management Practices
- Advanced Graph Neural Networks
- Web Data Mining and Analysis
- E-Government and Public Services
- Auction Theory and Applications
- Knowledge Management and Sharing
- Data Management and Algorithms
- FinTech, Crowdfunding, Digital Finance
- Data Stream Mining Techniques
- Peer-to-Peer Network Technologies
- Privacy-Preserving Technologies in Data
King's College London
2019-2025
Open Data Institute
2023-2024
Bern University of Applied Sciences
2021-2022
University of Southampton
2012-2020
Association for the Advancement of Artificial Intelligence
2018
University of Zurich
2018
National Institute for Astrophysics
2018
Karlsruhe Institute of Technology
2010-2015
Universität Innsbruck
2007-2013
Leipzig University
2013
Web data and computational models can play important roles in analyzing cultural trends.The current study presents an analysis of 23,492 sneaker images metadata collected from a global reselling shop, StockX.com.Based on encompassing 22 years 1999 to 2020, we propose design index that helps track changes the characteristics sneakers using contrastive learning method.Our suggest designs have been employing brighter colors lower hue saturation values over time.We also observe how popular...
Generating value from data requires the ability to find, access and make sense of datasets. There are many efforts underway encourage sharing reuse, scientific publishers asking authors submit alongside manuscripts marketplaces, open portals communities. Google recently beta-released a search service for datasets, which allows users discover stored in various online repositories via keyword queries. These developments foreshadow an emerging research field around dataset or retrieval that...
Sustained engagement of participants is essential for the success a citizen science project. However, motivations why people engage with such activities can be idiosyncratic, varied, and evolving. In this article we examine player participation in Eyewire, game. We undertake an investigation Eyewire players take part game based on responses from large-scale survey. Our analysis identifies 4 groups features which impact long-term engagement. draw theories motivation consider categories...
Structured data such as databases, spreadsheets and web tables is becoming critical in every domain professional role. Yet we still do not know much about how people interact with it. Our research focuses on the information seeking behaviour of looking for new sources structured online, including task context which will be used, search, identification relevant datasets from a set possible candidates. We present mixed-methods study covering in-depth interviews 20 participants various...
The fitness of the systems in which Machine Learning (ML) is used depends greatly on good-quality data. Specifications what makes a dataset have traditionally been defined by needs data users—typically analysts and engineers. Our article critically examines extent to established quality frameworks are applicable contemporary use cases ML. Using review recent literature at intersection ML, management, human-computer interaction, we find that classical “fitness-for-use” view can benefit from...
Abstract Building ontologies in a collaborative and increasingly community-driven fashion has become central paradigm of modern ontology engineering. This understanding engineering processes is the result intensive theoretical empirical research within Semantic Web community, supported by technology developments such as 2.0. Over 6 years after publication first methodology for engineering, it generally acknowledged that, order to be useful, but also economically feasible, should developed...
The state of the art in human interaction with computational systems blurs line between computations performed by machine logic and algorithms, those that result from input humans, arising their own psychological processes life experience. Current socio-technical systems, known as "social machines" exploit large-scale humans machines. Interactions are motivated numerous goals purposes including financial gain, charitable aid, simply for fun. In this paper we explore landscape social...
Designing an effective and sustainable citizen science (CS)project requires consideration of a great number factors. This makes the overall process unpredictable, even when sound, user-centred design approach is followed by experienced team UX designers. Moreover, such systems are deployed, complexity resulting interactions challenges any attempt to generalisation from retrospective analysis. In this paper, we present case study largest single platform driven data analysis projects date,...
Crowdsourcing via paid microtasks has been successfully applied in a plethora of domains and tasks. Previous efforts for making such crowdsourcing more effective have considered aspects as diverse task workflow design, spam detection, quality control, pricing models. Our work expands upon by examining the potential adding gamification to microtask interfaces means improving both worker engagement effectiveness. We run series experiments image labeling, one most common use cases...
The sharing and reuse of data are seen as critical to solving the most complex problems today. Despite this potential, relatively little attention has been paid a key step in reuse: behaviours involved data-centric sensemaking. We aim address gap by presenting mixed-methods study combining in-depth interviews, think-aloud task screen recording analysis with 31 researchers from different disciplines they summarised interacted both familiar unfamiliar data. use our findings identify detail...
Data is a critical resource for Machine Learning (ML), yet working with data remains key friction point. This paper introduces Croissant, metadata format datasets that simplifies how used by ML tools and frameworks. Croissant makes more discoverable, portable interoperable, thereby addressing significant challenges in management responsible AI. already supported several popular dataset repositories, spanning hundreds of thousands datasets, ready to be loaded into the most
The process of developing ontologies – a formal, explicit specification shared conceptualisation is addressed by well-known methodologies. As for any engineering development, its fundamental basis the collection requirements, which includes elicitation competency questions. Competency questions are defined through interacting with domain and application experts or investigating existing datasets that may be used to populate ontology i.e. knowledge graph. rise in popularity accessibility...
Abstract Semantic technologies promise to solve many challenging problems of the present Web applications. As they achieve a feasible level maturity, become increasingly accepted in various business settings at enterprise level. By contrast, their usability open environments such as Web—with respect issues scalability , dynamism and openness —still requires additional investigation. In particular, services have inherited service communication model, which is primarily based on synchronous...
Most people need textual or visual interfaces in order to make sense of Semantic Web data. In this paper, we investigate the problem generating natural language summaries for data using neural networks. Our end-to-end trainable architecture encodes information from a set triples into vector fixed dimensionality and generates summary by conditioning output on encoded vector. We explore different approaches that enable our models verbalise entities input generated text. systems are trained...