- Semantic Web and Ontologies
- Information Retrieval and Search Behavior
- Topic Modeling
- Data Quality and Management
- Web Data Mining and Analysis
- Advanced Database Systems and Queries
- Scientific Computing and Data Management
- Natural Language Processing Techniques
- Advanced Text Analysis Techniques
- Image Retrieval and Classification Techniques
- Data Management and Algorithms
- Library Science and Information Systems
- Digital and Traditional Archives Management
- Research Data Management Practices
- Speech and dialogue systems
- Data Visualization and Analytics
- Digital Humanities and Scholarship
- Mobile Crowdsensing and Crowdsourcing
- Biomedical Text Mining and Ontologies
- Algorithms and Data Compression
- Explainable Artificial Intelligence (XAI)
- Big Data and Business Intelligence
- Distributed and Parallel Computing Systems
- Expert finding and Q&A systems
- Personal Information Management and User Behavior
University of Padua
2015-2024
Mylan (South Africa)
2022
Politecnico di Milano
2021
Delft University of Technology
2018
University of Tennessee at Knoxville
2017
National Institute of Standards and Technology
2017
University of Modena and Reggio Emilia
2015
Control Systems Research (United States)
2007
Science is facing a so-called reproducibility crisis, where researchers struggle to repeat experiments and get the same or comparable results. This represents a fundamental problem in any scientific discipline because reproducibility lies at the very basis of the scientific method. A central methodological question is how to measure reproducibility and interpret the different measures. In Information Retrieval (IR), current practices rely mainly on comparing averaged scores. If the reproduced score is close enough to the original one, the experiment is deemed successful,...
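The averaged-score comparison described above can be sketched as follows. This is only an illustration, not the procedure of any specific study: the function name `is_reproduced`, the per-topic scores, and the tolerance of 0.01 are all assumptions made for the example.

```python
from statistics import mean


def is_reproduced(original_scores, reproduced_scores, tolerance=0.01):
    """Return True if the two mean scores differ by at most `tolerance`.

    The inputs are per-topic effectiveness scores (e.g. average precision).
    `tolerance` is a hypothetical threshold chosen for illustration only.
    """
    return abs(mean(original_scores) - mean(reproduced_scores)) <= tolerance


# Hypothetical per-topic scores for an original run and a reproduced run.
original = [0.9, 0.1, 0.5, 0.5]
reproduced = [0.5, 0.5, 0.5, 0.5]

print(is_reproduced(original, reproduced))
```

In this made-up example the two runs have the same mean even though their per-topic scores differ substantially, so a check on averages alone would accept the reproduction.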
This report documents the program and outcomes of Dagstuhl Seminar 23031 "Frontiers of Information Access Experimentation for Research and Education", which brought together 38 participants from 12 countries. The seminar addressed technology-enhanced information access (information retrieval, recommender systems, natural language processing) and specifically focused on developing more responsible experimental practices leading to more valid results, both for research as well as for scientific education. The seminar featured a...
This article is a study of the themes and issues concerning the annotation of digital contents, such as textual documents, images, and multimedia documents in general. These contents are automatically managed by different kinds of library management systems and, more generally, by different kinds of information systems. Even though this topic has already been partially studied by other researchers, the previous research work on annotations has left many open issues. These issues concern the lack of clarity about what an annotation is, what its features are, and how it is used. These issues are mainly...
Reproducibility Challenges in Information Retrieval Evaluation. Author: Nicola Ferro, University of Padua, Padova (PD), Italy (ORCID 0000-0001-9219-6239). Journal of Data and Information Quality, Volume 8, Issue 2, February 2017, Article No.: 8, pp 1–4. https://doi.org/10.1145/3020206. Published: 04 January 2017.
The Dagstuhl Seminar on "Reproducibility of Data-Oriented Experiments in e-Science", held on 24-29 January 2016, focused on the core issues and approaches to reproducibility of experiments from a multidisciplinary point of view, sharing the experience coming from several fields of computer science. In this paper, we discuss, summarize, and adapt the main findings of the seminar to the context of IR evaluation, both system-oriented and user-oriented, in order to raise awareness in our community and stimulate it towards increased reproducibility of experiments.
Recently, the ACM created a policy on Artifact Review and Badging, which presents a framework to help SIGs recognize repeatability, replicability, and reproducibility in published research. While the policy established vocabulary and definitions, it did not prescribe procedures for implementation. Rather, the ACM has left this to each SIG to define, given the variety of research traditions and approaches that exist within the community. SIGs are not required to implement badging, but given the growing interest in the topic in the SIGIR community, a task force has been assembled...
Information Retrieval (IR) is a discipline deeply rooted in evaluation since its inception. Indeed, experimentally measuring and statistically validating the performance of IR systems are the only possible ways to compare systems and understand which are better than others and, ultimately, more effective and useful for end-users. Since the seminal paper by Stevens [103], it is known that the properties of the measurement scales determine the operations you should or should not perform with values from those scales. For example, Stevens suggested that you can...
Creating test collections for offline retrieval evaluation requires human effort to judge documents' relevance. This expensive activity motivated much work in developing methods for constructing benchmarks with fewer assessment costs. In this respect, adjudication methods actively decide both which documents experts review and the order in which they review them, to better exploit the assessment budget or to lower it. Researchers evaluate the quality of those methods by measuring the correlation between the known gold ranking of systems under the full collection and the observed...
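The ranking comparison mentioned above can be illustrated with a rank correlation. The abstract is cut off before it names a coefficient, so the use of Kendall's tau here is an assumption, as are the function name `kendall_tau`, the system names, and all the scores. The sketch computes the tau-a variant and needs at least two systems.

```python
from itertools import combinations


def kendall_tau(gold, observed):
    """Kendall's tau-a between two score assignments over the same systems.

    `gold` and `observed` map system names to effectiveness scores.
    Tied pairs count as neither concordant nor discordant.
    """
    concordant = discordant = 0
    for a, b in combinations(sorted(gold), 2):
        product = (gold[a] - gold[b]) * (observed[a] - observed[b])
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
    n = len(gold)
    return (concordant - discordant) / (n * (n - 1) / 2)


# Hypothetical scores: full judgments (gold) vs. a reduced judgment set.
gold = {"sysA": 0.40, "sysB": 0.35, "sysC": 0.30, "sysD": 0.20}
observed = {"sysA": 0.38, "sysB": 0.39, "sysC": 0.25, "sysD": 0.21}

print(kendall_tau(gold, observed))
```

With these made-up scores only the pair sysA/sysB swaps order, so five of the six pairs are concordant and one is discordant, giving a tau of (5 - 1) / 6.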
Feature selection is a common step in many ranking, classification, or prediction tasks and serves many purposes. By removing redundant or noisy features, the accuracy of ranking or classification can be improved and the computational cost of the subsequent learning steps can be reduced. However, feature selection can itself be a computationally expensive process. While for decades confined to theoretical algorithmic papers, quantum computing is now becoming a viable tool to tackle realistic problems, in particular special-purpose solvers based on...
Moffat recently commented on our previous work. Our work focused on how laying the foundations of our evaluation methodology into the theory of measurement can improve our knowledge and understanding of the evaluation measures we use in IR, and how it can shed light on the different types of scales adopted by our measures; we also provided evidence, through extensive experimentation, on the impact of the different types of scales on the statistical analyses, as well as on the impact of departing from their assumptions. Moreover, we investigated, for the first time in IR, the concept of meaningfulness, i.e. the invariance of the experimental...
Interval scales are assumed by several basic descriptive statistics, such as mean and variance, and by many statistical significance tests which are used daily in IR to compare systems. Unfortunately, so far, there has not been any systematic and formal study to discover the actual scale properties of IR measures. Therefore, in this paper, we develop a theory of Information Retrieval (IR) evaluation measures, based on the representational theory of measurements, to determine whether and when IR measures are interval scales. We found that...
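Why the mean presupposes an interval scale can be shown with a small numeric sketch. It is not taken from the paper's theory: the scores, the helper `prefers_x`, and the choice of the square root as an order-preserving rescaling are all assumptions made for illustration. If scores carry only ordinal information, any strictly increasing rescaling is as valid as the original, yet it can reverse a comparison of means; an affine rescaling, the kind permitted on an interval scale, cannot.

```python
from math import sqrt
from statistics import mean


def prefers_x(scores_x, scores_y):
    """Return True if system X has the higher mean score."""
    return mean(scores_x) > mean(scores_y)


# Hypothetical per-topic scores for two systems.
system_x = [0.1, 0.9]
system_y = [0.45, 0.45]

# Comparison on the raw scores.
raw = prefers_x(system_x, system_y)

# Same comparison after a strictly increasing (order-preserving) rescaling.
rescaled = prefers_x([sqrt(s) for s in system_x], [sqrt(s) for s in system_y])

# Same comparison after an affine rescaling (allowed on interval scales).
affine = prefers_x([2 * s + 1 for s in system_x], [2 * s + 1 for s in system_y])

print(raw, rescaled, affine)
```

Here the raw and affine comparisons favour system X, while the square-root rescaling favours system Y, even though it leaves the order of every individual score unchanged.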