Nicola Ferro

ORCID: 0000-0001-9219-6239
Research Areas
  • Semantic Web and Ontologies
  • Information Retrieval and Search Behavior
  • Topic Modeling
  • Data Quality and Management
  • Web Data Mining and Analysis
  • Advanced Database Systems and Queries
  • Scientific Computing and Data Management
  • Natural Language Processing Techniques
  • Advanced Text Analysis Techniques
  • Image Retrieval and Classification Techniques
  • Data Management and Algorithms
  • Library Science and Information Systems
  • Digital and Traditional Archives Management
  • Research Data Management Practices
  • Speech and dialogue systems
  • Data Visualization and Analytics
  • Digital Humanities and Scholarship
  • Mobile Crowdsensing and Crowdsourcing
  • Biomedical Text Mining and Ontologies
  • Algorithms and Data Compression
  • Explainable Artificial Intelligence (XAI)
  • Big Data and Business Intelligence
  • Distributed and Parallel Computing Systems
  • Expert finding and Q&A systems
  • Personal Information Management and User Behavior

University of Padua
2015-2024

Mylan (South Africa)
2022

Politecnico di Milano
2021

Delft University of Technology
2018

University of Tennessee at Knoxville
2017

National Institute of Standards and Technology
2017

University of Modena and Reggio Emilia
2015

Control Systems Research (United States)
2007

Science is facing a so-called reproducibility crisis, where researchers struggle to repeat experiments and to get the same or comparable results. This represents a fundamental problem in any scientific discipline because reproducibility lies at the very basis of the scientific method. A central methodological question is how to measure reproducibility and interpret the different measures. In Information Retrieval (IR), current practices rely mainly on comparing averaged scores. If the reproduced score is close enough to the original one, the experiment is deemed successful,...

10.1016/j.ipm.2023.103332 article EN cc-by Information Processing & Management 2023-03-14

This report documents the program and outcomes of the Dagstuhl Seminar 23031 "Frontiers of Information Access Experimentation for Research and Education", which brought together 38 participants from 12 countries. The seminar addressed technology-enhanced information access (information retrieval, recommender systems, natural language processing) and specifically focused on developing more responsible experimental practices leading to more valid results, both for research as well as for scientific education. The seminar featured a...

10.1145/3636341.3636351 article EN ACM SIGIR Forum 2023-06-01

This article is a study of the themes and issues concerning the annotation of digital contents, such as textual documents, images, and multimedia documents in general. These contents are automatically managed by different kinds of library management systems and, more generally, by information systems. Even though this topic has already been partially studied by other researchers, the previous research work on annotations has left many open issues. These issues concern the lack of clarity about what an annotation is, what its features are, and how it is used. These issues are mainly...

10.1145/1292591.1292594 article EN ACM Transactions on Information Systems 2007-11-01

Reproducibility Challenges in Information Retrieval Evaluation. Author: Nicola Ferro, University of Padua, Padova (PD), Italy. Journal of Data and Information Quality, Volume 8, Issue 2, February 2017, Article No. 8, pp. 1–4. https://doi.org/10.1145/3020206. Published: 04 January 2017.

10.1145/3020206 article EN Journal of Data and Information Quality 2017-01-04

The Dagstuhl Seminar on "Reproducibility of Data-Oriented Experiments in e-Science", held on 24-29 January 2016, focused on the core issues and approaches to the reproducibility of experiments from a multidisciplinary point of view, sharing the experience coming from several fields of computer science. In this paper, we discuss, summarize, and adapt the main findings of the seminar to the context of IR evaluation (both system-oriented and user-oriented) in order to raise awareness in our community and stimulate it towards increased reproducibility of experiments.

10.1145/2964797.2964808 article EN ACM SIGIR Forum 2016-06-27

Recently, the ACM created a policy on Artifact Review and Badging, which presents a framework to help SIGs recognize repeatability, replicability and reproducibility in published research. While the policy established a vocabulary and definitions, it did not prescribe procedures for implementation. Rather, the ACM has left this to each SIG to define, given the variety of research traditions and approaches that exist within the ACM community. SIGs are not required to implement badging, but given the growing interest in the topic in the SIGIR community, a task force has been assembled...

10.1145/3274784.3274786 article EN ACM SIGIR Forum 2018-08-31

Information Retrieval (IR) is a discipline deeply rooted in evaluation since its inception. Indeed, experimentally measuring and statistically validating the performance of IR systems are the only possible ways to compare systems and understand which are better than others and, ultimately, more effective and useful for end-users. Since the seminal paper by Stevens [103], it is known that the properties of the measurement scales determine the operations you should or should not perform with values from those scales. For example, Stevens suggested that you can...

10.1109/access.2021.3116857 article EN cc-by IEEE Access 2021-01-01

Creating test collections for offline retrieval evaluation requires human effort to judge documents' relevance. This expensive activity motivated much work in developing methods for constructing benchmarks with fewer assessment costs. In this respect, adjudication methods actively decide both which documents experts review and the order in which they review them, to better exploit the assessment budget or to lower it. Researchers evaluate the quality of those methods by measuring the correlation between the known gold ranking of systems under the full collection and the observed...

10.1145/3583780.3614916 preprint EN cc-by 2023-10-21

Feature selection is a common step in many ranking, classification, or prediction tasks and serves many purposes. By removing redundant or noisy features, the accuracy of ranking or classification can be improved and the computational cost of the subsequent learning steps can be reduced. However, feature selection can itself be a computationally expensive process. While for decades confined to theoretical algorithmic papers, quantum computing is now becoming a viable tool to tackle realistic problems, in particular special-purpose solvers based on...

10.1145/3477495.3531755 article EN Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval 2022-07-06

Moffat recently commented on our previous work. Our work focused on how laying the foundations of our evaluation methodology into the theory of measurement can improve our knowledge and understanding of the measures we use in IR, and how it can shed light on the different types of scales adopted by our measures; we also provided evidence, through extensive experimentation, on the impact of the different types of scales on the statistical analyses, as well as on the impact of departing from their assumptions. Moreover, we investigated, for the first time in IR, the concept of meaningfulness, i.e. the invariance of the experimental...

10.48550/arxiv.2212.11735 preprint EN cc-by arXiv (Cornell University) 2022-01-01

Interval scales are assumed by several basic descriptive statistics, such as mean and variance, and by many statistical significance tests which are used daily in IR to compare systems. Unfortunately, so far, there has not been any systematic and formal study to discover the actual scale properties of IR measures. Therefore, in this paper, we develop a theory of Information Retrieval (IR) evaluation measures, based on the representational theory of measurements, to determine whether and when IR measures are interval scales. We found that...

10.1109/tkde.2018.2840708 article EN IEEE Transactions on Knowledge and Data Engineering 2018-05-25