NFDI4DS | UHH-SEMS - Publication Details

Pietro Astolfi

ORCID: 0000-0002-5192-9608

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5049825997

Research Areas

Advanced Neuroimaging Techniques and Applications
Multimodal Machine Learning Applications
Medical Imaging and Analysis
Fetal and Pediatric Neurological Disorders
Medical Image Segmentation Techniques
Domain Adaptation and Few-Shot Learning
Generative Adversarial Networks and Image Synthesis
Image Retrieval and Classification Techniques
Smart Agriculture and AI
Advanced MRI Techniques and Applications
Advanced Image and Video Retrieval Techniques
Robotics and Sensor-Based Localization
Natural Language Processing Techniques
Handwritten Text Recognition Techniques
MRI in cancer diagnosis
Advanced Neural Network Applications
Data Management and Algorithms
Functional Brain Connectivity Studies
Machine Learning and Data Classification
Cerebrospinal fluid and hydrocephalus
Hand Gesture Recognition Systems
Philosophy and History of Science
Brain Tumor Detection and Classification
Cancer-related molecular mechanisms research
Insect Pheromone Research and Control

Fondazione Bruno Kessler
2020-2023

Italian Institute of Technology
2020-2023

University of Trento
2020-2023

Institut national de recherche en informatique et en automatique
2023

Kessler Foundation
2020

Politecnico di Milano
2017-2018

Object-centric Binding in Contrastive Language-Image Pretraining

OPENALEX - Publications

Rim Assouel Pietro Astolfi Florian Bordes Michal Drozdzal Adriana Romero-Soriano

Recent advances in vision language models (VLM) have been driven by contrastive such as CLIP, which learn to associate visual information with their corresponding text descriptions. However, these limitations understanding complex compositional scenes involving multiple objects and spatial relationships. To address challenges, we propose a novel approach that diverges from commonly used strategies, rely on the design of hard-negative augmentations. Instead, our work focuses integrating...

10.48550/arxiv.2502.14113 preprint EN arXiv (Cornell University) 2025-02-19

Semi-supervised learning made simple with self-supervised clustering

OPENALEX - Publications

Enrico Fini Pietro Astolfi Karteek Alahari Xavier Alameda-Pineda Julien Mairal and 2 more

Self-supervised learning models have been shown to learn rich visual representations without requiring human annotations. However, in many real-world scenarios, labels are partially available, motivating a recent line of work on semi-supervised methods inspired by self-supervised principles. In this paper, we propose conceptually simple yet empirically powerful approach turn clustering-based such as SwAV or DINO into learners. More precisely, introduce multi-task framework merging supervised...

10.1109/cvpr52729.2023.00311 preprint EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions

OPENALEX - Publications

Jack Urbanek Florian Bordes Pietro Astolfi Mary Williamson Vasu Sharma and 1 more

10.1109/cvpr52733.2024.02521 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Classifyber, a robust streamline-based linear classifier for white matter bundle segmentation

OPENALEX - Publications

Giulia Bertò Daniel Bullock Pietro Astolfi Soichi Hayashi Luca Zigiotto and 7 more

Virtual delineation of white matter bundles in the human brain is paramount importance for multiple applications, such as pre-surgical planning and connectomics. A substantial body literature related to methods that automatically segment from diffusion Magnetic Resonance Imaging (dMRI) data indirectly, by exploiting either idea connectivity between regions or geometry fiber paths obtained with tractography techniques, or, directly, through information volumetric data. Despite remarkable...

10.1016/j.neuroimage.2020.117402 article EN cc-by NeuroImage 2020-09-23

An Introduction to Vision-Language Modeling

OPENALEX - Publications

Florian Bordes Richard Yuanzhe Pang Anurag Ajay Alexander C. Li Adrien Bardes and 36 more

Following the recent popularity of Large Language Models (LLMs), several attempts have been made to extend them visual domain. From having a assistant that could guide us through unfamiliar environments generative models produce images using only high-level text description, vision-language model (VLM) applications will significantly impact our relationship with technology. However, there are many challenges need be addressed improve reliability those models. While language is discrete,...

10.48550/arxiv.2405.17247 preprint EN arXiv (Cornell University) 2024-05-27

Vineyard Autonomous Navigation in the Echord++ GRAPE Experiment

OPENALEX - Publications

Pietro Astolfi Alessandro Gabrielli Luca Bascetta Matteo Matteucci

Field robotics is a fast developing research field, in particular precision agriculture gaining popularity due to the high return productivity and reduced pollution impact on environment. The GRAPE project an ECHORD++ robotic experiment aimed at use of mobile robot for automatic pheromone dispenser distribution vineyards, reduce pesticide thanks mate disruption. This work describes autonomous navigation system such robot. For specific scenario real state art does not exists, so we adapted...

10.1016/j.ifacol.2018.08.401 article EN IFAC-PapersOnLine 2018-01-01

Improving Text-to-Image Consistency via Automatic Prompt Optimization

OPENALEX - Publications

Oscar Mañas Pietro Astolfi Melissa Hall Candace Ross Jack Urbanek and 4 more

Impressive advances in text-to-image (T2I) generative models have yielded a plethora of high performing which are able to generate aesthetically appealing, photorealistic images. Despite the progress, these still struggle produce images that consistent with input prompt, oftentimes failing capture object quantities, relations and attributes properly. Existing solutions improve prompt-image consistency suffer from following challenges: (1) they require model fine-tuning, (2) only focus on...

10.48550/arxiv.2403.17804 preprint EN arXiv (Cornell University) 2024-03-26

Consistency-diversity-realism Pareto fronts of conditional image generative models

OPENALEX - Publications

Pietro Astolfi Marlène Careil Melissa Hall Oscar Mañas Matthew J. Muckley and 3 more

Building world models that accurately and comprehensively represent the real is utmost aspiration for conditional image generative as it would enable their use simulators. For these to be successful models, they should not only excel at quality prompt-image consistency but also ensure high representation diversity. However, current research in mostly focuses on creative applications are predominantly concerned with human preferences of aesthetics. We note have inference time mechanisms - or...

10.48550/arxiv.2406.10429 preprint EN arXiv (Cornell University) 2024-06-14

Improved baselines for vision-language pre-training

OPENALEX - Publications

Enrico Fini Pietro Astolfi Adriana Romero-Soriano Jakob Verbeek Michal Drozdzal

Contrastive learning has emerged as an efficient framework to learn multimodal representations. CLIP, a seminal work in this area, achieved impressive results by training on paired image-text data using the contrastive loss. Recent claims improvements over CLIP additional non-contrastive losses inspired from self-supervised learning. However, it is sometimes hard disentangle contribution of these other implementation details, e.g., augmentation or regularization techniques, used train model....

10.48550/arxiv.2305.08675 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Supervised tractogram filtering using Geometric Deep Learning

OPENALEX - Publications

Pietro Astolfi Ruben Verhagen Laurent Petit Emanuele Olivetti Silvio Sarubbo and 3 more

10.1016/j.media.2023.102893 article EN Medical Image Analysis 2023-07-17

A Stem-Based Dissection of Inferior Fronto-Occipital Fasciculus with A Deep Learning Model

OPENALEX - Publications

Pietro Astolfi Alessandro De Benedictis Silvio Sarubbo Giulia Bertò Emanuele Olivetti and 2 more

The aim of this work is to improve the virtual dissection Inferior Frontal Occipital Fasciculus (IFOF) by combining a recent insight on white matter anatomy from ex-vivo and data driven approach with deep learning model. Current methods tract are not robust respect false positives neglecting neuroanatomical waypoints given tract, like stem. In we design model segment stem IFOF show how can be improved. proposed method validated Human Connectome Project dataset, where expert neuroanatomists...

10.1109/isbi45749.2020.9098483 article EN 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI) 2020-04-01

DP-RDM: Adapting Diffusion Models to Private Domains Without Fine-Tuning

OPENALEX - Publications

Jonathan Lebensold Maziar Sanjabi Pietro Astolfi Adriana Romero-Soriano Kamalika Chaudhuri and 2 more

Text-to-image diffusion models have been shown to suffer from sample-level memorization, possibly reproducing near-perfect replica of images that they are trained on, which may be undesirable. To remedy this issue, we develop the first differentially private (DP) retrieval-augmented generation algorithm is capable generating high-quality image samples while providing provable privacy guarantees. Specifically, assume access a text-to-image model on small amount public data, and design DP...

10.48550/arxiv.2403.14421 preprint EN arXiv (Cornell University) 2024-03-21

$\mathbb{X}$-Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs

OPENALEX - Publications

Vlad Sobal Mark Ibrahim Randall Balestriero Vivien Cabannes Diane Bouchacourt and 3 more

Learning good representations involves capturing the diverse ways in which data samples relate. Contrastive loss - an objective matching related underlies methods from self-supervised to multimodal learning. losses, however, can be viewed more broadly as modifying a similarity graph indicate how should relate embedding space. This view reveals shortcoming contrastive learning: is binary, only one sample positive sample. Crucially, similarities \textit{across} are ignored. Based on this...

10.48550/arxiv.2407.18134 preprint EN arXiv (Cornell University) 2024-07-25

EvalGIM: A Library for Evaluating Generative Image Models

OPENALEX - Publications

Melissa Hall Oscar Mañas Reyhane Askari Mark Ibrahim Candace Ross and 12 more

As the use of text-to-image generative models increases, so does adoption automatic benchmarking methods used in their evaluation. However, while metrics and datasets abound, there are few unified libraries that provide a framework for performing evaluations across many metrics. Furthermore, rapid introduction increasingly robust requires evaluation remain flexible to new Finally, remains gap synthesizing order deliver actionable takeaways about model performance. To enable unified,...

10.48550/arxiv.2412.10604 preprint EN arXiv (Cornell University) 2024-12-13

Classifyber, a robust streamline-based linear classifier for white matter bundle segmentation

OPENALEX - Publications

Giulia Bertò Daniel Bullock Pietro Astolfi Soichi Hayashi Luca Zigiotto and 7 more

Abstract Virtual delineation of white matter bundles in the human brain is paramount importance for multiple applications, such as pre-surgical planning and connectomics. A substantial body literature related to methods that automatically segment from diffusion Magnetic Resonance Imaging (dMRI) data indirectly, by exploiting either idea connectivity between regions or geometry fiber paths obtained with tractography techniques, or, directly, through information volumetric data. Despite...

10.1101/2020.02.10.942714 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2020-02-12

Clustered Dynamic Graph CNN for Biometric 3D Hand Shape Recognition

OPENALEX - Publications

Jan Svoboda Pietro Astolfi Davide Boscaini Jonathan Masci Michael M. Bronstein

The research in biometric recognition using hand shape has been somewhat stagnating the last decade. Meanwhile, computer vision and machine learning have experienced a paradigm shift with renaissance of deep learning, which set new state-of-the-art many related fields. Inspired by successful applications for other modalities, we propose novel approach to 3D from RGB-D data based on geometric techniques. We show how train our model synthetic retain performance real samples during test time....

10.1109/ijcb48548.2020.9304894 article EN 2020-09-28

Instance-Conditioned GAN Data Augmentation for Representation Learning

OPENALEX - Publications

Pietro Astolfi Arantxa Casanova Jakob Verbeek P. Vincent Adriana Romero-Soriano and 1 more

Data augmentation has become a crucial component to train state-of-the-art visual representation models. However, handcrafting combinations of transformations that lead improved performances is laborious task, which can result in visually unrealistic samples. To overcome these limitations, recent works have explored the use generative models as learnable data tools, showing promising results narrow application domains, e.g., few-shot learning and low-data medical imaging. In this paper, we...

10.48550/arxiv.2303.09677 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Semi-supervised learning made simple with self-supervised clustering

OPENALEX - Publications

Enrico Fini Pietro Astolfi Karteek Alahari Xavier Alameda-Pineda Julien Mairal and 2 more

10.48550/arxiv.2306.07483 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Coming Soon ...