Pietro Zanuttigh

ORCID: 0000-0002-9502-2389
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Vision and Imaging
  • Domain Adaptation and Few-Shot Learning
  • Multimodal Machine Learning Applications
  • Advanced Neural Network Applications
  • Optical measurement and interference techniques
  • Advanced Optical Sensing Technologies
  • Advanced Image and Video Retrieval Techniques
  • COVID-19 diagnosis using AI
  • Robotics and Sensor-Based Localization
  • Video Surveillance and Tracking Methods
  • Computer Graphics and Visualization Techniques
  • Video Coding and Compression Technologies
  • Hand Gesture Recognition Systems
  • Human Pose and Action Recognition
  • Advanced Data Compression Techniques
  • 3D Surveying and Cultural Heritage
  • Medical Image Segmentation Techniques
  • Industrial Vision Systems and Defect Detection
  • 3D Shape Modeling and Analysis
  • Image and Object Detection Techniques
  • Image Processing Techniques and Applications
  • Remote Sensing and LiDAR Applications
  • Particle Detector Development and Performance
  • Advanced Image Processing Techniques
  • Image Retrieval and Classification Techniques

University of Padua
2015-2024

Engineering (Italy)
2024

University of Genoa
2013

UNSW Sydney
2006

Istituto Nazionale di Fisica Nucleare, Sezione di Padova
2005

The recent introduction of novel acquisition devices like the Leap Motion and Kinect allows to obtain a very informative description hand pose that can be exploited for accurate gesture recognition. This paper proposes recognition scheme explicitly targeted data. An ad-hoc feature set based on positions orientation fingertips is computed fed into multi-class SVM classifier in order recognize performed gestures. A features also extracted from depth combined with ones improve performance....

10.1109/icip.2014.7025313 article EN 2022 IEEE International Conference on Image Processing (ICIP) 2014-10-01

Deep learning architectures exhibit a critical drop of performance due to catastrophic forgetting when they are required incrementally learn new tasks. Contemporary incremental frameworks focus on image classification and object detection while in this work we formally introduce the problem for semantic segmentation which pixel-wise labeling is considered. To tackle task propose distill knowledge previous model retain information about previously learned classes, whilst updating current...

10.1109/iccvw.2019.00400 article EN 2019-10-01

Microsoft Kinect had a key role in the development of consumer depth sensors being device that brought acquisition to mass market. Despite success this sensor, with introduction second generation, has completely changed technology behind sensor from structured light Time-Of-Flight. This paper presents comparison data provided by first and generation order explain achievements have been obtained switch technology. After an accurate analysis accuracy two under different conditions, sample...

10.1109/icme.2015.7177380 article EN 2022 IEEE International Conference on Multimedia and Expo (ICME) 2015-06-01

Deep neural networks suffer from the major limitation of catastrophic forgetting old tasks when learning new ones. In this paper we focus on class incremental continual in semantic segmentation, where categories are made available over time while previous training data is not retained. The proposed scheme shapes latent space to reduce whilst improving recognition novel classes. Our framework driven by three components which also combine top existing techniques effortlessly. First, prototypes...

10.1109/cvpr46437.2021.00117 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

Deep networks allow to obtain outstanding results in semantic segmentation, however they need be trained a single shot with large amount of data. Continual learning settings where new classes are learned incremental steps and previous training data is no longer available challenging due the catastrophic forgetting phenomenon. Existing approaches typically fail when several performed or presence distribution shift background class. We tackle these issues by recreating for old outlining...

10.1109/iccv48922.2021.00694 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

10.1016/j.cviu.2021.103167 article EN Computer Vision and Image Understanding 2021-01-23

Accurate scene understanding from multiple sensors mounted on cars is a key requirement for autonomous driving systems. Nowadays, this task mainly performed through data-hungry deep learning techniques that need very large amounts of data to be trained. Due the high cost performing segmentation labeling, many synthetic datasets have been proposed. However, most them miss multi-sensor nature data, and do not capture significant changes introduced by variation daytime weather conditions. To...

10.1109/tits.2023.3257086 article EN cc-by IEEE Transactions on Intelligent Transportation Systems 2023-03-28

Federated Learning (FL) has recently emerged as a possible way to tackle the domain shift in real-world Semantic Segmentation (SS) without compromising private nature of collected data. However, most existing works on FL unrealistically assume labeled data re-mote clients. Here we propose novel task (FFreeDA) which clients' is unlabeled and server accesses source dataset for pre-training only. To solve FFreeDA, LADD, leverages knowledge pre-trained model by employing self-supervision with...

10.1109/wacv56688.2023.00052 article EN 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023-01-01

Deep learning techniques have been widely used in autonomous driving systems for the semantic understanding of urban scenes. However, they need a huge amount labeled data training, which is difficult and expensive to acquire. A recently proposed workaround train deep networks using synthetic data, but domain shift between real world representations limits performance. In this work, novel Unsupervised Domain Adaptation (UDA) strategy introduced solve issue. The driven by three components:...

10.1109/tiv.2020.2980671 article EN IEEE Transactions on Intelligent Vehicles 2020-03-13

Deep learning frameworks allowed for a remarkable advancement in semantic segmentation, but the data hungry nature of convolutional networks has rapidly raised demand adaptation techniques able to transfer learned knowledge from label-abundant domains unlabeled ones. In this paper we propose an effective Unsupervised Domain Adaptation (UDA) strategy, based on feature clustering method that captures different modes distribution and groups features same class into tight well-separated...

10.1109/wacv48630.2021.00140 article EN 2021-01-01

The standard class-incremental continual learning setting assumes a set of tasks seen one after the other in fixed and pre-defined order. This is not very realistic federated environments where each client works independently an asynchronous manner getting data for different time-frames orders totally uncorrelated with ones. We introduce novel (AFCL) multiple happens at orderings time slots. tackle this task using prototype-based learning, representation loss, fractal pre-training, modified...

10.1109/cvprw59228.2023.00534 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2023-06-01

With the increasing availability of depth sensors, multimodal frameworks that combine color information with data are gaining interest. However, ground truth for semantic segmentation is burdensome to provide, thus making domain adaptation a significant research area. Yet most methods not able effectively handle data. Specifically, we address challenging source-free setting where performed without reusing source We propose MISFIT: MultImodal Source-Free Information fusion Transformer....

10.1109/wacvw60836.2024.00070 article EN 2024-01-01

Time-of-Flight data is typically affected by a high level of noise and artifacts due to Multi-Path Interference (MPI). While various traditional approaches for ToF improvement have been proposed, machine learning techniques seldom applied this task, mostly the limited availability real world training with depth ground truth. In paper, we avoid rely on labeled in framework. A Coarse-Fine CNN, able exploit multi-frequency MPI correction, trained synthetic truth supervised way. parallel, an...

10.1109/cvpr.2019.00573 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

The semantic understanding of urban scenes is one the key components for an autonomous driving system. Complex deep neural networks this task require to be trained with a huge amount labeled data, which difficult and expensive acquire. A recently proposed workaround usage synthetic however differences between real world limit performance. We propose unsupervised domain adaptation strategy adapt supervised training data. learning exploits three components: standard on adversarial able exploit...

10.1109/cvprw.2019.00160 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2019-06-01

Federated Learning (FL) has recently emerged as a novel machine learning paradigm allowing to preserve privacy and account for the distributed nature of process in many real-world settings. Computer vision tasks deal with huge datasets often critical issues, therefore federated approaches have been presented exploit its privacy-preserving nature. Firstly, this paper introduces different FL settings used computer main challenges that need be tackled. Then, it provides comprehensive overview...

10.1109/access.2023.3310400 article EN cc-by-nc-nd IEEE Access 2023-01-01

Open-vocabulary semantic segmentation enables models to identify novel object categories beyond their training data. While this flexibility represents a significant advancement, current approaches still rely on manually specified class names as input, creating an inherent bottleneck in real-world applications. This work proposes Vocabulary-Free Semantic Segmentation pipeline, eliminating the need for predefined vocabularies. Specifically, we address chicken-and-egg problem where users...

10.48550/arxiv.2502.11891 preprint EN arXiv (Cornell University) 2025-02-17
Coming Soon ...