NFDI4DS | UHH-SEMS - Publication Details

Daniele Mari

ORCID: 0000-0003-0727-3725

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5002481506

Research Areas

Remote Sensing and LiDAR Applications
3D Shape Modeling and Analysis
3D Surveying and Cultural Heritage
Computer Graphics and Visualization Techniques
Music and Audio Processing
Speech and Audio Processing
Speech Recognition and Synthesis
Image and Signal Denoising Methods
Advanced Data Compression Techniques
Advanced Malware Detection Techniques
Gait Recognition and Analysis
Hand Gesture Recognition Systems
Image and Object Detection Techniques
Internet Traffic Analysis and Secure E-voting
Landslides and related hazards
Image Processing and 3D Reconstruction
Advanced Image Processing Techniques
Image Processing Techniques and Applications
Advanced Steganography and Watermarking Techniques
Robotics and Sensor-Based Localization

University of Padua
2021-2024

Point Cloud Geometry Scalable Coding Using a Resolution and Quality-conditioned Latents Probability Estimator

OPENALEX - Publications

Daniele Mari André F. R. Guarda Nuno M. M. Rodrigues Simone Milani Fernando Pereira

In the current age, users consume multimedia content in very heterogeneous scenarios terms of network, hardware, and display capabilities. A naive solution to this problem is encode multiple independent streams, each covering a different possible requirement for clients, with an obvious negative impact both storage computational requirements. These drawbacks can be avoided by using codecs that enable scalability, i.e., ability generate progressive bitstream, containing base layer followed...

10.48550/arxiv.2502.14099 preprint EN arXiv (Cornell University) 2025-02-19

The Sound of Silence: Efficiency of First Digit Features in Synthetic Audio Detection

OPENALEX - Publications

Daniele Mari Federica Latora Simone Milani

The recent integration of generative neural strategies and audio processing techniques have fostered the widespread synthetic speech synthesis or transformation algorithms. This capability proves to be harmful in many legal informative processes (news, biometric authentication, evidence courts, etc.). Thus, development efficient detection algorithms is both crucial challenging due heterogeneity forgery techniques.This work investigates discriminative role silenced parts shows how first digit...

10.1109/wifs55849.2022.9975404 article EN 2022-12-12

CACTUS: Content-Aware Compression and Transmission Using Semantics for Automotive LiDAR Data

OPENALEX - Publications

Daniele Mari Elena Camuffo Simone Milani

Many recent cloud or edge computing strategies for automotive applications require transmitting huge amounts of Light Detection and Ranging (LiDAR) data from terminals to centralized processing units. As a matter fact, the development effective Point Cloud (PC) compression that preserve semantic information, which is critical scene understanding, proves be crucial. Segmentation have always been treated as two independent tasks; however, since not all classes are equally important end task,...

10.3390/s23125611 article EN cc-by Sensors 2023-06-15

Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders

OPENALEX - Publications

Daniele Mari Simone Milani

Learned image compression codecs have recently achieved impressive performances surpassing the most efficient coding architectures. However, approaches are trained to minimize rate and distortion which often leads unsatisfactory visual results at low bitrates since perceptual metrics not taken into account. In this paper, we show that conditional diffusion models can lead promising in generative task when used as a decoder, that, given compressed representation, they allow creating new...

10.48550/arxiv.2403.02887 preprint EN arXiv (Cornell University) 2024-03-05

Point Cloud Geometry Scalable Coding with a Quality-Conditioned Latents Probability Estimator

OPENALEX - Publications

Daniele Mari André F. R. Guarda Nuno M. M. Rodrigues Simone Milani Fernando Pereira

The widespread usage of point clouds (PC) for immersive visual applications has resulted in the use very heterogeneous receiving conditions and devices, notably terms network, hardware, display capabilities. In this scenario, quality scalability, i.e., ability to reconstruct a signal at different qualities by progressively decoding single bitstream, is major requirement that yet be conveniently addressed, most learning-based PC coding solutions. This paper proposes scalability scheme, named...

10.48550/arxiv.2404.07698 preprint EN arXiv (Cornell University) 2024-04-11

Point Cloud Geometry Scalable Coding with a Quality-Conditioned Latents Probability Estimator

OPENALEX - Publications

Daniele Mari André F. R. Guarda Nuno M. M. Rodrigues Simone Milani Fernando Pereira

10.1109/icip51287.2024.10647489 article EN 2022 IEEE International Conference on Image Processing (ICIP) 2024-09-27

Ternary Neural Networks for Gait Identification in Wearable Devices

OPENALEX - Publications

Giacomo Agnetti Andrea Migliorati Daniele Mari Tiziano Bianchi Simone Milani and 1 more

10.1109/wifs61860.2024.10810715 article EN 2024-12-02

Features Denoising for Learned Image Coding

OPENALEX - Publications

Daniele Mari Simone Milani

In recent years the advancements in neural networks field have fostered advent of end-to-end learned coding schemes capable efficient image representations that reduce required storage space and transmission time.In general, features produced by these encoders are entropy-efficient permit reconstructing coded with low distortion. However, whenever they applied to a generic image, its latent representation might not be optimal one feature since network parameters were trained generalize on...

10.1109/euvip53989.2022.9922837 article EN 2022-09-11

Looking Through Walls: Inferring Scenes from Video-Surveillance Encrypted Traffic

OPENALEX - Publications

Daniele Mari Samuele Giuliano Piazzetta Sara Bordin Luca Pajola Sebastiano Verde and 2 more

Nowadays living environments are characterized by networks of inter-connected sensing devices that accomplish different tasks, e.g., video surveillance an environment a network CCTV cameras. A malicious user could gather sensitive details on people's activities eavesdropping the exchanged data packets. To overcome this problem, streams protected encryption systems, but even secured channels may still leak some information. In paper, we show it is possible to infer visual intercepting...

10.1109/icassp39728.2021.9414391 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021-05-13

All-for-One and One-For-All: Deep learning-based feature fusion for Synthetic Speech Detection

OPENALEX - Publications

Daniele Mari Davide Salvi Paolo Bestagini Simone Milani

Recent advances in deep learning and computer vision have made the synthesis counterfeiting of multimedia content more accessible than ever, leading to possible threats dangers from malicious users. In audio field, we are witnessing growth speech deepfake generation techniques, which solicit development synthetic detection algorithms counter mischievous uses such as frauds or identity thefts. this paper, consider three different feature sets proposed literature for task present a model that...

10.48550/arxiv.2307.15555 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Coming Soon ...