NFDI4DS | UHH-SEMS - Publication Details

Danilo Comminiello

ORCID: 0000-0003-4067-4504

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5019647783

Research Areas

Speech and Audio Processing
Advanced Adaptive Filtering Techniques
Music and Audio Processing
Blind Source Separation Techniques
Image and Signal Denoising Methods
Neural Networks and Applications
Generative Adversarial Networks and Image Synthesis
Machine Learning and ELM
Control Systems and Identification
Digital Filter Design and Implementation
Music Technology and Sound Studies
Speech Recognition and Synthesis
Model Reduction and Neural Networks
Domain Adaptation and Few-Shot Learning
Advanced Image Processing Techniques
Hearing Loss and Rehabilitation
AI in cancer detection
Acoustic Wave Phenomena Research
Neural Networks and Reservoir Computing
Image Retrieval and Classification Techniques
Human Pose and Action Recognition
Advanced Neural Network Applications
Emotion and Mood Recognition
Medical Image Segmentation Techniques
Advanced MRI Techniques and Applications

Sapienza University of Rome
2016-2025

European University of Rome
2021

Institute of Electrical and Electronics Engineers
2019

Canadian Standards Association
2019

Weatherford College
2014

Henan Tianguan Group (China)
2014

Institute of Electronics, Computer and Telecommunication Engineering
2010

Group sparse regularization for deep neural networks

OPENALEX - Publications

Simone Scardapane Danilo Comminiello Amir Hussain Aurelio Uncini

10.1016/j.neucom.2017.02.029 article EN Neurocomputing 2017-02-10

Nonlinear spline adaptive filtering

OPENALEX - Publications

Michele Scarpiniti Danilo Comminiello Raffaele Parisi Aurelio Uncini

10.1016/j.sigpro.2012.09.021 article EN Signal Processing 2012-10-11

Functional Link Adaptive Filters for Nonlinear Acoustic Echo Cancellation

OPENALEX - Publications

Danilo Comminiello Michele Scarpiniti Luis A. Azpicueta-Ruiz Jerónimo Arenas‐García Aurelio Uncini

This paper introduces a new class of nonlinear adaptive filters, whose structure is based on Hammerstein model. Such filters derive from the functional link filter (FLAF) model, defined by input expansion, which enhances representation signal through projection in higher dimensional space, and subsequent filtering. In particular, two robust FLAF-based architectures are proposed designed ad hoc to tackle nonlinearities acoustic echo cancellation (AEC). The simplest architecture split FLAF,...

10.1109/tasl.2013.2255276 article EN IEEE Transactions on Audio Speech and Language Processing 2013-03-27

Online Sequential Extreme Learning Machine With Kernels

OPENALEX - Publications

Simone Scardapane Danilo Comminiello Michele Scarpiniti Aurelio Uncini

The extreme learning machine (ELM) was recently proposed as a unifying framework for different families of algorithms. classical ELM model consists linear combination fixed number nonlinear expansions the input vector. Learning in is hence equivalent to finding optimal weights that minimize error on dataset. update works batch mode, either with explicit feature mappings or implicit defined by kernels. Although an online version has been former, no work done up this point latter, and whether...

10.1109/tnnls.2014.2382094 article EN IEEE Transactions on Neural Networks and Learning Systems 2014-12-31

Attention-map augmentation for hypercomplex breast cancer classification

OPENALEX - Publications

Eleonora Lopez Filippo Betello Federico Carmignani Eleonora Grassucci Danilo Comminiello

Breast cancer is the most widespread neoplasm among women and early detection of this disease critical. Deep learning techniques have become great interest to improve diagnostic performance. However, distinguishing between malignant benign masses in whole mammograms poses a challenge, as they appear nearly identical an untrained eye, region (ROI) constitutes only small fraction entire image. In paper, we propose framework, parameterized hypercomplex attention maps (PHAM), overcome these...

10.1016/j.patrec.2024.04.014 article EN cc-by Pattern Recognition Letters 2024-04-18

Hammerstein uniform cubic spline adaptive filters: Learning and convergence properties

OPENALEX - Publications

Michele Scarpiniti Danilo Comminiello Raffaele Parisi Aurelio Uncini

10.1016/j.sigpro.2014.01.019 article EN Signal Processing 2014-01-27

Novel Cascade Spline Architectures for the Identification of Nonlinear Systems

OPENALEX - Publications

Michele Scarpiniti Danilo Comminiello Raffaele Parisi Aurelio Uncini

In this paper two novel nonlinear cascade adaptive architectures, here called sandwich models, suitable for the identification of general systems are presented. The proposed architectures rely on combination structural blocks, each one implementing a linear filter or memoryless function. All functions involved in adaptation process based spline and can be easily modified during learning using gradient-based techniques. particular, simple form on-line algorithms is derived. addition, we...

10.1109/tcsi.2015.2423791 article EN IEEE Transactions on Circuits and Systems I Regular Papers 2015-06-16

L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment

OPENALEX - Publications

Eric Guizzo C. Marinoni Marco Pennese Xinlei Ren Xiguang Zheng and 4 more

The L3DAS22 Challenge is aimed at encouraging the development of machine learning strategies for 3D speech enhancement and sound localization detection in office-like environments. This challenge improves extends tasks L3DAS21 edition. We generated a new dataset, which maintains same general characteristics datasets, but with an extended number data points adding constrains that improve baseline model's efficiency overcome major difficulties encountered by participants previous challenge....

10.1109/icassp43922.2022.9746872 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022-04-27

Learning Speech Emotion Representations in the Quaternion Domain

OPENALEX - Publications

Eric Guizzo Tillman Weyde Simone Scardapane Danilo Comminiello

The modeling of human emotion expression in speech signals is an important, yet challenging task. high resource demand recognition models, combined with the general scarcity emotion-labelled data are obstacles to development and application effective solutions this field. In paper, we present approach jointly circumvent these difficulties. Our method, named RH-emo, a novel semi-supervised architecture aimed at extracting quaternion embeddings from real-valued monoaural spectrograms, enabling...

10.1109/taslp.2023.3250840 article EN cc-by IEEE/ACM Transactions on Audio Speech and Language Processing 2023-01-01

Semantic Communications Based on Adaptive Generative Models and Information Bottleneck

OPENALEX - Publications

Sergio Barbarossa Danilo Comminiello Eleonora Grassucci Francesco Pezone Stefania Sardellitti and 1 more

Semantic communications represent a significant breakthrough with respect to the current communication paradigm, as they focus on recovering meaning behind transmitted sequence of symbols, rather than symbols themselves. In semantic communications, scope destination is not recover list identical ones, but message that semantically equivalent emitted by source. This paradigm shift introduces many degrees freedom encoding and decoding rules can be exploited make systems much more efficient....

10.1109/mcom.005.2200829 article EN IEEE Communications Magazine 2023-11-01

Nonlinear Acoustic Echo Cancellation Based on Sparse Functional Link Representations

OPENALEX - Publications

Danilo Comminiello Michele Scarpiniti Luis A. Azpicueta-Ruiz Jerónimo Arenas‐García Aurelio Uncini

Recently, a new class of nonlinear adaptive filtering architectures has been introduced based on the functional link filter (FLAF) model. Here we focus specifically split FLAF (SFLAF) architecture, which separates adaptation linear and coefficients using two different filters in parallel. This property makes SFLAF well-suited method for problems like acoustic echo cancellation (NAEC), separation tasks brings some performance improvement. Although flexibility is one main features SFLAF,...

10.1109/taslp.2014.2324175 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2014-05-14

Nonlinear system identification using IIR Spline Adaptive Filters

OPENALEX - Publications

Michele Scarpiniti Danilo Comminiello Raffaele Parisi Aurelio Uncini

10.1016/j.sigpro.2014.08.045 article EN Signal Processing 2014-09-06

Steady-State Performance of Spline Adaptive Filters

OPENALEX - Publications

Michele Scarpiniti Danilo Comminiello Gaetano Scarano Raffaele Parisi Aurelio Uncini

Recently, a novel class of nonlinear adaptive filters, called spline filters (SAFs), has been introduced and demonstrated to be very effective in many practical applications. The learning rules these architectures are based on the least mean square (LMS) algorithm. In order provide theoretical foundation SAF, this paper we steady-state performance evaluation. particular, after stochastic analysis behavior SAF approach under Gaussian assumption, analytical derivation excess error (EMSE)...

10.1109/tsp.2015.2493986 article EN IEEE Transactions on Signal Processing 2015-10-26

Quaternion Convolutional Neural Networks for Detection and Localization of 3D Sound Events

OPENALEX - Publications

Danilo Comminiello Marco Lella Simone Scardapane Aurelio Uncini

Learning from data in the quaternion domain enables us to exploit internal dependencies of 4D signals and treating them as a single entity. One models that perfectly suits with quaternion-valued processing is represented by 3D acoustic their spherical harmonics decomposition. In this paper, we address problem localizing detecting sound events spatial field using processing. particular, consider harmonic components captured first-order ambisonic microphone process convolutional neural...

10.1109/icassp.2019.8682711 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019-04-17

PHNNs: Lightweight Neural Networks via Parameterized Hypercomplex Convolutions

OPENALEX - Publications

Eleonora Grassucci Aston Zhang Danilo Comminiello

Hypercomplex neural networks have proven to reduce the overall number of parameters while ensuring valuable performance by leveraging properties Clifford algebras. Recently, hypercomplex linear layers been further improved involving efficient parameterized Kronecker products. In this article, we define parameterization convolutional and introduce family (PHNNs) that are lightweight large-scale models. Our method grasps convolution rules filter organization directly from data without...

10.1109/tnnls.2022.3226772 article EN cc-by IEEE Transactions on Neural Networks and Learning Systems 2022-12-13

Generative Semantic Communication: Diffusion Models Beyond Bit Recovery

OPENALEX - Publications

Eleonora Grassucci Sergio Barbarossa Danilo Comminiello

Semantic communication is expected to be one of the cores next-generation AI-based communications. One possibilities offered by semantic capability regenerate, at destination side, images or videos semantically equivalent transmitted ones, without necessarily recovering sequence bits. The current solutions still lack ability build complex scenes from received partial information. Clearly, there an unmet need balance effectiveness generation methods and complexity information, possibly taking...

10.48550/arxiv.2306.04321 preprint EN cc-by-nc-nd arXiv (Cornell University) 2023-01-01

A semi-supervised random vector functional-link network based on the transductive framework

OPENALEX - Publications

Simone Scardapane Danilo Comminiello Michele Scarpiniti Aurelio Uncini

10.1016/j.ins.2015.07.060 article EN Information Sciences 2015-08-22

Deep Recurrent Neural Networks for Audio Classification in Construction Sites

OPENALEX - Publications

Michele Scarpiniti Danilo Comminiello Aurelio Uncini Yong-Cheol Lee

In this paper, we propose a Deep Recurrent Neural Network (DRNN) approach based on Long-Short Term Memory (LSTM) units for the classification of audio signals recorded in construction sites. Five classes multiple vehicles and tools, normally used sites, have been considered. The input provided to DRNN consists concatenation several spectral features, like MFCCs, mel-scaled spectrogram, chroma contrast. proposed architecture feature extraction described. Some experimental results, obtained by...

10.23919/eusipco47968.2020.9287802 article EN 2021 29th European Signal Processing Conference (EUSIPCO) 2020-12-18

L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing

OPENALEX - Publications

Eric Guizzo Riccardo F. Gramaccioni Saeid Jamili C. Marinoni Edoardo Massaro and 9 more

The L3DAS21 Challenge is aimed at encouraging and fostering collaborative research on machine learning for 3D audio signal processing, with particular focus speech enhancement (SE) sound localization detection (SELD). Alongside the challenge, we release dataset, a 65 hours corpus, accompanied Python API that facilitates data usage results submission stage. Usually, approaches to tasks are based single-perspective Ambisonics recordings or arrays of single-capsule microphones. We propose,...

10.1109/mlsp52302.2021.9596248 preprint EN 2021-10-25

Generative AI Meets Semantic Communication: Evolution and Revolution of Communication Tasks

OPENALEX - Publications

Eleonora Grassucci Jihong Park Sergio Barbarossa Seong‐Lyun Kim Jinho Choi and 1 more

While deep generative models are showing exciting abilities in computer vision and natural language processing, their adoption communication frameworks is still far underestimated. These methods demonstrated to evolve solutions classic problems such as denoising, restoration, or compression. Nevertheless, can unveil real potential semantic frameworks, which the receiver not asked recover sequence of bits used encode transmitted (semantic) message, but only regenerate content that...

10.48550/arxiv.2401.06803 preprint EN cc-by arXiv (Cornell University) 2024-01-01

Frequency domain quaternion adaptive filters: Algorithms and convergence performance

OPENALEX - Publications

Francesca Ortolani Danilo Comminiello Michele Scarpiniti Aurelio Uncini

10.1016/j.sigpro.2016.11.002 article EN Signal Processing 2016-11-08

Diffusion Models for Audio Semantic Communication

OPENALEX - Publications

Eleonora Grassucci C. Marinoni M. Andrea Rodríguez Danilo Comminiello

Directly sending audio signals from a transmitter to receiver across noisy channel may absorb consistent bandwidth and be prone errors when trying recover the transmitted bits. On contrary, recent semantic communication approach proposes send semantics then regenerate semantically content at without exactly recovering bitstream. In this paper, we propose generative framework that faces problem as an inverse problem, therefore being robust different corruptions. Our method transmits...

10.1109/icassp48485.2024.10447612 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024-03-18

A Quaternion-Valued Variational Autoencoder

OPENALEX - Publications

Eleonora Grassucci Danilo Comminiello Aurelio Uncini

Deep probabilistic generative models have achieved incredible success in many fields of application. Among such models, variational autoencoders (VAEs) proved their ability modeling a process by learning latent representation the input. In this paper, we propose novel VAE defined quaternion domain, which exploits properties algebra to improve performance while significantly reducing number parameters required network. The proposed with respect traditional VAEs relies on leverage internal...

10.1109/icassp39728.2021.9413859 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021-05-13

StawGAN: Structural-Aware Generative Adversarial Networks for Infrared Image Translation

OPENALEX - Publications

Luigi Sigillo Eleonora Grassucci Danilo Comminiello

This paper addresses the problem of translating night-time thermal infrared images, which are most adopted image modalities to analyze scenes, daytime color images (NTIT2DC), provide better perceptions objects. We introduce a novel model that focuses on enhancing quality target generation without merely colorizing it. The proposed structural aware (StawGAN) enables translation better-shaped and high-definition objects in domain. test our aerial DroneVeichle dataset containing RGB-IR paired...

10.1109/iscas46773.2023.10181838 article EN 2022 IEEE International Symposium on Circuits and Systems (ISCAS) 2023-05-21

Coming Soon ...