Search Results for author: Danilo Comminiello

Found 35 papers, 18 papers with code

NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement

2 code implementations8 Apr 2024 Giordano Cicchetti, Danilo Comminiello

Real-world documents may suffer various forms of degradation, often resulting in lower accuracy in optical character recognition (OCR) systems.

Binarization Document Enhancement +2

Ship in Sight: Diffusion Models for Ship-Image Super Resolution

1 code implementation27 Mar 2024 Luigi Sigillo, Riccardo Fosco Gramaccioni, Alessandro Nicolosi, Danilo Comminiello

In this context, our method explores in depth the problem of ship image super resolution, which is crucial for coastal and port surveillance.

Denoising Image Generation +4

Towards Explaining Hypercomplex Neural Networks

1 code implementation26 Mar 2024 Eleonora Lopez, Eleonora Grassucci, Debora Capriotti, Danilo Comminiello

To achieve this, we define a type of cosine-similarity transform within the parameterized hypercomplex domain.

Overview of the L3DAS23 Challenge on Audio-Visual Extended Reality

no code implementations14 Feb 2024 Christian Marinoni, Riccardo Fosco Gramaccioni, Changan Chen, Aurelio Uncini, Danilo Comminiello

The primary goal of the L3DAS23 Signal Processing Grand Challenge at ICASSP 2023 is to promote and support collaborative research on machine learning for 3D audio signal processing, with a specific emphasis on 3D speech enhancement and 3D Sound Event Localization and Detection in Extended Reality applications.

Audio Signal Processing Sound Event Localization and Detection +1

Generative AI Meets Semantic Communication: Evolution and Revolution of Communication Tasks

no code implementations10 Jan 2024 Eleonora Grassucci, Jihong Park, Sergio Barbarossa, Seong-Lyun Kim, Jinho Choi, Danilo Comminiello

Disclosing generative models capabilities in semantic communication paves the way for a paradigm shift with respect to conventional communication systems, which has great potential to reduce the amount of data traffic and offers a revolutionary versatility to novel tasks and applications that were not even conceivable a few years ago.

Denoising

SyncFusion: Multimodal Onset-synchronized Video-to-Audio Foley Synthesis

no code implementations23 Oct 2023 Marco Comunità, Riccardo F. Gramaccioni, Emilian Postolache, Emanuele Rodolà, Danilo Comminiello, Joshua D. Reiss

Sound design involves creatively selecting, recording, and editing sound effects for various media like cinema, video games, and virtual/augmented reality.

Generalizing Medical Image Representations via Quaternion Wavelet Networks

1 code implementation16 Oct 2023 Luigi Sigillo, Eleonora Grassucci, Aurelio Uncini, Danilo Comminiello

The proposed quaternion wavelet network (QUAVE) can be easily integrated with any pre-existing medical image analysis or synthesis task, and it can be involved with real, quaternion, or hypercomplex-valued models, generalizing their adoption to single-channel data.

PHYDI: Initializing Parameterized Hypercomplex Neural Networks as Identity Functions

1 code implementation11 Oct 2023 Matteo Mancanelli, Eleonora Grassucci, Aurelio Uncini, Danilo Comminiello

Neural models based on hypercomplex algebra systems are growing and prolificating for a plethora of applications, ranging from computer vision to natural language processing.

Dual Quaternion Rotational and Translational Equivariance in 3D Rigid Motion Modelling

no code implementations11 Oct 2023 Guilherme Vieira, Eleonora Grassucci, Marcos Eduardo Valle, Danilo Comminiello

To overcome these limitations, we employ a dual quaternion representation of rigid motions in the 3D space that jointly describes rotations and translations of point sets, processing each of the points as a single entity.

Human Pose Forecasting

Hypercomplex Multimodal Emotion Recognition from EEG and Peripheral Physiological Signals

1 code implementation11 Oct 2023 Eleonora Lopez, Eleonora Chiarantano, Eleonora Grassucci, Danilo Comminiello

Multimodal emotion recognition from physiological signals is receiving an increasing amount of attention due to the impossibility to control them at will unlike behavioral reactions, thus providing more reliable information.

EEG Multimodal Emotion Recognition

Enhancing Semantic Communication with Deep Generative Models -- An ICASSP Special Session Overview

no code implementations5 Sep 2023 Eleonora Grassucci, Yuki Mitsufuji, Ping Zhang, Danilo Comminiello

Semantic communication is poised to play a pivotal role in shaping the landscape of future AI-driven communication systems.

Generative Semantic Communication: Diffusion Models Beyond Bit Recovery

1 code implementation7 Jun 2023 Eleonora Grassucci, Sergio Barbarossa, Danilo Comminiello

We prove, through an in-depth assessment of multiple scenarios, that our method outperforms existing solutions in generating high-quality images with preserved semantic information even in cases where the received content is significantly degraded.

Hypercomplex Image-to-Image Translation

1 code implementation4 May 2022 Eleonora Grassucci, Luigi Sigillo, Aurelio Uncini, Danilo Comminiello

Image-to-image translation (I2I) aims at transferring the content representation from an input domain to an output one, bouncing along different target domains.

Image-to-Image Translation Translation

Multi-View Hypercomplex Learning for Breast Cancer Screening

1 code implementation12 Apr 2022 Eleonora Lopez, Eleonora Grassucci, Martina Valleriani, Danilo Comminiello

To overcome such limitations, in this paper, we propose a methodological approach for multi-view breast cancer classification based on parameterized hypercomplex neural networks.

 Ranked #1 on Cancer-no cancer per breast classification on InBreast (using extra training data)

Breast Tumour Classification Cancer-no cancer per breast classification +3

Learning Speech Emotion Representations in the Quaternion Domain

1 code implementation5 Apr 2022 Eric Guizzo, Tillman Weyde, Simone Scardapane, Danilo Comminiello

On the one hand, the classifier permits to optimize each latent axis of the embeddings for the classification of a specific emotion-related characteristic: valence, arousal, dominance and overall emotion.

Speech Emotion Recognition

Dual Quaternion Ambisonics Array for Six-Degree-of-Freedom Acoustic Representation

1 code implementation4 Apr 2022 Eleonora Grassucci, Gioia Mancini, Christian Brignone, Aurelio Uncini, Danilo Comminiello

We show that our dual quaternion SELD model with temporal convolution blocks (DualQSELD-TCN) achieves better results with respect to real and quaternion-valued baselines thanks to our augmented representation of the sound field.

Sound Event Localization and Detection

L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment

1 code implementation21 Feb 2022 Eric Guizzo, Christian Marinoni, Marco Pennese, Xinlei Ren, Xiguang Zheng, Chen Zhang, Bruno Masiero, Aurelio Uncini, Danilo Comminiello

The L3DAS22 Challenge is aimed at encouraging the development of machine learning strategies for 3D speech enhancement and 3D sound localization and detection in office-like environments.

Sound Event Localization and Detection Speech Enhancement

PHNNs: Lightweight Neural Networks via Parameterized Hypercomplex Convolutions

4 code implementations8 Oct 2021 Eleonora Grassucci, Aston Zhang, Danilo Comminiello

In this paper, we define the parameterization of hypercomplex convolutional layers and introduce the family of parameterized hypercomplex neural networks (PHNNs) that are lightweight and efficient large-scale models.

Sound Event Detection

A New Class of Efficient Adaptive Filters for Online Nonlinear Modeling

no code implementations19 Apr 2021 Danilo Comminiello, Alireza Nezamdoust, Simone Scardapane, Michele Scarpiniti, Amir Hussain, Aurelio Uncini

In order to make this class of functional link adaptive filters (FLAFs) efficient, we propose low-complexity expansions and frequency-domain adaptation of the parameters.

Acoustic echo cancellation Domain Adaptation

Quaternion Generative Adversarial Networks

3 code implementations19 Apr 2021 Eleonora Grassucci, Edoardo Cicero, Danilo Comminiello

Latest Generative Adversarial Networks (GANs) are gathering outstanding results through a large-scale training, thus employing models composed of millions of parameters requiring extensive computational capabilities.

Image Generation

L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing

1 code implementation12 Apr 2021 Eric Guizzo, Riccardo F. Gramaccioni, Saeid Jamili, Christian Marinoni, Edoardo Massaro, Claudia Medaglia, Giuseppe Nachira, Leonardo Nucciarelli, Ludovica Paglialunga, Marco Pennese, Sveva Pepe, Enrico Rocchi, Aurelio Uncini, Danilo Comminiello

The L3DAS21 Challenge is aimed at encouraging and fostering collaborative research on machine learning for 3D audio signal processing, with particular focus on 3D speech enhancement (SE) and 3D sound localization and detection (SELD).

Audio Signal Processing BIG-bench Machine Learning +1

A Quaternion-Valued Variational Autoencoder

3 code implementations22 Oct 2020 Eleonora Grassucci, Danilo Comminiello, Aurelio Uncini

Deep probabilistic generative models have achieved incredible success in many fields of application.

Combined Sparse Regularization for Nonlinear Adaptive Filters

no code implementations24 Jul 2020 Danilo Comminiello, Michele Scarpiniti, Simone Scardapane, Luis A. Azpicueta-Ruiz, Aurelio Uncini

Nonlinear adaptive filters often show some sparse behavior due to the fact that not all the coefficients are equally useful for the modeling of any nonlinearity.

A Multimodal Deep Network for the Reconstruction of T2W MR Images

no code implementations8 Aug 2019 Antonio Falvo, Danilo Comminiello, Simone Scardapane, Michele Scarpiniti, Aurelio Uncini

In this paper, we present a deep learning method that is able to reconstruct subsampled MR images obtained by reducing the k-space data, while maintaining a high image quality that can be used to observe brain lesions.

Compressing deep quaternion neural networks with targeted regularization

no code implementations26 Jul 2019 Riccardo Vecchi, Simone Scardapane, Danilo Comminiello, Aurelio Uncini

To this end, we investigate two extensions of l1 and structured regularization to the quaternion domain.

Image Reconstruction

Widely Linear Kernels for Complex-Valued Kernel Activation Functions

no code implementations6 Feb 2019 Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Aurelio Uncini

Complex-valued neural networks (CVNNs) have been shown to be powerful nonlinear approximators when the input data can be properly modeled in the complex domain.

Image Classification

Quaternion Convolutional Neural Networks for Detection and Localization of 3D Sound Events

no code implementations17 Dec 2018 Danilo Comminiello, Marco Lella, Simone Scardapane, Aurelio Uncini

Learning from data in the quaternion domain enables us to exploit internal dependencies of 4D signals and treating them as a single entity.

Event Detection Sound Event Detection

Improving Graph Convolutional Networks with Non-Parametric Activation Functions

no code implementations26 Feb 2018 Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Aurelio Uncini

Graph neural networks (GNNs) are a class of neural networks that allow to efficiently perform inference on data that is associated to a graph structure, such as, e. g., citation networks or knowledge graphs.

Knowledge Graphs

Group Sparse Regularization for Deep Neural Networks

1 code implementation2 Jul 2016 Simone Scardapane, Danilo Comminiello, Amir Hussain, Aurelio Uncini

In this paper, we consider the joint task of simultaneously optimizing (i) the weights of a deep neural network, (ii) the number of neurons for each hidden layer, and (iii) the subset of active input features (i. e., feature selection).

feature selection Handwritten Digit Recognition

Effective Blind Source Separation Based on the Adam Algorithm

no code implementations25 May 2016 Michele Scarpiniti, Simone Scardapane, Danilo Comminiello, Raffaele Parisi, Aurelio Uncini

In this paper, we derive a modified InfoMax algorithm for the solution of Blind Signal Separation (BSS) problems by using advanced stochastic methods.

blind source separation Stochastic Optimization

Cannot find the paper you are looking for? You can Submit a new open access paper.