Search Results for author: Danilo Comminiello

Found 35 papers, 18 papers with code

NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement

2 code implementations • 8 Apr 2024 • Giordano Cicchetti, Danilo Comminiello

Real-world documents may suffer various forms of degradation, often resulting in lower accuracy in optical character recognition (OCR) systems.

Ranked #1 on Binarization on DIBCO 2019

Binarization Document Enhancement +2

161

Paper
Code

Ship in Sight: Diffusion Models for Ship-Image Super Resolution

1 code implementation • 27 Mar 2024 • Luigi Sigillo, Riccardo Fosco Gramaccioni, Alessandro Nicolosi, Danilo Comminiello

In this context, our method explores in depth the problem of ship image super resolution, which is crucial for coastal and port surveillance.

Denoising Image Generation +4

Paper
Code

Towards Explaining Hypercomplex Neural Networks

1 code implementation • 26 Mar 2024 • Eleonora Lopez, Eleonora Grassucci, Debora Capriotti, Danilo Comminiello

To achieve this, we define a type of cosine-similarity transform within the parameterized hypercomplex domain.

Paper
Code

Overview of the L3DAS23 Challenge on Audio-Visual Extended Reality

no code implementations • 14 Feb 2024 • Christian Marinoni, Riccardo Fosco Gramaccioni, Changan Chen, Aurelio Uncini, Danilo Comminiello

The primary goal of the L3DAS23 Signal Processing Grand Challenge at ICASSP 2023 is to promote and support collaborative research on machine learning for 3D audio signal processing, with a specific emphasis on 3D speech enhancement and 3D Sound Event Localization and Detection in Extended Reality applications.

Audio Signal Processing Sound Event Localization and Detection +1

Paper
Add Code

Generative AI Meets Semantic Communication: Evolution and Revolution of Communication Tasks

no code implementations • 10 Jan 2024 • Eleonora Grassucci, Jihong Park, Sergio Barbarossa, Seong-Lyun Kim, Jinho Choi, Danilo Comminiello

Disclosing generative models capabilities in semantic communication paves the way for a paradigm shift with respect to conventional communication systems, which has great potential to reduce the amount of data traffic and offers a revolutionary versatility to novel tasks and applications that were not even conceivable a few years ago.

Denoising

Paper
Add Code

GATSY: Graph Attention Network for Music Artist Similarity

no code implementations • 1 Nov 2023 • Andrea Giuseppe Di Francesco, Giuliano Giampietro, Indro Spinelli, Danilo Comminiello

The artist similarity quest has become a crucial subject in social and scientific contexts.

Graph Attention

Paper
Add Code

SyncFusion: Multimodal Onset-synchronized Video-to-Audio Foley Synthesis

no code implementations • 23 Oct 2023 • Marco Comunità, Riccardo F. Gramaccioni, Emilian Postolache, Emanuele Rodolà, Danilo Comminiello, Joshua D. Reiss

Sound design involves creatively selecting, recording, and editing sound effects for various media like cinema, video games, and virtual/augmented reality.

Paper
Add Code

Generalizing Medical Image Representations via Quaternion Wavelet Networks

1 code implementation • 16 Oct 2023 • Luigi Sigillo, Eleonora Grassucci, Aurelio Uncini, Danilo Comminiello

The proposed quaternion wavelet network (QUAVE) can be easily integrated with any pre-existing medical image analysis or synthesis task, and it can be involved with real, quaternion, or hypercomplex-valued models, generalizing their adoption to single-channel data.

Paper
Code

PHYDI: Initializing Parameterized Hypercomplex Neural Networks as Identity Functions

1 code implementation • 11 Oct 2023 • Matteo Mancanelli, Eleonora Grassucci, Aurelio Uncini, Danilo Comminiello

Neural models based on hypercomplex algebra systems are growing and prolificating for a plethora of applications, ranging from computer vision to natural language processing.

Paper
Code

Dual Quaternion Rotational and Translational Equivariance in 3D Rigid Motion Modelling

no code implementations • 11 Oct 2023 • Guilherme Vieira, Eleonora Grassucci, Marcos Eduardo Valle, Danilo Comminiello

To overcome these limitations, we employ a dual quaternion representation of rigid motions in the 3D space that jointly describes rotations and translations of point sets, processing each of the points as a single entity.

Human Pose Forecasting

Paper
Add Code

Hypercomplex Multimodal Emotion Recognition from EEG and Peripheral Physiological Signals

1 code implementation • 11 Oct 2023 • Eleonora Lopez, Eleonora Chiarantano, Eleonora Grassucci, Danilo Comminiello

Multimodal emotion recognition from physiological signals is receiving an increasing amount of attention due to the impossibility to control them at will unlike behavioral reactions, thus providing more reliable information.

EEG Multimodal Emotion Recognition

Paper
Code

Attention-Map Augmentation for Hypercomplex Breast Cancer Classification

no code implementations • 11 Oct 2023 • Eleonora Lopez, Filippo Betello, Federico Carmignani, Eleonora Grassucci, Danilo Comminiello

In this step, a parameterized hypercomplex neural network (PHNN) is employed to perform breast cancer classification.

Breast Cancer Histology Image Classification Breast Tumour Classification +2

Paper
Add Code

Enhancing Semantic Communication with Deep Generative Models -- An ICASSP Special Session Overview

no code implementations • 5 Sep 2023 • Eleonora Grassucci, Yuki Mitsufuji, Ping Zhang, Danilo Comminiello

Semantic communication is poised to play a pivotal role in shaping the landscape of future AI-driven communication systems.

Paper
Add Code

Generative Semantic Communication: Diffusion Models Beyond Bit Recovery

1 code implementation • 7 Jun 2023 • Eleonora Grassucci, Sergio Barbarossa, Danilo Comminiello

We prove, through an in-depth assessment of multiple scenarios, that our method outperforms existing solutions in generating high-quality images with preserved semantic information even in cases where the received content is significantly degraded.

Paper
Code

StawGAN: Structural-Aware Generative Adversarial Networks for Infrared Image Translation

1 code implementation • 18 May 2023 • Luigi Sigillo, Eleonora Grassucci, Danilo Comminiello

We test our model on aerial images of the DroneVeichle dataset containing RGB-IR paired images.

Translation

Paper
Code

Hypercomplex Image-to-Image Translation

1 code implementation • 4 May 2022 • Eleonora Grassucci, Luigi Sigillo, Aurelio Uncini, Danilo Comminiello

Image-to-image translation (I2I) aims at transferring the content representation from an input domain to an output one, bouncing along different target domains.

Ranked #3 on Image-to-Image Translation on CelebA-HQ

Image-to-Image Translation Translation

Paper
Code

Multi-View Hypercomplex Learning for Breast Cancer Screening

1 code implementation • 12 Apr 2022 • Eleonora Lopez, Eleonora Grassucci, Martina Valleriani, Danilo Comminiello

To overcome such limitations, in this paper, we propose a methodological approach for multi-view breast cancer classification based on parameterized hypercomplex neural networks.

Ranked #1 on Cancer-no cancer per breast classification on InBreast (using extra training data)

Breast Tumour Classification Cancer-no cancer per breast classification +3

Paper
Code

Learning Speech Emotion Representations in the Quaternion Domain

1 code implementation • 5 Apr 2022 • Eric Guizzo, Tillman Weyde, Simone Scardapane, Danilo Comminiello

On the one hand, the classifier permits to optimize each latent axis of the embeddings for the classification of a specific emotion-related characteristic: valence, arousal, dominance and overall emotion.

Speech Emotion Recognition

Paper
Code

Dual Quaternion Ambisonics Array for Six-Degree-of-Freedom Acoustic Representation

1 code implementation • 4 Apr 2022 • Eleonora Grassucci, Gioia Mancini, Christian Brignone, Aurelio Uncini, Danilo Comminiello

We show that our dual quaternion SELD model with temporal convolution blocks (DualQSELD-TCN) achieves better results with respect to real and quaternion-valued baselines thanks to our augmented representation of the sound field.

Ranked #1 on Sound Event Localization and Detection on L3DAS21

Sound Event Localization and Detection

Paper
Code

L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment

1 code implementation • 21 Feb 2022 • Eric Guizzo, Christian Marinoni, Marco Pennese, Xinlei Ren, Xiguang Zheng, Chen Zhang, Bruno Masiero, Aurelio Uncini, Danilo Comminiello

The L3DAS22 Challenge is aimed at encouraging the development of machine learning strategies for 3D speech enhancement and 3D sound localization and detection in office-like environments.

Sound Event Localization and Detection Speech Enhancement

Paper
Code

PHNNs: Lightweight Neural Networks via Parameterized Hypercomplex Convolutions

4 code implementations • 8 Oct 2021 • Eleonora Grassucci, Aston Zhang, Danilo Comminiello

In this paper, we define the parameterization of hypercomplex convolutional layers and introduce the family of parameterized hypercomplex neural networks (PHNNs) that are lightweight and efficient large-scale models.

Ranked #1 on Sound Event Detection on L3DAS21

Sound Event Detection

Paper
Code

A New Class of Efficient Adaptive Filters for Online Nonlinear Modeling

no code implementations • 19 Apr 2021 • Danilo Comminiello, Alireza Nezamdoust, Simone Scardapane, Michele Scarpiniti, Amir Hussain, Aurelio Uncini

In order to make this class of functional link adaptive filters (FLAFs) efficient, we propose low-complexity expansions and frequency-domain adaptation of the parameters.

Acoustic echo cancellation Domain Adaptation

Paper
Add Code

Quaternion Generative Adversarial Networks

3 code implementations • 19 Apr 2021 • Eleonora Grassucci, Edoardo Cicero, Danilo Comminiello

Latest Generative Adversarial Networks (GANs) are gathering outstanding results through a large-scale training, thus employing models composed of millions of parameters requiring extensive computational capabilities.

Ranked #1 on Image Generation on Oxford 102 Flowers 128x128

Image Generation

Paper
Code

L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing

1 code implementation • 12 Apr 2021 • Eric Guizzo, Riccardo F. Gramaccioni, Saeid Jamili, Christian Marinoni, Edoardo Massaro, Claudia Medaglia, Giuseppe Nachira, Leonardo Nucciarelli, Ludovica Paglialunga, Marco Pennese, Sveva Pepe, Enrico Rocchi, Aurelio Uncini, Danilo Comminiello

The L3DAS21 Challenge is aimed at encouraging and fostering collaborative research on machine learning for 3D audio signal processing, with particular focus on 3D speech enhancement (SE) and 3D sound localization and detection (SELD).

Audio Signal Processing BIG-bench Machine Learning +1

Paper
Code

A Quaternion-Valued Variational Autoencoder

3 code implementations • 22 Oct 2020 • Eleonora Grassucci, Danilo Comminiello, Aurelio Uncini

Deep probabilistic generative models have achieved incredible success in many fields of application.

Paper
Code

Combined Sparse Regularization for Nonlinear Adaptive Filters

no code implementations • 24 Jul 2020 • Danilo Comminiello, Michele Scarpiniti, Simone Scardapane, Luis A. Azpicueta-Ruiz, Aurelio Uncini

Nonlinear adaptive filters often show some sparse behavior due to the fact that not all the coefficients are equally useful for the modeling of any nonlinearity.

Paper
Add Code

A Multimodal Deep Network for the Reconstruction of T2W MR Images

no code implementations • 8 Aug 2019 • Antonio Falvo, Danilo Comminiello, Simone Scardapane, Michele Scarpiniti, Aurelio Uncini

In this paper, we present a deep learning method that is able to reconstruct subsampled MR images obtained by reducing the k-space data, while maintaining a high image quality that can be used to observe brain lesions.

Paper
Add Code

Compressing deep quaternion neural networks with targeted regularization

no code implementations • 26 Jul 2019 • Riccardo Vecchi, Simone Scardapane, Danilo Comminiello, Aurelio Uncini

To this end, we investigate two extensions of l1 and structured regularization to the quaternion domain.

Image Reconstruction

Paper
Add Code

Widely Linear Kernels for Complex-Valued Kernel Activation Functions

no code implementations • 6 Feb 2019 • Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Aurelio Uncini

Complex-valued neural networks (CVNNs) have been shown to be powerful nonlinear approximators when the input data can be properly modeled in the complex domain.

Image Classification

Paper
Add Code

Quaternion Convolutional Neural Networks for Detection and Localization of 3D Sound Events

no code implementations • 17 Dec 2018 • Danilo Comminiello, Marco Lella, Simone Scardapane, Aurelio Uncini

Learning from data in the quaternion domain enables us to exploit internal dependencies of 4D signals and treating them as a single entity.

Event Detection Sound Event Detection

Paper
Add Code

Recurrent Neural Networks with Flexible Gates using Kernel Activation Functions

no code implementations • 11 Jul 2018 • Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Simone Totaro, Aurelio Uncini

Gated recurrent neural networks have achieved remarkable results in the analysis of sequential data.

Paper
Add Code

Improving Graph Convolutional Networks with Non-Parametric Activation Functions

no code implementations • 26 Feb 2018 • Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Aurelio Uncini

Graph neural networks (GNNs) are a class of neural networks that allow to efficiently perform inference on data that is associated to a graph structure, such as, e. g., citation networks or knowledge graphs.

Knowledge Graphs

Paper
Add Code

Group Sparse Regularization for Deep Neural Networks

1 code implementation • 2 Jul 2016 • Simone Scardapane, Danilo Comminiello, Amir Hussain, Aurelio Uncini

In this paper, we consider the joint task of simultaneously optimizing (i) the weights of a deep neural network, (ii) the number of neurons for each hidden layer, and (iii) the subset of active input features (i. e., feature selection).

feature selection Handwritten Digit Recognition

Paper
Code

Effective Blind Source Separation Based on the Adam Algorithm

no code implementations • 25 May 2016 • Michele Scarpiniti, Simone Scardapane, Danilo Comminiello, Raffaele Parisi, Aurelio Uncini

In this paper, we derive a modified InfoMax algorithm for the solution of Blind Signal Separation (BSS) problems by using advanced stochastic methods.

blind source separation Stochastic Optimization

Paper
Add Code

Learning activation functions from data using cubic spline interpolation

no code implementations • 18 May 2016 • Simone Scardapane, Michele Scarpiniti, Danilo Comminiello, Aurelio Uncini

Neural networks require a careful design in order to perform properly on a given task.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.