Search Results for author: Nicolae-Catalin Ristea

Found 20 papers, 12 papers with code

Cascaded Cross-Modal Transformer for Audio-Textual Classification

1 code implementation • 15 Jan 2024 • Nicolae-Catalin Ristea, Andrei Anghel, Radu Tudor Ionescu

Subsequently, we combine language-specific Bidirectional Encoder Representations from Transformers (BERT) with Wav2Vec2. 0 audio features via a novel cascaded cross-modal transformer (CCMT).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Code

ICASSP 2023 Acoustic Echo Cancellation Challenge

1 code implementation • 22 Sep 2023 • Ross Cutler, Ando Saabas, Tanel Parnamaa, Marju Purin, Evgenii Indenbom, Nicolae-Catalin Ristea, Jegor Gužvin, Hannes Gamper, Sebastian Braun, Robert Aichner

This is the fourth AEC challenge and it is enhanced by adding a second track for personalized acoustic echo cancellation, reducing the algorithmic + buffering latency to 20ms, as well as including a full-band version of AECMOS.

Acoustic echo cancellation Speech Enhancement

341

Paper
Code

Multi-dimensional Speech Quality Assessment in Crowdsourcing

2 code implementations • 14 Sep 2023 • Babak Naderi, Ross Cutler, Nicolae-Catalin Ristea

The commonly used standard ITU-T Rec.

Speech Enhancement

188

Paper
Code

RoDia: A New Dataset for Romanian Dialect Identification from Speech

1 code implementation • 6 Sep 2023 • Codrut Rotaru, Nicolae-Catalin Ristea, Radu Tudor Ionescu

We introduce RoDia, the first dataset for Romanian dialect identification from speech.

Dialect Identification Speaker Verification +2

Paper
Code

CL-MAE: Curriculum-Learned Masked Autoencoders

1 code implementation • 31 Aug 2023 • Neelu Madan, Nicolae-Catalin Ristea, Kamal Nasrollahi, Thomas B. Moeslund, Radu Tudor Ionescu

In this paper, we propose a curriculum learning approach that updates the masking strategy to continually increase the complexity of the self-supervised reconstruction task.

Representation Learning

Paper
Code

Cascaded Cross-Modal Transformer for Request and Complaint Detection

no code implementations • 27 Jul 2023 • Nicolae-Catalin Ristea, Radu Tudor Ionescu

We propose a novel cascaded cross-modal transformer (CCMT) that combines speech and text transcripts to detect customer requests and complaints in phone conversations.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Self-Distilled Masked Auto-Encoders are Efficient Video Anomaly Detectors

1 code implementation • 21 Jun 2023 • Nicolae-Catalin Ristea, Florinel-Alin Croitoru, Radu Tudor Ionescu, Marius Popescu, Fahad Shahbaz Khan, Mubarak Shah

We propose an efficient abnormal event detection model based on a lightweight masked auto-encoder (AE) applied at the video frame level.

Ranked #13 on Anomaly Detection on CUHK Avenue

Anomaly Detection Event Detection

Paper
Code

Sea Ice Segmentation From SAR Data by Convolutional Transformer Networks

no code implementations • 13 Jun 2023 • Nicolae-Catalin Ristea, Andrei Anghel, Mihai Datcu

Sea ice is a crucial component of the Earth's climate system and is highly sensitive to changes in temperature and atmospheric conditions.

Paper
Add Code

DeepVQE: Real Time Deep Voice Quality Enhancement for Joint Acoustic Echo Cancellation, Noise Suppression and Dereverberation

no code implementations • 5 Jun 2023 • Evgenii Indenbom, Nicolae-Catalin Ristea, Ando Saabas, Tanel Parnamaa, Jegor Guzvin, Ross Cutler

Acoustic echo cancellation (AEC), noise suppression (NS) and dereverberation (DR) are an integral part of modern full-duplex communication systems.

Acoustic echo cancellation

Paper
Add Code

Lightning Fast Video Anomaly Detection via Adversarial Knowledge Distillation

no code implementations • 28 Nov 2022 • Nicolae-Catalin Ristea, Florinel-Alin Croitoru, Dana Dascalescu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Mubarak Shah

We propose a very fast frame-level model for anomaly detection in video, which learns to detect anomalies by distilling knowledge from multiple highly accurate object-level teacher models.

Ranked #16 on Anomaly Detection on CUHK Avenue

Anomaly Detection Knowledge Distillation +1

Paper
Add Code

Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection

1 code implementation • 25 Sep 2022 • Neelu Madan, Nicolae-Catalin Ristea, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah

In this work, we extend our previous self-supervised predictive convolutional attentive block (SSPCAB) with a 3D masked convolutional layer, a transformer for channel-wise attention, as well as a novel self-supervised objective based on Huber loss.

Ranked #4 on Anomaly Detection on CUHK Avenue

Event Detection Fault Detection +1

Paper
Code

LeRaC: Learning Rate Curriculum

no code implementations • 18 May 2022 • Florinel-Alin Croitoru, Nicolae-Catalin Ristea, Radu Tudor Ionescu, Nicu Sebe

In this work, we propose a novel curriculum learning approach termed Learning Rate Curriculum (LeRaC), which leverages the use of a different learning rate for each layer of a neural network to create a data-free curriculum during the initial training epochs.

Ranked #3 on Speech Emotion Recognition on CREMA-D

Audio Classification QNLI +2

Paper
Add Code

Guided deep learning by subaperture decomposition: ocean patterns from SAR imagery

no code implementations • 9 Apr 2022 • Nicolae-Catalin Ristea, Andrei Anghel, Mihai Datcu, Bertrand Chapron

Overall, we encourage the development of data centring approaches, showing that, data preprocessing could bring significant performance improvements over existing deep learning models.

Paper
Add Code

Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-Resolution

1 code implementation • 8 Apr 2022 • Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron, Olivian Savencu, Nicolae-Catalin Ristea, Nicolae Verga, Fahad Shahbaz Khan

Our attention module uses the convolution operation to perform joint spatial-channel attention on multiple concatenated input tensors, where the kernel (receptive field) size controls the reduction rate of the spatial attention, and the number of convolutional filters controls the reduction rate of the channel attention, respectively.

Ranked #1 on Image Super-Resolution on IXI

Computed Tomography (CT) Image Super-Resolution

Paper
Code

SepTr: Separable Transformer for Audio Spectrogram Processing

1 code implementation • 17 Mar 2022 • Nicolae-Catalin Ristea, Radu Tudor Ionescu, Fahad Shahbaz Khan

Following the successful application of vision transformers in multiple computer vision tasks, these models have drawn the attention of the signal processing community.

Ranked #1 on Time Series Analysis on Speech Commands

Audio Classification Speech Emotion Recognition +1

Paper
Code

Self-Supervised Predictive Convolutional Attentive Block for Anomaly Detection

4 code implementations • CVPR 2022 • Nicolae-Catalin Ristea, Neelu Madan, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah

Our block is equipped with a loss that minimizes the reconstruction error with respect to the masked area in the receptive field.

Ranked #1 on Anomaly Detection on CUHK Avenue (TBDC metric)

One-Class Classification

1,684

Paper
Code

CyTran: A Cycle-Consistent Transformer with Multi-Level Consistency for Non-Contrast to Contrast CT Translation

1 code implementation • 12 Oct 2021 • Nicolae-Catalin Ristea, Andreea-Iuliana Miron, Olivian Savencu, Mariana-Iuliana Georgescu, Nicolae Verga, Fahad Shahbaz Khan, Radu Tudor Ionescu

Our neural model can be trained on unpaired images, due to the integration of a multi-level cycle-consistency loss.

Computed Tomography (CT) Style Transfer +1

Paper
Code

Self-paced ensemble learning for speech and audio classification

no code implementations • 22 Mar 2021 • Nicolae-Catalin Ristea, Radu Tudor Ionescu

Instead of just combining the models, we propose a self-paced ensemble learning scheme in which models learn from each other over several iterations.

Ranked #5 on Speech Emotion Recognition on CREMA-D

Audio Classification Ensemble Learning +2

Paper
Add Code

Emotion Recognition System from Speech and Visual Information based on Convolutional Neural Networks

no code implementations • 29 Feb 2020 • Nicolae-Catalin Ristea, Liviu Cristian Dutu, Anamaria Radoi

In order to increase the accuracy of the recognition system, we analyze also the speech data and fuse the information coming from both sources, i. e., visual and audio.

Emotion Recognition

Paper
Add Code

Non-linear Neurons with Human-like Apical Dendrite Activations

1 code implementation • 2 Feb 2020 • Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Nicolae-Catalin Ristea, Nicu Sebe

In order to classify linearly non-separable data, neurons are typically organized into multi-layer neural networks that are equipped with at least one hidden layer.

Ranked #7 on Speech Emotion Recognition on CREMA-D

Speech Emotion Recognition

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.