Search Results for author: Hamid Eghbal-zadeh

Found 25 papers, 17 papers with code

Addressing Parameter Choice Issues in Unsupervised Domain Adaptation by Aggregation

1 code implementation2 May 2023 Marius-Constantin Dinu, Markus Holzleitner, Maximilian Beck, Hoan Duc Nguyen, Andrea Huber, Hamid Eghbal-zadeh, Bernhard A. Moser, Sergei Pereverzyev, Sepp Hochreiter, Werner Zellinger

Our method outperforms deep embedded validation (DEV) and importance weighted validation (IWV) on all datasets, setting a new state-of-the-art performance for solving parameter choice issues in unsupervised domain adaptation with theoretical error guarantees.

Unsupervised Domain Adaptation

Learning General Audio Representations with Large-Scale Training of Patchout Audio Transformers

1 code implementation25 Nov 2022 Khaled Koutini, Shahed Masoudian, Florian Schmid, Hamid Eghbal-zadeh, Jan Schlüter, Gerhard Widmer

Furthermore, we will show that transformers trained on Audioset can be extremely effective representation extractors for a wide range of downstream tasks.

Few-Shot Learning by Dimensionality Reduction in Gradient Space

1 code implementation7 Jun 2022 Martin Gauch, Maximilian Beck, Thomas Adler, Dmytro Kotsur, Stefan Fiel, Hamid Eghbal-zadeh, Johannes Brandstetter, Johannes Kofler, Markus Holzleitner, Werner Zellinger, Daniel Klotz, Sepp Hochreiter, Sebastian Lehner

We introduce SubGD, a novel few-shot learning method which is based on the recent finding that stochastic gradient descent updates tend to live in a low-dimensional parameter subspace.

Dimensionality Reduction Few-Shot Learning

The balancing principle for parameter choice in distance-regularized domain adaptation

1 code implementation NeurIPS 2021 Werner Zellinger, Natalia Shepeleva, Marius-Constantin Dinu, Hamid Eghbal-zadeh, Hoan Nguyen, Bernhard Nessler, Sergei Pereverzyev, Bernhard A. Moser

Our approach starts with the observation that the widely-used method of minimizing the source error, penalized by a distance measure between source and target feature representations, shares characteristics with regularized ill-posed inverse problems.

Unsupervised Domain Adaptation

Efficient Training of Audio Transformers with Patchout

2 code implementations11 Oct 2021 Khaled Koutini, Jan Schlüter, Hamid Eghbal-zadeh, Gerhard Widmer

However, one of the main shortcomings of transformer models, compared to the well-established CNNs, is the computational complexity.

Ranked #3 on Audio Classification on FSD50K (using extra training data)

Acoustic Scene Classification Audio Classification +2

Over-Parameterization and Generalization in Audio Classification

no code implementations19 Jul 2021 Khaled Koutini, Hamid Eghbal-zadeh, Florian Henkel, Jan Schlüter, Gerhard Widmer

Convolutional Neural Networks (CNNs) have been dominating classification tasks in various domains, such as machine vision, machine listening, and natural language processing.

Acoustic Scene Classification Audio Classification +1

Receptive Field Regularization Techniques for Audio Classification and Tagging with Deep Convolutional Neural Networks

1 code implementation26 May 2021 Khaled Koutini, Hamid Eghbal-zadeh, Gerhard Widmer

As state-of-the-art CNN architectures-in computer vision and other domains-tend to go deeper in terms of number of layers, their RF size increases and therefore they degrade in performance in several audio classification and tagging tasks.

Acoustic Scene Classification Audio Classification +2

Receptive-Field Regularized CNNs for Music Classification and Tagging

1 code implementation27 Jul 2020 Khaled Koutini, Hamid Eghbal-zadeh, Verena Haunschmid, Paul Primus, Shreyan Chowdhury, Gerhard Widmer

However, the MIR field is still dominated by the classical VGG-based CNN architecture variants, often in combination with more complex modules such as attention, and/or techniques such as pre-training on large datasets.

Classification General Classification +4

On Data Augmentation and Adversarial Risk: An Empirical Analysis

no code implementations6 Jul 2020 Hamid Eghbal-zadeh, Khaled Koutini, Paul Primus, Verena Haunschmid, Michal Lewandowski, Werner Zellinger, Bernhard A. Moser, Gerhard Widmer

Data augmentation techniques have become standard practice in deep learning, as it has been shown to greatly improve the generalisation abilities of models.

Adversarial Attack Data Augmentation

Emotion and Theme Recognition in Music with Frequency-Aware RF-Regularized CNNs

1 code implementation28 Oct 2019 Khaled Koutini, Shreyan Chowdhury, Verena Haunschmid, Hamid Eghbal-zadeh, Gerhard Widmer

We present CP-JKU submission to MediaEval 2019; a Receptive Field-(RF)-regularized and Frequency-Aware CNN approach for tagging music with emotion/mood labels.

Acoustic Scene Classification Scene Classification

The Receptive Field as a Regularizer in Deep Convolutional Neural Networks for Acoustic Scene Classification

3 code implementations3 Jul 2019 Khaled Koutini, Hamid Eghbal-zadeh, Matthias Dorfer, Gerhard Widmer

To this end, we analyse the receptive field (RF) of these CNNs and demonstrate the importance of the RF to the generalization capability of the models.

Acoustic Scene Classification General Classification +1

On Conditioning GANs to Hierarchical Ontologies

no code implementations16 May 2019 Hamid Eghbal-zadeh, Lukas Fischer, Thomas Hoch

Additionally, we show that the O-GAN achieves better conditioning results evaluated by implicit similarity between the text and the generated image.

Image Generation

Mixture Density Generative Adversarial Networks

1 code implementation CVPR 2019 Hamid Eghbal-zadeh, Werner Zellinger, Gerhard Widmer

Generative Adversarial Networks have surprising ability for generating sharp and realistic images, though they are known to suffer from the so-called mode collapse problem.

Deep SNP: An End-to-end Deep Neural Network with Attention-based Localization for Break-point Detection in SNP Array Genomic data

1 code implementation22 Jun 2018 Hamid Eghbal-zadeh, Lukas Fischer, Niko Popitsch, Florian Kromp, Sabine Taschner-Mandl, Khaled Koutini, Teresa Gerber, Eva Bozsaky, Peter F. Ambros, Inge M. Ambros, Gerhard Widmer, Bernhard A. Moser

We show, that Deep SNP is capable of successfully predicting the presence or absence of a breakpoint in large genomic windows and outperforms state-of-the-art neural network models.

Deep Within-Class Covariance Analysis for Robust Audio Representation Learning

no code implementations10 Nov 2017 Hamid Eghbal-zadeh, Matthias Dorfer, Gerhard Widmer

To tackle this problem, we propose Deep Within-Class Covariance Analysis (DWCCA), a deep neural network layer that significantly reduces the within-class covariance of a DNN's representation, improving performance on unseen test data from a shifted distribution.

Acoustic Scene Classification Classification +3

Probabilistic Generative Adversarial Networks

no code implementations6 Aug 2017 Hamid Eghbal-zadeh, Gerhard Widmer

The central idea is to integrate a probabilistic model (a Gaussian Mixture Model, in our case) into the GAN framework which supports a new kind of loss function (based on likelihood rather than classification loss), and at the same time gives a meaningful measure of the quality of the outputs generated by the network.

Generative Adversarial Network

Likelihood Estimation for Generative Adversarial Networks

no code implementations24 Jul 2017 Hamid Eghbal-zadeh, Gerhard Widmer

We present a simple method for assessing the quality of generated images in Generative Adversarial Networks (GANs).

Cannot find the paper you are looking for? You can Submit a new open access paper.