Search Results for author: Ian McLoughlin

Found 18 papers, 3 papers with code

Joint Generative-Contrastive Representation Learning for Anomalous Sound Detection

no code implementations20 May 2023 Xiao-Min Zeng, Yan Song, Zhu Zhuo, Yu Zhou, Yu-Hong Li, Hui Xue, Li-Rong Dai, Ian McLoughlin

In this paper, we propose a joint generative and contrastive representation learning method (GeCo) for anomalous sound detection (ASD).

Contrastive Learning Representation Learning

AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram Transformer

no code implementations7 Mar 2023 Kang Li, Yan Song, Li-Rong Dai, Ian McLoughlin, Xin Fang, Lin Liu

In this paper, we propose an effective sound event detection (SED) method based on the audio spectrogram transformer (AST) model, pretrained on the large-scale AudioSet for audio tagging (AT) task, termed AST-SED.

Audio Tagging Event Detection +1

A Light-weight Deep Learning Model for Remote Sensing Image Classification

no code implementations25 Feb 2023 Lam Pham, Cam Le, Dat Ngo, Anh Nguyen, Jasmin Lampert, Alexander Schindler, Ian McLoughlin

In this paper, we present a high-performance and light-weight deep learning model for Remote Sensing Image Classification (RSIC), the task of identifying the aerial scene of a remote sensing image.

Image Classification Knowledge Distillation +1

Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition

2 code implementations16 Jun 2022 Zhifu Gao, Shiliang Zhang, Ian McLoughlin, Zhijie Yan

However, due to an independence assumption within the output tokens, performance of single-step NAR is inferior to that of AR models, especially with a large-scale corpus.

Language Modelling speech-recognition +1

Multi-view Audio and Music Classification

no code implementations3 Mar 2021 Huy Phan, Huy Le Nguyen, Oliver Y. Chén, Lam Pham, Philipp Koch, Ian McLoughlin, Alfred Mertins

The learned embedding in the subnetworks are then concatenated to form the multi-view embedding for classification similar to a simple concatenation network.

Classification General Classification +2

Inception-Based Network and Multi-Spectrogram Ensemble Applied For Predicting Respiratory Anomalies and Lung Diseases

no code implementations26 Dec 2020 Lam Pham, Huy Phan, Ross King, Alfred Mertins, Ian McLoughlin

This paper presents an inception-based deep neural network for detecting lung diseases using respiratory sound input.

Incandescent Bulb and LED Brake Lights:Novel Analysis of Reaction Times

no code implementations20 Oct 2020 Ramaswamy Palaniappan, Surej Mouli, Evangelina Fringi, Howard Bowman, Ian McLoughlin

BrakeAcc results also show that experienced subjects were quicker to respond to the activation of brake lights by releasing the accelerator pedal.

Self-Attention Generative Adversarial Network for Speech Enhancement

1 code implementation18 Oct 2020 Huy Phan, Huy Le Nguyen, Oliver Y. Chén, Philipp Koch, Ngoc Q. K. Duong, Ian McLoughlin, Alfred Mertins

Existing generative adversarial networks (GANs) for speech enhancement solely rely on the convolution operation, which may obscure temporal dependencies across the sequence input.

Generative Adversarial Network Speech Enhancement

Robust Deep Learning Framework For Predicting Respiratory Anomalies and Diseases

no code implementations21 Jan 2020 Lam Pham, Ian McLoughlin, Huy Phan, Minh Tran, Truc Nguyen, Ramaswamy Palaniappan

This paper presents a robust deep learning framework developed to detect respiratory diseases from recordings of respiratory sounds.

LSTM-TDNN with convolutional front-end for Dialect Identification in the 2019 Multi-Genre Broadcast Challenge

no code implementations19 Dec 2019 Xiaoxiao Miao, Ian McLoughlin

This paper presents a novel Dialect Identification (DID) system developed for the Fifth Edition of the Multi-Genre Broadcast challenge, the task of Fine-grained Arabic Dialect Identification (MGB-5 ADI Challenge).

Data Augmentation Dialect Identification

Towards More Accurate Automatic Sleep Staging via Deep Transfer Learning

1 code implementation30 Jul 2019 Huy Phan, Oliver Y. Chén, Philipp Koch, Zongqing Lu, Ian McLoughlin, Alfred Mertins, Maarten De Vos

We employ the Montreal Archive of Sleep Studies (MASS) database consisting of 200 subjects as the source domain and study deep transfer learning on three different target domains: the Sleep Cassette subset and the Sleep Telemetry subset of the Sleep-EDF Expanded database, and the Surrey-cEEGrid database.

Automatic Sleep Stage Classification Multimodal Sleep Stage Detection +2

Beyond Equal-Length Snippets: How Long is Sufficient to Recognize an Audio Scene?

no code implementations2 Nov 2018 Huy Phan, Oliver Y. Chén, Philipp Koch, Lam Pham, Ian McLoughlin, Alfred Mertins, Maarten De Vos

Moreover, as model fusion with deep network ensemble is prevalent in audio scene classification, we further study whether, and if so, when model fusion is necessary for this task.

General Classification Scene Classification

Enabling Early Audio Event Detection with Neural Networks

no code implementations6 Dec 2017 Huy Phan, Philipp Koch, Ian McLoughlin, Alfred Mertins

The proposed system consists of a novel inference step coupled with dual parallel tailored-loss deep neural networks (DNNs).

Event Detection

Cannot find the paper you are looking for? You can Submit a new open access paper.