Search Results for author: Andrew Hines

Found 12 papers, 1 papers with code

Dialogue Understandability: Why are we streaming movies with subtitles?

no code implementations • 22 Mar 2024 • Helard Becerra, Alessandro Ragano, Diptasree Debnath, Asad Ullah, Crisron Rudolf Lucas, Martin Walsh, Andrew Hines

Watching movies and TV shows with subtitles enabled is not simply down to audibility or speech intelligibility.

Paper
Add Code

Reduce, Reuse, Recycle: Is Perturbed Data better than Other Language augmentation for Low Resource Self-Supervised Speech Models

no code implementations • 22 Sep 2023 • Asad Ullah, Alessandro Ragano, Andrew Hines

Our findings suggest that for resource constrained languages, in-domain synthetic augmentation can outperform knowledge transfer from accented or other language speech.

Representation Learning Transfer Learning

Paper
Add Code

Exploring the Impact of Noise and Degradations on Heart Sound Classification Models

no code implementations • 14 Nov 2022 • Davoud Shariat Panah, Andrew Hines, Susan Mckeever

We used this dataset to investigate the impact of noise and degradation in heart sound recordings on the performance of different classification models.

Classification Sound Classification

Paper
Add Code

Learning Music Representations with wav2vec 2.0

no code implementations • 27 Oct 2022 • Alessandro Ragano, Emmanouil Benetos, Andrew Hines

In addition, the results are superior to the pre-trained model on speech embeddings, demonstrating that wav2vec 2. 0 pre-trained on music data can be a promising music representation model.

Music Classification

Paper
Add Code

Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset

no code implementations • 14 Sep 2022 • Michael Chinen, Jan Skoglund, Chandan K A Reddy, Alessandro Ragano, Andrew Hines

Non-reference speech quality models are important for a growing number of applications.

Voice Conversion

Paper
Add Code

A Comparison of Deep Learning MOS Predictors for Speech Synthesis Quality

no code implementations • 5 Apr 2022 • Alessandro Ragano, Emmanouil Benetos, Michael Chinen, Helard B. Martinez, Chandan K. A. Reddy, Jan Skoglund, Andrew Hines

In this paper, we evaluate several MOS predictors based on wav2vec 2. 0 and the NISQA speech quality prediction model to explore the role of the training data, the influence of the system type, and the role of cross-domain features in SSL models.

Benchmarking Self-Supervised Learning +1

Paper
Add Code

Exploring the influence of fine-tuning data on wav2vec 2.0 model for blind speech quality prediction

no code implementations • 5 Apr 2022 • Helard Becerra, Alessandro Ragano, Andrew Hines

Further research is needed to evaluate other wav2vec 2. 0 models pre-trained with multi-lingual datasets and to develop prediction models that are more resilient to language diversity.

Paper
Add Code

Supervised Learning based QoE Prediction of Video Streaming in Future Networks: A Tutorial with Comparative Study

no code implementations • 3 Jan 2022 • Arslan Ahmad, Atif Bin Mansoor, Alcardo Alex Barakabitze, Andrew Hines, Luigi Atzori, Ray Walshe

The Quality of Experience (QoE) based service management remains key for successful provisioning of multimedia services in next-generation networks such as 5G/6G, which requires proper tools for quality monitoring, prediction and resource management where machine learning (ML) can play a crucial role.

Edge-computing Feature Engineering +2

Paper
Add Code

More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations

no code implementations • 19 Aug 2021 • Alessandro Ragano, Emmanouil Benetos, Andrew Hines

This paper indicates that multi-task learning combined with feature representations from unlabelled data is a promising approach to deal with the lack of large MOS annotated datasets.

Clustering Deep Clustering +1

Paper
Add Code

WARP-Q: Quality Prediction For Generative Neural Speech Codecs

2 code implementations • 20 Feb 2021 • Wissam A. Jassim, Jan Skoglund, Michael Chinen, Andrew Hines

Good speech quality has been achieved using waveform matching and parametric reconstruction coders.

Dynamic Time Warping

Paper
Code

How deep is your encoder: an analysis of features descriptors for an autoencoder-based audio-visual quality metric

no code implementations • 24 Mar 2020 • Helard Martinez, Andrew Hines, Mylene C. Q. Farias

The development of audio-visual quality assessment models poses a number of challenges in order to obtain accurate predictions.

Paper
Add Code

Audio Impairment Recognition Using a Correlation-Based Feature Representation

no code implementations • 22 Mar 2020 • Alessandro Ragano, Emmanouil Benetos, Andrew Hines

Audio impairment recognition is based on finding noise in audio files and categorising the impairment type.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.