no code implementations • 22 Mar 2024 • Helard Becerra, Alessandro Ragano, Diptasree Debnath, Asad Ullah, Crisron Rudolf Lucas, Martin Walsh, Andrew Hines
Watching movies and TV shows with subtitles enabled is not simply down to audibility or speech intelligibility.
no code implementations • 22 Sep 2023 • Asad Ullah, Alessandro Ragano, Andrew Hines
Our findings suggest that for resource-constrained languages, in-domain synthetic augmentation can outperform knowledge transfer from accented or other language speech.
no code implementations • 14 Nov 2022 • Davoud Shariat Panah, Andrew Hines, Susan Mckeever
We used this dataset to investigate the impact of noise and degradation in heart sound recordings on the performance of different classification models.
no code implementations • 27 Oct 2022 • Alessandro Ragano, Emmanouil Benetos, Andrew Hines
In addition, the results are superior to the pre-trained model on speech embeddings, demonstrating that wav2vec 2.0 pre-trained on music data can be a promising music representation model.
no code implementations • 14 Sep 2022 • Michael Chinen, Jan Skoglund, Chandan K A Reddy, Alessandro Ragano, Andrew Hines
Non-reference speech quality models are important for a growing number of applications.
no code implementations • 5 Apr 2022 • Alessandro Ragano, Emmanouil Benetos, Michael Chinen, Helard B. Martinez, Chandan K. A. Reddy, Jan Skoglund, Andrew Hines
In this paper, we evaluate several MOS predictors based on wav2vec 2.0 and the NISQA speech quality prediction model to explore the role of the training data, the influence of the system type, and the role of cross-domain features in SSL models.
no code implementations • 5 Apr 2022 • Helard Becerra, Alessandro Ragano, Andrew Hines
Further research is needed to evaluate other wav2vec 2.0 models pre-trained with multi-lingual datasets and to develop prediction models that are more resilient to language diversity.
no code implementations • 3 Jan 2022 • Arslan Ahmad, Atif Bin Mansoor, Alcardo Alex Barakabitze, Andrew Hines, Luigi Atzori, Ray Walshe
The Quality of Experience (QoE) based service management remains key for successful provisioning of multimedia services in next-generation networks such as 5G/6G, which requires proper tools for quality monitoring, prediction and resource management where machine learning (ML) can play a crucial role.
no code implementations • 19 Aug 2021 • Alessandro Ragano, Emmanouil Benetos, Andrew Hines
This paper indicates that multi-task learning combined with feature representations from unlabelled data is a promising approach to deal with the lack of large MOS-annotated datasets.
2 code implementations • 20 Feb 2021 • Wissam A. Jassim, Jan Skoglund, Michael Chinen, Andrew Hines
Good speech quality has been achieved using waveform matching and parametric reconstruction coders.
no code implementations • 24 Mar 2020 • Helard Martinez, Andrew Hines, Mylene C. Q. Farias
The development of audio-visual quality assessment models poses a number of challenges in order to obtain accurate predictions.
no code implementations • 22 Mar 2020 • Alessandro Ragano, Emmanouil Benetos, Andrew Hines
Audio impairment recognition is based on finding noise in audio files and categorising the impairment type.