Search Results for author: Mahmoud Al Ismail

Found 4 papers, 1 paper with code

PAM: Prompting Audio-Language Models for Audio Quality Assessment

1 code implementation • 1 Feb 2024 • Soham Deshmukh, Dareen Alharthi, Benjamin Elizalde, Hannes Gamper, Mahmoud Al Ismail, Rita Singh, Bhiksha Raj, Huaming Wang

Here, we exploit this capability and introduce PAM, a no-reference metric for assessing audio quality for different audio processing tasks.

Music Generation, Text-to-Music Generation

Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss

no code implementations • 11 Aug 2023 • Mohammad Soleymanpour, Mahmoud Al Ismail, Fahimeh Bahmaninezhad, Kshitiz Kumar, Jian Wu

Our key developments are: (a) a pronunciation lexicon with grapheme units instead of phone units, (b) a fully bilingual alignment model and, subsequently, a bilingual streaming transformer model, (c) a parallel encoder structure with a language identification (LID) loss, and (d) a parallel encoder with an auxiliary loss for monolingual projections.

Automatic Speech Recognition (ASR)

Interpreting glottal flow dynamics for detecting COVID-19 from voice

no code implementations • 29 Oct 2020 • Soham Deshmukh, Mahmoud Al Ismail, Rita Singh

In the pathogenesis of COVID-19, impairment of respiratory functions is often one of the key symptoms.

Detection of COVID-19 through the analysis of vocal fold oscillations

no code implementations • 21 Oct 2020 • Mahmoud Al Ismail, Soham Deshmukh, Rita Singh

Phonation, or the vibration of the vocal folds, is the primary source of vocalization in the production of voiced sounds by humans.
