Search Results for author: Sidney Fels

Found 15 papers, 4 papers with code

A comparative study of two-dimensional vocal tract acoustic modeling based on Finite-Difference Time-Domain methods

no code implementations • 9 Feb 2021 • Debasish Ray Mohapatra, Victor Zappi, Sidney Fels

The two-dimensional (2D) numerical approaches for vocal tract (VT) modelling can afford a better balance between the low computational cost and accurate rendering of acoustic wave propagation.

Acoustic Modelling

SPEAK WITH YOUR HANDS Using Continuous Hand Gestures to control Articulatory Speech Synthesizer

no code implementations • 2 Feb 2021 • Pramit Saha, Debasish Ray Mohapatra, Sidney Fels

Considering the upper palate as fixed and the spline model as the dynamically moving lower surface (tongue) of the vocal tract, we compute 1D area functional values that are fed to the Pink Trombone, generating continuous speech sounds.

Speech Synthesis

Ultra2Speech -- A Deep Learning Framework for Formant Frequency Estimation and Tracking from Ultrasound Tongue Images

no code implementations • 29 Jun 2020 • Pramit Saha, Yadong Liu, Bryan Gick, Sidney Fels

Thousands of individuals need surgical removal of their larynx due to critical diseases every year and therefore, require an alternative form of communication to articulate speech sounds after the loss of their voice box.

Learning Joint Articulatory-Acoustic Representations with Normalizing Flows

no code implementations • 16 May 2020 • Pramit Saha, Sidney Fels

The articulatory geometric configurations of the vocal tract and the acoustic properties of the resultant speech sound are considered to have a strong causal relationship.

Variational Learning with Disentanglement-PyTorch

1 code implementation • 11 Dec 2019 • Amir H. Abdi, Purang Abolmaesumi, Sidney Fels

Unsupervised learning of disentangled representations is an open problem in machine learning.

Disentanglement Scheduling

A Study into Echocardiography View Conversion

1 code implementation • 5 Dec 2019 • Amir H. Abdi, Mohammad H. Jafari, Sidney Fels, Theresa Tsang, Purang Abolmaesumi

The size and length of the left ventricle in the generated target echo view are compared against those of the target ground truth to assess the validity of the echo view conversion.

A Preliminary Study of Disentanglement With Insights on the Inadequacy of Metrics

no code implementations • 26 Nov 2019 • Amir H. Abdi, Purang Abolmaesumi, Sidney Fels

However, a qualitative study of the encoded latents reveals that there is not a consistent correlation between the reported metrics and the disentanglement potential of the model.

Disentanglement

Variational Shape Completion for Virtual Planning of Jaw Reconstructive Surgery

1 code implementation • 27 Jun 2019 • Amir H. Abdi, Mehran Pesteie, Eitan Prisman, Purang Abolmaesumi, Sidney Fels

The premorbid geometry of the mandible is of significant relevance in jaw reconstructive surgeries and occasionally unknown to the surgical team.

Deep Learning the EEG Manifold for Phonological Categorization from Active Thoughts

no code implementations • 8 Apr 2019 • Pramit Saha, Muhammad Abdul-Mageed, Sidney Fels

Speech-related Brain Computer Interfaces (BCI) aim primarily at finding an alternative vocal communication pathway for people with speaking disabilities.

Binary Classification EEG +2

SPEAK YOUR MIND! Towards Imagined Speech Recognition With Hierarchical Deep Learning

no code implementations • 8 Apr 2019 • Pramit Saha, Muhammad Abdul-Mageed, Sidney Fels

Speech-related Brain Computer Interface (BCI) technologies provide effective vocal communication strategies for controlling devices through speech commands interpreted from brain signals.

Brain Computer Interface General Classification +3

Hierarchical Deep Feature Learning For Decoding Imagined Speech From EEG

no code implementations • 8 Apr 2019 • Pramit Saha, Sidney Fels

We propose a mixed deep neural network strategy, incorporating a parallel combination of Convolutional (CNN) and Recurrent Neural Networks (RNN), cascaded with deep autoencoders and fully connected layers, towards automatic identification of imagined speech from EEG.

EEG General Classification

Limitations of Source-Filter Coupling In Phonation

no code implementations • 19 Nov 2018 • Debasish Ray Mohapatra, Sidney Fels

The coupling of vocal fold (source) and vocal tract (filter) is one of the most critical factors in source-filter articulation theory.

Muscle Excitation Estimation in Biomechanical Simulation Using NAF Reinforcement Learning

1 code implementation • 17 Sep 2018 • Amir H. Abdi, Pramit Saha, Praneeth Srungarapu, Sidney Fels

In this article, we propose a deep reinforcement learning method to estimate the muscle excitations in simulated biomechanical systems.

Point Tracking reinforcement-learning +1

Towards Automatic Speech Identification from Vocal Tract Shape Dynamics in Real-time MRI

no code implementations • 29 Jul 2018 • Pramit Saha, Praneeth Srungarapu, Sidney Fels

Interestingly, the results show a marked difference in the model performance in the context of speech classification with respect to generic sequence or video classification tasks.

Action Recognition Classification +3
