Search Results for author: Sidney Fels

Found 15 papers, 4 papers with code

Speech Audio Synthesis from Tagged MRI and Non-Negative Matrix Factorization via Plastic Transformer

no code implementations • 26 Sep 2023 • Xiaofeng Liu, Fangxu Xing, Maureen Stone, Jiachen Zhuo, Sidney Fels, Jerry L. Prince, Georges El Fakhri, Jonghye Woo

The tongue's intricate 3D structure, comprising localized functional units, plays a crucial role in the production of speech.

Audio Synthesis

Paper
Add Code

A comparative study of two-dimensional vocal tract acoustic modeling based on Finite-Difference Time-Domain methods

no code implementations • 9 Feb 2021 • Debasish Ray Mohapatra, Victor Zappi, Sidney Fels

The two-dimensional (2D) numerical approaches for vocal tract (VT) modelling can afford a better balance between the low computational cost and accurate rendering of acoustic wave propagation.

Acoustic Modelling

Paper
Add Code

SPEAK WITH YOUR HANDS Using Continuous Hand Gestures to control Articulatory Speech Synthesizer

no code implementations • 2 Feb 2021 • Pramit Saha, Debasish Ray Mohapatra, Sidney Fels

Considering the upper palate as fixed and the spline model as the dynamically moving lower surface (tongue) of the vocal tract, we compute 1D area functional values that are fed to the Pink Trombone, generating continuous speech sounds.

Speech Synthesis

Paper
Add Code

Ultra2Speech -- A Deep Learning Framework for Formant Frequency Estimation and Tracking from Ultrasound Tongue Images

no code implementations • 29 Jun 2020 • Pramit Saha, Yadong Liu, Bryan Gick, Sidney Fels

Thousands of individuals need surgical removal of their larynx due to critical diseases every year and therefore, require an alternative form of communication to articulate speech sounds after the loss of their voice box.

Paper
Add Code

Learning Joint Articulatory-Acoustic Representations with Normalizing Flows

no code implementations • 16 May 2020 • Pramit Saha, Sidney Fels

The articulatory geometric configurations of the vocal tract and the acoustic properties of the resultant speech sound are considered to have a strong causal relationship.

Paper
Add Code

Variational Learning with Disentanglement-PyTorch

1 code implementation • 11 Dec 2019 • Amir H. Abdi, Purang Abolmaesumi, Sidney Fels

Unsupervised learning of disentangled representations is an open problem in machine learning.

Disentanglement Scheduling

271

Paper
Code

A Study into Echocardiography View Conversion

1 code implementation • 5 Dec 2019 • Amir H. Abdi, Mohammad H. Jafari, Sidney Fels, Theresa Tsang, Purang Abolmaesumi

The size and length of the left ventricle in the generated target echo view is compared against that of the target ground-truth to assess the validity of the echo view conversion.

Paper
Code

A Preliminary Study of Disentanglement With Insights on the Inadequacy of Metrics

no code implementations • 26 Nov 2019 • Amir H. Abdi, Purang Abolmaesumi, Sidney Fels

However, a qualitative study of the encoded latents reveal that there is not a consistent correlation between the reported metrics and the disentanglement potential of the model.

Disentanglement

Paper
Add Code

Variational Shape Completion for Virtual Planning of Jaw Reconstructive Surgery

1 code implementation • 27 Jun 2019 • Amir H. Abdi, Mehran Pesteie, Eitan Prisman, Purang Abolmaesumi, Sidney Fels

The premorbid geometry of the mandible is of significant relevance in jaw reconstructive surgeries and occasionally unknown to the surgical team.

Paper
Code

Deep Learning the EEG Manifold for Phonological Categorization from Active Thoughts

no code implementations • 8 Apr 2019 • Pramit Saha, Muhammad Abdul-Mageed, Sidney Fels

Speech-related Brain Computer Interfaces (BCI) aim primarily at finding an alternative vocal communication pathway for people with speaking disabilities.

Binary Classification EEG +2

Paper
Add Code

SPEAK YOUR MIND! Towards Imagined Speech Recognition With Hierarchical Deep Learning

no code implementations • 8 Apr 2019 • Pramit Saha, Muhammad Abdul-Mageed, Sidney Fels

Speech-related Brain Computer Interface (BCI) technologies provide effective vocal communication strategies for controlling devices through speech commands interpreted from brain signals.

Brain Computer Interface General Classification +3

Paper
Add Code

Hierarchical Deep Feature Learning For Decoding Imagined Speech From EEG

no code implementations • 8 Apr 2019 • Pramit Saha, Sidney Fels

We propose a mixed deep neural network strategy, incorporating parallel combination of Convolutional (CNN) and Recurrent Neural Networks (RNN), cascaded with deep autoencoders and fully connected layers towards automatic identification of imagined speech from EEG.

EEG General Classification

Paper
Add Code

Limitations of Source-Filter Coupling In Phonation

no code implementations • 19 Nov 2018 • Debasish Ray Mohapatra, Sidney Fels

The coupling of vocal fold (source) and vocal tract (filter) is one of the most critical factors in source-filter articulation theory.

Paper
Add Code

Muscle Excitation Estimation in Biomechanical Simulation Using NAF Reinforcement Learning

1 code implementation • 17 Sep 2018 • Amir H. Abdi, Pramit Saha, Praneeth Srungarapu, Sidney Fels

In this article, we propose a deep reinforcement learning method to estimate the muscle excitations in simulated biomechanical systems.

Point Tracking reinforcement-learning +1

Paper
Code

Towards Automatic Speech Identification from Vocal Tract Shape Dynamics in Real-time MRI

no code implementations • 29 Jul 2018 • Pramit Saha, Praneeth Srungarapu, Sidney Fels

Interestingly, the results show a marked difference in the model performance in the context of speech classification with respect to generic sequence or video classification tasks.

Action Recognition Classification +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.