Search Results for author: Ali Etemad

Found 94 papers, 31 papers with code

CLARE: Cognitive Load Assessment in REaltime with Multimodal Data

no code implementations • 26 Apr 2024 • Anubhav Bhatti, Prithila Angkan, Behnam Behinaein, Zunayed Mahmud, Dirk Rodenburg, Heather Braund, P. James Mclellan, Aaron Ruberto, Geoffery Harrison, Daryl Wilson, Adam Szulewski, Dan Howes, Ali Etemad, Paul Hungler

In contrast, for LOSO, the best performance is achieved by the deep learning model with ECG, EDA, and EEG.

Binary Classification EEG +1

UPose3D: Uncertainty-Aware 3D Human Pose Estimation with Cross-View and Temporal Cues

no code implementations • 23 Apr 2024 • Vandad Davoodnia, Saeed Ghorbani, Marc-André Carbonneau, Alexandre Messier, Ali Etemad

At the core of our method, a pose compiler module refines predictions from a 2D keypoints estimator that operates on a single image by leveraging temporal and cross-view information.

3D Human Pose Estimation Synthetic Data Generation

SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers

no code implementations • 19 Apr 2024 • Vandad Davoodnia, Saeed Ghorbani, Alexandre Messier, Ali Etemad

Next, we design a regression-based inverse-kinematic skeletal transformer that maps the joint positions to pose and shape representations from heavily noisy observations.

Keypoint Detection Markerless Motion Capture

A Bag of Tricks for Few-Shot Class-Incremental Learning

no code implementations • 21 Mar 2024 • Shuvendu Roy, Chunjong Park, Aldi Fahrezi, Ali Etemad

FSCIL requires both stability and adaptability, i.e., preserving proficiency in previously learned tasks while learning new ones.

Ranked #2 on Few-Shot Class-Incremental Learning on CUB-200-2011

Few-Shot Class-Incremental Learning Incremental Learning

A collection of the accepted papers for the Human-Centric Representation Learning workshop at AAAI 2024

no code implementations • 14 Mar 2024 • Dimitris Spathis, Aaqib Saeed, Ali Etemad, Sana Tonekaboni, Stefanos Laskaridis, Shohreh Deldari, Chi Ian Tang, Patrick Schwab, Shyam Tailor

This non-archival index is not complete, as some accepted papers chose to opt out of inclusion.

Representation Learning

Learning under Label Noise through Few-Shot Human-in-the-Loop Refinement

no code implementations • 25 Jan 2024 • Aaqib Saeed, Dimitris Spathis, JungWoo Oh, Edward Choi, Ali Etemad

We show that FHLR achieves significantly better performance when learning from noisy labels and achieves state-of-the-art by a large margin, with up to 19% accuracy improvement under symmetric and asymmetric noise.

SASSL: Enhancing Self-Supervised Learning via Neural Style Transfer

no code implementations • 2 Dec 2023 • Renan A. Rojas-Gomez, Karan Singhal, Ali Etemad, Alex Bijamov, Warren R. Morningstar, Philip Andrew Mansfield

Existing data augmentation in self-supervised learning, while diverse, fails to preserve the inherent structure of natural images.

Data Augmentation Self-Supervised Learning +2

Contrastive Learning of View-Invariant Representations for Facial Expressions Recognition

no code implementations • 12 Nov 2023 • Shuvendu Roy, Ali Etemad

ViewFX learns view-invariant features of expression using a proposed self-supervised contrastive loss which brings together different views of the same subject with a particular expression in the embedding space.

Contrastive Learning Facial Expression Recognition +1

Remote Heart Rate Monitoring in Smart Environments from Videos with Self-supervised Pre-training

no code implementations • 23 Oct 2023 • Divij Gupta, Ali Etemad

Recent advances in deep learning have made it increasingly feasible to estimate heart rate remotely in smart environments by analyzing videos.

Contrastive Learning Heart rate estimation +2

Speech Emotion Recognition with Distilled Prosodic and Linguistic Affect Representations

no code implementations • 9 Sep 2023 • Debaditya Shome, Ali Etemad

We propose EmoDistill, a novel speech emotion recognition (SER) framework that leverages cross-modal knowledge distillation during training to learn strong linguistic and prosodic representations of emotion from speech.

Knowledge Distillation Speech Emotion Recognition

Diffusion Models with Deterministic Normalizing Flow Priors

1 code implementation • 3 Sep 2023 • Mohsen Zand, Ali Etemad, Michael Greenspan

We use normalizing flows to parameterize the noisy data at any arbitrary step of the diffusion process and utilize it as the prior in the reverse diffusion process.

Denoising Image Generation

Multiscale Residual Learning of Graph Convolutional Sequence Chunks for Human Motion Prediction

1 code implementation • 31 Aug 2023 • Mohsen Zand, Ali Etemad, Michael Greenspan

Our experiments on two challenging benchmark datasets, CMU Mocap and Human3.6M, demonstrate that our proposed method is able to effectively model the sequence information for motion prediction and outperform other techniques to set a new state-of-the-art.

Human motion prediction motion prediction

Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation

1 code implementation • 25 Aug 2023 • Debaditya Shome, Pritam Sarkar, Ali Etemad

In this work, we introduce Region-Disentangled Diffusion Model (RDDM), a novel diffusion model designed to capture the complex temporal dynamics of ECG.

Blood pressure estimation Denoising +2

EEG-based Cognitive Load Classification using Feature Masked Autoencoding and Emotion Transfer Learning

no code implementations • 1 Aug 2023 • Dustin Pulver, Prithila Angkan, Paul Hungler, Ali Etemad

We pre-train our model using self-supervised masked autoencoding on emotion-related EEG datasets and use transfer learning with both frozen weights and fine-tuning to perform downstream cognitive load classification.

Classification Decision Making +2

Context-aware Pedestrian Trajectory Prediction with Multimodal Transformer

no code implementations • 7 Jul 2023 • Haleh Damirchi, Michael Greenspan, Ali Etemad

Quantitative results demonstrate the superiority of our proposed model over the current state-of-the-art, which consistently achieves the lowest error for 3 time horizons of 0.5, 1.0 and 1.5 seconds.

Decoder Pedestrian Trajectory Prediction +1

Active Learning with Contrastive Pre-training for Facial Expression Recognition

1 code implementation • 6 Jul 2023 • Shuvendu Roy, Ali Etemad

Even though some prior works have focused on reducing the need for large amounts of labelled data using different unsupervised methods, another promising approach called active learning is barely explored in the context of FER.

Active Learning Facial Expression Recognition +1

Continual Learning for Out-of-Distribution Pedestrian Detection

1 code implementation • 26 Jun 2023 • Mahdiyar Molahasani, Ali Etemad, Michael Greenspan

A continual learning solution is proposed to address the out-of-distribution generalization problem for pedestrian detection.

Continual Learning object-detection +3

Can Continual Learning Improve Long-Tailed Recognition? Toward a Unified Framework

no code implementations • 23 Jun 2023 • Mahdiyar Molahasani, Michael Greenspan, Ali Etemad

Next, we assert that by treating the learning of the Head and Tail as two separate and sequential steps, Continual Learning (CL) methods can effectively update the weights of the learner to learn the Tail without forgetting the Head.

Continual Learning

Unmasking Deepfakes: Masked Autoencoding Spatiotemporal Transformers for Enhanced Video Forgery Detection

no code implementations • 12 Jun 2023 • Sayantan Das, Mojtaba Kolahdouzi, Levent Özparlak, Will Hickie, Ali Etemad

We present a novel approach for the detection of deepfake videos using a pair of vision transformers pre-trained by a self-supervised masked autoencoding setup.

Face Swapping Optical Flow Estimation

Toward Fair Facial Expression Recognition with Improved Distribution Alignment

no code implementations • 11 Jun 2023 • Mojtaba Kolahdouzi, Ali Etemad

We present a novel approach to mitigate bias in facial expression recognition (FER) models.

Attribute Facial Expression Recognition +2

Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts

1 code implementation • NeurIPS 2023 • Pritam Sarkar, Ahmad Beirami, Ali Etemad

Video self-supervised learning (VSSL) has made significant progress in recent years.

Open Set Learning Representation Learning +1

Scaling Up Semi-supervised Learning with Unconstrained Unlabelled Data

1 code implementation • 2 Jun 2023 • Shuvendu Roy, Ali Etemad

We propose UnMixMatch, a semi-supervised learning framework which can learn effective representations from unconstrained unlabelled data in order to scale up performance.

Ranked #1 on Semi-Supervised Image Classification on CIFAR-10, 400 Labels (OpenSet, 6/4)

Network Pruning Semi-Supervised Image Classification

Exploring the Boundaries of Semi-Supervised Facial Expression Recognition: Learning from In-Distribution, Out-of-Distribution, and Unconstrained Data

no code implementations • 2 Jun 2023 • Shuvendu Roy, Ali Etemad

While semi-supervised learning has shown promise in FER, most current methods from general computer vision literature have not been explored in the context of FER.

Facial Expression Recognition Facial Expression Recognition (FER) +1

Consistency-guided Prompt Learning for Vision-Language Models

2 code implementations • 1 Jun 2023 • Shuvendu Roy, Ali Etemad

Our approach improves the generalization of large foundation models when fine-tuned on downstream tasks in a few-shot setting.

Ranked #2 on Prompt Engineering on Oxford-IIIT Pet Dataset

Domain Generalization Few-Shot Learning +1

Privacy-Preserving Remote Heart Rate Estimation from Facial Videos

no code implementations • 1 Jun 2023 • Divij Gupta, Ali Etemad

Remote Photoplethysmography (rPPG) is the process of estimating PPG from facial videos.

Heart rate estimation Privacy Preserving

An Ensemble Semi-Supervised Adaptive Resonance Theory Model with Explanation Capability for Pattern Classification

no code implementations • 19 May 2023 • Farhad Pourpanah, Chee Peng Lim, Ali Etemad, Q. M. Jonathan Wu

Firstly, SSL-ART adopts an unsupervised fuzzy ART network to create a number of prototype nodes using unlabeled samples.

In-Distribution and Out-of-Distribution Self-supervised ECG Representation Learning for Arrhythmia Detection

no code implementations • 13 Apr 2023 • Sahar Soltanieh, Javad Hashemi, Ali Etemad

To further assess the performance of these methods on both In-Distribution (ID) and Out-of-Distribution (OOD) ECG data, we conduct cross-dataset training and testing experiments.

Arrhythmia Detection Representation Learning +1

Multimodal Brain-Computer Interface for In-Vehicle Driver Cognitive Load Measurement: Dataset and Baselines

1 code implementation • 9 Apr 2023 • Prithila Angkan, Behnam Behinaein, Zunayed Mahmud, Anubhav Bhatti, Dirk Rodenburg, Paul Hungler, Ali Etemad

Through this paper, we introduce a novel driver cognitive load assessment dataset, CL-Drive, which contains Electroencephalogram (EEG) signals along with other physiological signals such as Electrocardiography (ECG) and Electrodermal Activity (EDA) as well as eye tracking data.

Brain Computer Interface EEG +1

A Study on Bias and Fairness In Deep Speaker Recognition

no code implementations • 14 Mar 2023 • Amirhossein Hajavi, Ali Etemad

With the ubiquity of smart devices that use speaker recognition (SR) systems as a means of authenticating individuals and personalizing their services, fairness of SR systems has become an important point of focus.

Fairness Speaker Recognition

Human Pose Estimation from Ambiguous Pressure Recordings with Spatio-temporal Masked Transformers

no code implementations • 10 Mar 2023 • Vandad Davoodnia, Ali Etemad

Moreover, we observe that increasing the number of temporal crops in the early stages of the network positively impacts the performance while pre-training the network in a self-supervised setting using a masked auto-encoder approach also further improves the results.

Decoder Pose Estimation

Partial Label Learning for Emotion Recognition from EEG

1 code implementation • 25 Feb 2023 • Guangyi Zhang, Ali Etemad

However, PLL methods have not yet been adopted for EEG representation learning or implemented for emotion recognition tasks.

EEG Emotion Recognition +2

Audio Representation Learning by Distilling Video as Privileged Information

no code implementations • 6 Feb 2023 • Amirhossein Hajavi, Ali Etemad

In this work, we propose a novel approach for deep audio representation learning using audio-visual data when the video modality is absent at inference.

Knowledge Distillation Representation Learning +2

Impact of Labelled Set Selection and Supervision Policies on Semi-supervised Learning

no code implementations • 27 Nov 2022 • Shuvendu Roy, Ali Etemad

All these labelled samples are then used along with the unlabelled data throughout the training process.

Representation Learning

XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning

1 code implementation • 25 Nov 2022 • Pritam Sarkar, Ali Etemad

First, masked data reconstruction is performed to learn modality-specific representations from audio and visual streams.

Ranked #1 on Self-Supervised Action Recognition on Kinetics-400

Action Classification Classification +6

FaceTopoNet: Facial Expression Recognition using Face Topology Learning

no code implementations • 13 Sep 2022 • Mojtaba Kolahdouzi, Alireza Sepas-Moghaddam, Ali Etemad

We perform extensive experiments on four large-scale in-the-wild facial expression datasets - namely AffectNet, FER2013, ExpW, and RAF-DB - and one lab-controlled dataset (CK+) to evaluate our approach.

Facial Expression Recognition Facial Expression Recognition (FER)

Temporal Contrastive Learning with Curriculum

no code implementations • 2 Sep 2022 • Shuvendu Roy, Ali Etemad

We present ConCur, a contrastive video representation learning method that uses curriculum learning to impose a dynamic sampling strategy in contrastive training.

Action Recognition Contrastive Learning +4

Self-Supervised Human Activity Recognition with Localized Time-Frequency Contrastive Representation Learning

no code implementations • 26 Aug 2022 • Setareh Rahimi Taghanaki, Michael Rainbow, Ali Etemad

We aim to develop a model that learns strong representations from accelerometer signals, in order to perform robust human activity classification, while reducing the model's reliance on class labels.

Classification Contrastive Learning +4

Analysis of Semi-Supervised Methods for Facial Expression Recognition

2 code implementations • 31 Jul 2022 • Shuvendu Roy, Ali Etemad

To reduce the reliance of deep neural solutions on labeled data, state-of-the-art semi-supervised methods have been proposed in the literature.

Facial Expression Recognition Facial Expression Recognition (FER) +1

Multimodal Estimation of End Point Force During Quasi-dynamic and Dynamic Muscle Contractions Using Deep Learning

no code implementations • 20 Jul 2022 • Gelareh Hajian, Evelyn Morin, Ali Etemad

We propose a novel method to accurately model the generated force under isotonic, isokinetic (quasi-dynamic), and fully dynamic conditions.

ObjectBox: From Centers to Boxes for Anchor-Free Object Detection

1 code implementation • 14 Jul 2022 • Mohsen Zand, Ali Etemad, Michael Greenspan

We present ObjectBox, a novel single-stage anchor-free and highly generalizable object detection approach.

Object object-detection +1

Multistream Gaze Estimation with Anatomical Eye Region Isolation by Synthetic to Real Transfer Learning

1 code implementation • 18 Jun 2022 • Zunayed Mahmud, Paul Hungler, Ali Etemad

The eye region isolation is performed with a U-Net style network which we train using a synthetic dataset that contains eye region masks for the visible eyeball and the iris region.

Anatomy Gaze Estimation +1

Estimating Pose from Pressure Data for Smart Beds with Deep Image-based Pose Estimators

no code implementations • 13 Jun 2022 • Vandad Davoodnia, Saeed Ghorbani, Ali Etemad

In-bed pose estimation has shown value in fields such as hospital patient monitoring, sleep studies, and smart homes.

Domain Adaptation Pose Estimation

AttX: Attentive Cross-Connections for Fusion of Wearable Signals in Emotion Recognition

no code implementations • 9 Jun 2022 • Anubhav Bhatti, Behnam Behinaein, Paul Hungler, Ali Etemad

We perform extensive experiments on three public multimodal wearable datasets, WESAD, SWELL-KW, and CASE, and demonstrate that our method can effectively regulate and share information between different modalities to learn better representations.

Emotion Recognition Representation Learning

Learning Sequential Contexts using Transformer for 3D Hand Pose Estimation

no code implementations • 1 Jun 2022 • Leyla Khaleghi, Joshua Marshall, Ali Etemad

3D hand pose estimation (HPE) is the process of locating the joints of the hand in 3D from any visual input.

3D Hand Pose Estimation

Analysis of Augmentations for Contrastive ECG Representation Learning

no code implementations • 30 May 2022 • Sahar Soltanieh, Ali Etemad, Javad Hashemi

For instance, when adding Gaussian noise, a sigma in the range of 0.1 to 0.2 achieves better results, while poor training occurs when the added noise is too small or too large (outside of the specified range).

Arrhythmia Detection Contrastive Learning +2

AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work

1 code implementation • 13 May 2022 • Pritam Sarkar, Aaron Posen, Ali Etemad

We introduce AVCAffe, the first Audio-Visual dataset consisting of Cognitive load and Affect attributes.

Management

Multiscale Crowd Counting and Localization By Multitask Point Supervision

1 code implementation • 21 Feb 2022 • Mohsen Zand, Haleh Damirchi, Andrew Farley, Mahdiyar Molahasani, Michael Greenspan, Ali Etemad

As the detection and localization tasks are well-correlated and can be jointly tackled, our model benefits from a multitask solution by learning multiscale representations of encoded crowd images, and subsequently fusing them.

Crowd Counting

PARSE: Pairwise Alignment of Representations in Semi-Supervised EEG Learning for Emotion Recognition

1 code implementation • 11 Feb 2022 • Guangyi Zhang, Vandad Davoodnia, Ali Etemad

To reduce the potential distribution mismatch between the large amounts of unlabeled data and the limited amount of labeled data, PARSE uses pairwise representation alignment.

Data Augmentation EEG +2

Gaze Estimation with Eye Region Segmentation and Self-Supervised Multistream Learning

no code implementations • 15 Dec 2021 • Zunayed Mahmud, Paul Hungler, Ali Etemad

We first create a synthetic dataset containing eye region masks detailing the visible eyeball and iris using a simulator.

Contrastive Learning Gaze Estimation

Face Trees for Expression Recognition

no code implementations • 5 Dec 2021 • Mojtaba Kolahdouzi, Alireza Sepas-Moghaddam, Ali Etemad

We propose an end-to-end architecture for facial expression recognition.

Facial Expression Recognition Facial Expression Recognition (FER)

Towards Personalization of User Preferences in Partially Observable Smart Home Environments

no code implementations • 2 Dec 2021 • Shashi Suman, Francois Rivest, Ali Etemad

In this paper, we propose a Bayesian Reinforcement learning framework that can approximate the current occupant state in a partially observable smart home environment using its thermal preference, and then identify the occupant as a new user or someone who is already known to the system.

Hierarchical Reinforcement Learning reinforcement-learning +1

Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity

1 code implementation • 9 Nov 2021 • Pritam Sarkar, Ali Etemad

We present CrissCross, a self-supervised framework for learning audio-visual representations.

Ranked #1 on Audio Classification on DCASE

Retrieval Self-Supervised Action Recognition +5

Holistic Semi-Supervised Approaches for EEG Representation Learning

no code implementations • 24 Sep 2021 • Guangyi Zhang, Ali Etemad

Recently, supervised methods, which often require substantial amounts of class labels, have achieved promising results for EEG representation learning.

EEG Emotion Recognition +1

Multi-View Video-Based 3D Hand Pose Estimation

1 code implementation • 24 Sep 2021 • Leyla Khaleghi, Alireza Sepas Moghaddam, Joshua Marshall, Ali Etemad

Recent works have shown that videos or multi-view images carry rich information regarding the hand, allowing for the development of more robust HPE systems.

3D Hand Pose Estimation

Wearable-based Classification of Running Styles with Deep Learning

no code implementations • 1 Sep 2021 • Setareh Rahimi Taghanaki, Michael Rainbow, Ali Etemad

To develop a system capable of classifying running styles using wearables, we collect a dataset from 10 healthy runners performing 8 different pre-defined running styles.

Classification

A Transformer Architecture for Stress Detection from ECG

no code implementations • 22 Aug 2021 • Behnam Behinaein, Anubhav Bhatti, Dirk Rodenburg, Paul Hungler, Ali Etemad

Electrocardiogram (ECG) has been widely used for emotion recognition.

Emotion Recognition

Self-supervised Contrastive Learning of Multi-view Facial Expressions

no code implementations • 15 Aug 2021 • Shuvendu Roy, Ali Etemad

The model is then fine-tuned with labeled data in a supervised setting.

Contrastive Learning Facial Expression Recognition +1

Spatiotemporal Contrastive Learning of Facial Expressions in Videos

no code implementations • 6 Aug 2021 • Shuvendu Roy, Ali Etemad

Experiments are performed on the Oulu-CASIA dataset and the performance is compared to other works in FER.

Contrastive Learning Facial Expression Recognition +1

Attentive Cross-modal Connections for Deep Multimodal Wearable-based Emotion Recognition

no code implementations • 4 Aug 2021 • Anubhav Bhatti, Behnam Behinaein, Dirk Rodenburg, Paul Hungler, Ali Etemad

Classification of human emotions can play an essential role in the design and improvement of human-machine systems.

Classification Emotion Classification +1

Deep Recurrent Semi-Supervised EEG Representation Learning for Emotion Recognition

no code implementations • 28 Jul 2021 • Guangyi Zhang, Ali Etemad

We evaluate our framework using both a stacked autoencoder and an attention-based recurrent autoencoder.

Deep Attention EEG +2

Multi-Perspective LSTM for Joint Visual Representation Learning

1 code implementation • CVPR 2021 • Alireza Sepas-Moghaddam, Fernando Pereira, Paulo Lobato Correia, Ali Etemad

We validate the performance of our proposed architecture in the context of two multi-perspective visual recognition tasks namely lip reading and face recognition.

Face Recognition Lip Reading +1

Distilling EEG Representations via Capsules for Affective Computing

no code implementations • 30 Apr 2021 • Guangyi Zhang, Ali Etemad

Then, we employ the teacher network to learn the discriminative features embedded in capsules by adopting a lightweight model (student network) to mimic the teacher using the privileged knowledge.

EEG Knowledge Distillation

Oriented Bounding Boxes for Small and Freely Rotated Objects

no code implementations • 24 Apr 2021 • Mohsen Zand, Ali Etemad, Michael Greenspan

A novel object detection method is presented that handles freely rotated objects of arbitrary sizes, including tiny objects as small as $2\times 2$ pixels.

Novel Object Detection object-detection +1

Flow-based Spatio-Temporal Structured Prediction of Motion Dynamics

1 code implementation • 9 Apr 2021 • Mohsen Zand, Ali Etemad, Michael Greenspan

We specifically propose to use conditional priors to factorize the latent space for the time dependent modeling.

motion prediction Structured Prediction +3

Vote from the Center: 6 DoF Pose Estimation in RGB-D Images by Radial Keypoint Voting

1 code implementation • 6 Apr 2021 • Yangzheng Wu, Mohsen Zand, Ali Etemad, Michael Greenspan

We propose a novel keypoint voting scheme based on intersecting spheres, that is more accurate than existing schemes and allows for fewer, more disperse keypoints.

Ranked #1 on 6D Pose Estimation using RGBD on YCB-Video (ADDS AUC metric)

6D Pose Estimation using RGBD

Teacher-Student Adversarial Depth Hallucination to Improve Face Recognition

1 code implementation • ICCV 2021 • Hardik Uppal, Alireza Sepas-Moghaddam, Michael Greenspan, Ali Etemad

Moreover, face recognition experiments demonstrate that our hallucinated depth along with the input RGB images boosts performance across various architectures when compared to a single RGB modality by average values of +1.2%, +2.6%, and +2.6% for IIIT-D, EURECOM, and LFW datasets respectively.

Face Recognition Generative Adversarial Network +1

Identity and Posture Recognition in Smart Beds with Deep Multitask Learning

no code implementations • 5 Apr 2021 • Vandad Davoodnia, Ali Etemad

Sleep posture analysis is widely used for clinical patient monitoring and sleep studies.

Potential Impacts of Smart Homes on Human Behavior: A Reinforcement Learning Approach

no code implementations • 26 Feb 2021 • Shashi Suman, Ali Etemad, Francois Rivest

We then investigate the possibility of human behavior being altered as a result of the smart home and the human model adapting to one-another.

Hierarchical Reinforcement Learning Q-Learning +2

Deep Gait Recognition: A Survey

no code implementations • 18 Feb 2021 • Alireza Sepas-Moghaddam, Ali Etemad

Gait recognition is an appealing biometric modality which aims to identify individuals based on the way they walk.

Gait Recognition

CapsField: Light Field-based Face and Expression Recognition in the Wild using Capsule Routing

no code implementations • 10 Jan 2021 • Alireza Sepas-Moghaddam, Ali Etemad, Fernando Pereira, Paulo Lobato Correia

A subset of the in-the-wild dataset contains facial images with different expressions, annotated for usage in the context of face expression recognition tests.

Depth as Attention for Face Representation Learning

1 code implementation • 3 Jan 2021 • Hardik Uppal, Alireza Sepas-Moghaddam, Michael Greenspan, Ali Etemad

Our novel attention mechanism directs the deep network "where to look" for visual features in the RGB image by focusing the attention of the network using depth features extracted by a Convolutional Neural Network (CNN).

Face Recognition Representation Learning

Detection of Maternal and Fetal Stress from the Electrocardiogram with Self-Supervised Representation Learning

2 code implementations • 3 Nov 2020 • Pritam Sarkar, Silvia Lobmaier, Bibiana Fabre, Diego González, Alexander Mueller, Martin G. Frasch, Marta C. Antonelli, Ali Etemad

Our DL models accurately detect the chronic stress exposure group (AUROC=0.982+/-0.002), the individual psychological stress score (R2=0.943+/-0.009) and FSI at 34 weeks of gestation (R2=0.946+/-0.013), as well as the maternal hair cortisol at birth reflecting chronic stress exposure (0.931+/-0.006).

Representation Learning Self-Supervised Learning

Self-supervised Human Activity Recognition by Learning to Predict Cross-Dimensional Motion

no code implementations • 21 Oct 2020 • Setareh Rahimi Taghanaki, Michael Rainbow, Ali Etemad

We propose the use of self-supervised learning for human activity recognition with smartphone accelerometer data.

Human Activity Recognition Self-Supervised Learning

View-Invariant Gait Recognition with Attentive Recurrent Learning of Partial Representations

no code implementations • 18 Oct 2020 • Alireza Sepas-Moghaddam, Ali Etemad

Our proposed model has been extensively tested on two large-scale CASIA-B and OU-MVLP gait datasets using four different test protocols and has been compared to a number of state-of-the-art and baseline solutions.

Gait Recognition

Gait Recognition using Multi-Scale Partial Representation Transformation with Capsules

no code implementations • 18 Oct 2020 • Alireza Sepas-Moghaddam, Saeed Ghorbani, Nikolaus F. Troje, Ali Etemad

In this context, we propose a novel deep network, learning to transfer multi-scale partial gait representations using capsules to obtain more discriminative gait features.

Gait Recognition

CardioGAN: Attentive Generative Adversarial Network with Dual Discriminators for Synthesis of ECG from PPG

2 code implementations • 30 Sep 2020 • Pritam Sarkar, Ali Etemad

Electrocardiogram (ECG) is the electrical measurement of cardiac activity, whereas Photoplethysmogram (PPG) is the optical measurement of volumetric changes in blood circulation.

Generative Adversarial Network

Siamese Capsule Network for End-to-End Speaker Recognition In The Wild

no code implementations • 28 Sep 2020 • Amirhossein Hajavi, Ali Etemad

Our model uses thin-ResNet for extracting speaker embeddings from utterances and a Siamese capsule network and dynamic routing as the Back-end to calculate a similarity score between the embeddings.

Speaker Recognition Speaker Verification

FluentNet: End-to-End Detection of Speech Disfluency with Deep Learning

no code implementations • 23 Sep 2020 • Tedd Kourkounakis, Amirhossein Hajavi, Ali Etemad

We also evaluate FluentNet on this dataset, showing the strong performance of our model versus a number of benchmark techniques.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

End-to-End Prediction of Parcel Delivery Time with Deep Learning for Smart-City Applications

1 code implementation • 23 Sep 2020 • Arthur Cruz de Araujo, Ali Etemad

The acquisition of massive data on parcel delivery motivates postal operators to foster the development of predictive systems to improve customer service.

Fine-grained Early Frequency Attention for Deep Speaker Representation Learning

no code implementations • 3 Sep 2020 • Amirhossein Hajavi, Ali Etemad

We evaluate the proposed model on three tasks of speaker recognition, speech emotion recognition, and spoken digit recognition.

Representation Learning Speaker Recognition +3

Unsupervised Multi-Modal Representation Learning for Affective Computing with Multi-Corpus Wearable Data

no code implementations • 24 Aug 2020 • Kyle Ross, Paul Hungler, Ali Etemad

The results show the widespread applicability of stacked convolutional autoencoders used with machine learning for affective computing.

BIG-bench Machine Learning Emotion Recognition +1

Spatio-Temporal EEG Representation Learning on Riemannian Manifold and Euclidean Space

1 code implementation • 19 Aug 2020 • Guangyi Zhang, Ali Etemad

Moreover, our proposed method learns the temporal information via differential entropy and logarithm power spectrum density features extracted from EEG signals in a Euclidean space using a deep long short-term memory network with a soft attention mechanism.

Binary Classification Decision Making +6

Paper
Code

Deep Multitask Learning for Pervasive BMI Estimation and Identity Recognition in Smart Beds

no code implementations • 18 Jun 2020 • Vandad Davoodnia, Monet Slinowsky, Ali Etemad

Smart devices in the Internet of Things (IoT) paradigm provide a variety of unobtrusive and pervasive means for continuous monitoring of biometrics and health information.

BIG-bench Machine Learning

Paper
Add Code

Two-Level Attention-based Fusion Learning for RGB-D Face Recognition

1 code implementation • 29 Feb 2020 • Hardik Uppal, Alireza Sepas-Moghaddam, Michael Greenspan, Ali Etemad

A novel attention-aware method is proposed to fuse two image modalities, RGB and depth, for enhanced RGB-D facial recognition.

Face Recognition Transfer Learning +1

Paper
Code

Self-supervised ECG Representation Learning for Emotion Recognition

2 code implementations • 4 Feb 2020 • Pritam Sarkar, Ali Etemad

Six different signal transformations are applied to the ECG signals, and transformation recognition is performed as pretext tasks.

Emotion Recognition Multi-Task Learning +1

Paper
Code

Capsule Attention for Multimodal EEG-EOG Representation Learning with Application to Driver Vigilance Estimation

no code implementations • 17 Dec 2019 • Guangyi Zhang, Ali Etemad

To enable the system to focus on the most salient parts of the learned multimodal representations, we propose an architecture composed of a capsule attention mechanism following a deep Long Short-Term Memory (LSTM) network.

Brain Computer Interface EEG +1

Paper
Add Code

Detecting Multiple Speech Disfluencies using a Deep Residual Network with Bidirectional Long Short-Term Memory

no code implementations • IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019 • Tedd Kourkounakis, Amirhossein Hajavi, Ali Etemad

Stuttering is a speech impediment affecting tens of millions of people on an everyday basis.

General Classification speech-recognition +1

Paper
Add Code

Self-supervised Learning for ECG-based Emotion Recognition

2 code implementations • 14 Oct 2019 • Pritam Sarkar, Ali Etemad

Our proposed architecture consists of two main networks, a signal transformation recognition network and an emotion recognition network.

Emotion Recognition Self-Supervised Learning

Paper
Code

In-bed Pressure-based Pose Estimation using Image Space Representation Learning

no code implementations • 21 Aug 2019 • Vandad Davoodnia, Saeed Ghorbani, Ali Etemad

Recent advances in deep pose estimation models have proven to be effective in a wide range of applications such as health monitoring, sports, animations, and robotics.

Pose Estimation Representation Learning

Paper
Add Code

Classification of Hand Movements from EEG using a Deep Attention-based LSTM Network

no code implementations • 6 Aug 2019 • Guangyi Zhang, Vandad Davoodnia, Alireza Sepas-Moghaddam, Yaoxue Zhang, Ali Etemad

Classifying limb movements using brain activity is an important task in Brain-computer Interfaces (BCI) that has been successfully used in multiple application domains, ranging from human-computer interaction to medical and biomedical applications.

Deep Attention EEG +3

Paper
Add Code

Auto-labelling of Markers in Optical Motion Capture by Permutation Learning

no code implementations • 31 Jul 2019 • Saeed Ghorbani, Ali Etemad, Nikolaus F. Troje

Optical marker-based motion capture is a vital tool in applications such as motion and behavioural analysis, animation, and biomechanics.

Paper
Add Code

Classification of Cognitive Load and Expertise for Adaptive Simulation using Deep Multitask Learning

no code implementations • 31 Jul 2019 • Pritam Sarkar, Kyle Ross, Aaron J. Ruberto, Dirk Rodenburg, Paul Hungler, Ali Etemad

Simulations are a pedagogical means of enabling a risk-free way for healthcare practitioners to learn, maintain, or enhance their knowledge and skills.

General Classification

Paper
Add Code

A Deep Neural Network for Short-Segment Speaker Recognition

no code implementations • 22 Jul 2019 • Amirhossein Hajavi, Ali Etemad

Today's interactive devices such as smartphone assistants and smart speakers often deal with short-duration speech segments.

Speaker Recognition

Paper
Add Code

Long Short-Term Memory with Gate and State Level Fusion for Light Field-Based Face Recognition

no code implementations • 11 May 2019 • Alireza Sepas-Moghaddam, Ali Etemad, Fernando Pereira, Paulo Lobato Correia

In this context, this paper proposes two novel LSTM cell architectures that are able to learn jointly from multiple simultaneously acquired sequences, with the aim of creating richer and more effective models for recognition tasks.

Benchmarking Face Recognition +1

Paper
Add Code
