Search Results for author: Karthik Raveendran

Found 8 papers, 5 papers with code

Blendshapes GHUM: Real-time Monocular Facial Blendshape Prediction

no code implementations • 11 Sep 2023 • Ivan Grishchenko, Geng Yan, Eduard Gabriel Bazavan, Andrei Zanfir, Nikolai Chinaev, Karthik Raveendran, Matthias Grundmann, Cristian Sminchisescu

We present Blendshapes GHUM, an on-device ML pipeline that predicts 52 facial blendshape coefficients at 30+ FPS on modern mobile phones, from a single monocular RGB image and enables facial motion capture applications like virtual avatars.

Paper
Add Code

Efficient Heterogeneous Video Segmentation at the Edge

no code implementations • 24 Aug 2022 • Jamie Menjay Lin, Siargey Pisarchyk, Juhyun Lee, David Tian, Tingbo Hou, Karthik Raveendran, Raman Sarokin, George Sung, Trent Tolley, Matthias Grundmann

We introduce an efficient video segmentation system for resource-limited edge devices leveraging heterogeneous compute.

Video Segmentation Video Semantic Segmentation

Paper
Add Code

BlazePose GHUM Holistic: Real-time 3D Human Landmarks and Pose Estimation

no code implementations • 23 Jun 2022 • Ivan Grishchenko, Valentin Bazarevsky, Andrei Zanfir, Eduard Gabriel Bazavan, Mihai Zanfir, Richard Yee, Karthik Raveendran, Matsvei Zhdanovich, Matthias Grundmann, Cristian Sminchisescu

We present BlazePose GHUM Holistic, a lightweight neural network pipeline for 3D human body landmarks and pose estimation, specifically tailored to real-time on-device inference.

3D Human Pose Estimation

Paper
Add Code

Attention Mesh: High-fidelity Face Mesh Prediction in Real-time

1 code implementation • 19 Jun 2020 • Ivan Grishchenko, Artsiom Ablavatski, Yury Kartynnik, Karthik Raveendran, Matthias Grundmann

We present Attention Mesh, a lightweight architecture for 3D face mesh prediction that uses attention to semantically meaningful regions.

Vocal Bursts Intensity Prediction

25,495

Paper
Code

Real-time Pupil Tracking from Monocular Video for Digital Puppetry

1 code implementation • 19 Jun 2020 • Artsiom Ablavatski, Andrey Vakunov, Ivan Grishchenko, Karthik Raveendran, Matsvei Zhdanovich

We present a simple, real-time approach for pupil tracking from live video on mobile devices.

Pupil Tracking

Paper
Code

BlazePose: On-device Real-time Body Pose tracking

7 code implementations • 17 Jun 2020 • Valentin Bazarevsky, Ivan Grishchenko, Karthik Raveendran, Tyler Zhu, Fan Zhang, Matthias Grundmann

We present BlazePose, a lightweight convolutional neural network architecture for human pose estimation that is tailored for real-time inference on mobile devices.

Ranked #1 on 3D Pose Estimation on Google-Yoga

2D Human Pose Estimation 3D Human Pose Estimation +4

25,495

Paper
Code

BlazeFace: Sub-millisecond Neural Face Detection on Mobile GPUs

10 code implementations • 11 Jul 2019 • Valentin Bazarevsky, Yury Kartynnik, Andrey Vakunov, Karthik Raveendran, Matthias Grundmann

We present BlazeFace, a lightweight and well-performing face detector tailored for mobile GPU inference.

Face Detection

12,074

Paper
Code

Floors are Flat: Leveraging Semantics for Real-Time Surface Normal Prediction

1 code implementation • 16 Jun 2019 • Steven Hickson, Karthik Raveendran, Alireza Fathi, Kevin Murphy, Irfan Essa

We propose 4 insights that help to significantly improve the performance of deep learning models that predict surface normals and semantic labels from a single RGB image.

Ranked #1 on Semantic Segmentation on ScanNetV2 (Pixel Accuracy metric)

Semantic Segmentation Surface Normals Estimation +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.