Search Results for author: Suhas Lohit

Found 24 papers, 7 papers with code

TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models

1 code implementation • 25 Apr 2024 • Haomiao Ni, Bernhard Egger, Suhas Lohit, Anoop Cherian, Ye Wang, Toshiaki Koike-Akino, Sharon X. Huang, Tim K. Marks

To guide video generation with the additional image input, we propose a "repeat-and-slide" strategy that modulates the reverse denoising process, allowing the frozen diffusion model to synthesize a video frame-by-frame starting from the provided image.

Denoising Image to Video Generation

417

Paper
Code

Multimodal 3D Object Detection on Unseen Domains

no code implementations • 17 Apr 2024 • Deepti Hegde, Suhas Lohit, Kuan-Chuan Peng, Michael J. Jones, Vishal M. Patel

To this end, we propose CLIX$^\text{3D}$, a multimodal fusion and supervised contrastive learning framework for 3D object detection that performs alignment of object features from same-class samples of different domains while pushing the features from different classes apart.

3D Object Detection Autonomous Driving +5

Paper
Add Code

Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection

no code implementations • 17 Apr 2024 • Deepti Hegde, Suhas Lohit, Kuan-Chuan Peng, Michael J. Jones, Vishal M. Patel

This can enable improved performance in downstream tasks that are equivariant to such transformations.

3D Object Detection Object +3

Paper
Add Code

G-RepsNet: A Fast and General Construction of Equivariant Networks for Arbitrary Matrix Groups

no code implementations • 23 Feb 2024 • Sourya Basu, Suhas Lohit, Matthew Brand

Recent work by Finzi et al. (2021) directly solves the equivariance constraint for arbitrary matrix groups to obtain equivariant MLPs (EMLPs).

Image Classification Inductive Bias

Paper
Add Code

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis

no code implementations • ICCV 2023 • Nithin Gopalakrishnan Nair, Anoop Cherian, Suhas Lohit, Ye Wang, Toshiaki Koike-Akino, Vishal M. Patel, Tim K. Marks

To this end, and capitalizing on the powerful fine-grained generative control offered by the recent diffusion-based generative models, we introduce Steered Diffusion, a generalized framework for photorealistic zero-shot conditional image generation using a diffusion model trained for unconditional generation.

Colorization Conditional Image Generation +2

Paper
Add Code

Tensor Factorization for Leveraging Cross-Modal Knowledge in Data-Constrained Infrared Object Detection

no code implementations • 28 Sep 2023 • Manish Sharma, Moitreya Chatterjee, Kuan-Chuan Peng, Suhas Lohit, Michael Jones

We first pretrain these factor matrices on the RGB modality, for which plenty of training data are assumed to exist and then augment only a few trainable parameters for training on the IR modality to avoid over-fitting, while encouraging them to capture complementary cues from those trained only on the RGB modality.

object-detection Object Detection +1

Paper
Add Code

Pixel-Grounded Prototypical Part Networks

no code implementations • 25 Sep 2023 • Zachariah Carmichael, Suhas Lohit, Anoop Cherian, Michael Jones, Walter Scheirer

Prototypical part neural networks (ProtoPartNNs), namely PROTOPNET and its derivatives, are an intrinsically interpretable approach to machine learning.

Object

Paper
Add Code

Are Deep Neural Networks SMARTer than Second Graders?

1 code implementation • CVPR 2023 • Anoop Cherian, Kuan-Chuan Peng, Suhas Lohit, Kevin A. Smith, Joshua B. Tenenbaum

To answer this question, we propose SMART: a Simple Multimodal Algorithmic Reasoning Task and the associated SMART-101 dataset, for evaluating the abstraction, deduction, and generalization abilities of neural networks in solving visuo-linguistic puzzles designed specifically for children in the 6--8 age group.

Language Modelling Meta-Learning +1

Paper
Code

Cross-Modal Knowledge Transfer Without Task-Relevant Source Data

no code implementations • 8 Sep 2022 • Sk Miraj Ahmed, Suhas Lohit, Kuan-Chuan Peng, Michael J. Jones, Amit K. Roy-Chowdhury

In such cases, transferring knowledge from a neural network trained on a well-labeled large dataset in the source modality (RGB) to a neural network that works on a target modality (depth, infrared, etc.)

Autonomous Navigation Transfer Learning

Paper
Add Code

Learning Partial Equivariances from Data

1 code implementation • 19 Oct 2021 • David W. Romero, Suhas Lohit

Frequently, transformations occurring in data can be better represented by a subset of a group than by a group as a whole, e. g., rotations in $[-90^{\circ}, 90^{\circ}]$.

Image Classification Rotated MNIST

Paper
Code

Understanding the Success of Knowledge Distillation -- A Data Augmentation Perspective

no code implementations • 29 Sep 2021 • Huan Wang, Suhas Lohit, Michael Jeffrey Jones, Yun Fu

We achieve new state-of-the-art accuracy by using the original KD loss armed with stronger augmentation schemes, compared to existing state-of-the-art methods that employ more advanced distillation losses.

Active Learning Data Augmentation +1

Paper
Add Code

Rotation-Invariant Autoencoders for Signals on Spheres

no code implementations • 8 Dec 2020 • Suhas Lohit, Shubhendu Trivedi

These newly proposed convolutional layers naturally extend the notion of convolution to functions on the unit sphere $S^2$ and the group of rotations $SO(3)$ and these layers are equivariant to 3D rotations.

Clustering Retrieval

Paper
Add Code

Model Compression Using Optimal Transport

no code implementations • 7 Dec 2020 • Suhas Lohit, Michael Jones

Model compression methods are important to allow for easier deployment of deep learning models in compute, memory and energy-constrained environments such as mobile phones.

Image Classification Knowledge Distillation +1

Paper
Add Code

Multi-head Knowledge Distillation for Model Compression

no code implementations • 5 Dec 2020 • Huan Wang, Suhas Lohit, Michael Jones, Yun Fu

We add loss terms for training the student that measure the dissimilarity between student and teacher outputs of the auxiliary classifiers.

Image Classification Knowledge Distillation +1

Paper
Add Code

What Makes a "Good" Data Augmentation in Knowledge Distillation -- A Statistical Perspective

1 code implementation • 5 Dec 2020 • Huan Wang, Suhas Lohit, Mike Jones, Yun Fu

What makes a "good" DA in KD?

Active Learning Data Augmentation +1

Paper
Code

Recovering Trajectories of Unmarked Joints in 3D Human Actions Using Latent Space Optimization

no code implementations • 3 Dec 2020 • Suhas Lohit, Rushil Anirudh, Pavan Turaga

Motion capture (mocap) and time-of-flight based sensing of human actions are becoming increasingly popular modalities to perform robust activity analysis.

Action Recognition

Paper
Add Code

Generative Patch Priors for Practical Compressive Image Recovery

1 code implementation • 18 Jun 2020 • Rushil Anirudh, Suhas Lohit, Pavan Turaga

In this paper, we propose the generative patch prior (GPP) that defines a generative prior for compressive image recovery, based on patch-manifold models.

Compressive Sensing Image Reconstruction +1

Paper
Code

Temporal Transformer Networks: Joint Learning of Invariant and Discriminative Time Warping

1 code implementation • CVPR 2019 • Suhas Lohit, Qiao Wang, Pavan Turaga

We call this a temporal transformer network (TTN).

3D Action Recognition General Classification +3

Paper
Code

Rate-Adaptive Neural Networks for Spatial Multiplexers

no code implementations • 8 Sep 2018 • Suhas Lohit, Rajhans Singh, Kuldeep Kulkarni, Pavan Turaga

Using standard datasets, we demonstrate that, when tested over a range of MRs, a rate-adaptive network can provide high quality reconstruction over a the entire range, resulting in up to about 15 dB improvement over previous methods, where the network is valid for only one MR. We demonstrate the effectiveness of our approach for sample-efficient object tracking where video frames are acquired at dynamically varying MRs. We also extend this algorithm to learn the measurement operator in conjunction with image recognition networks.

Object Tracking valid

Paper
Add Code

CS-VQA: Visual Question Answering with Compressively Sensed Images

no code implementations • 8 Jun 2018 • Li-Chi Huang, Kuldeep Kulkarni, Anik Jha, Suhas Lohit, Suren Jayasuriya, Pavan Turaga

Visual Question Answering (VQA) is a complex semantic task requiring both natural language processing and visual recognition.

Question Answering Visual Question Answering

Paper
Add Code

Learning Invariant Riemannian Geometric Representations Using Deep Nets

no code implementations • 30 Aug 2017 • Suhas Lohit, Pavan Turaga

Non-Euclidean constraints are inherent in many kinds of data in computer vision and machine learning, typically as a result of specific invariance requirements that need to be respected during high-level inference.

BIG-bench Machine Learning Image Classification

Paper
Add Code

Convolutional Neural Networks for Non-iterative Reconstruction of Compressively Sensed Images

no code implementations • 15 Aug 2017 • Suhas Lohit, Kuldeep Kulkarni, Ronan Kerviche, Pavan Turaga, Amit Ashok

We show empirically that our algorithm yields reconstructions with higher PSNRs compared to iterative algorithms at low measurement rates and in presence of measurement noise.

Compressive Sensing Object Tracking

Paper
Add Code

ReconNet: Non-Iterative Reconstruction of Images From Compressively Sensed Measurements

no code implementations • CVPR 2016 • Kuldeep Kulkarni, Suhas Lohit, Pavan Turaga, Ronan Kerviche, Amit Ashok

The intermediate reconstruction is fed into an off-the-shelf denoiser to obtain the final reconstructed image.

Real-Time Visual Tracking

Paper
Add Code

ReconNet: Non-Iterative Reconstruction of Images from Compressively Sensed Random Measurements

1 code implementation • CVPR 2016 • Kuldeep Kulkarni, Suhas Lohit, Pavan Turaga, Ronan Kerviche, Amit Ashok

The intermediate reconstruction is fed into an off-the-shelf denoiser to obtain the final reconstructed image.

Real-Time Visual Tracking

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.