Search Results for author: Suhas Lohit

Found 24 papers, 7 papers with code

TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models

1 code implementation25 Apr 2024 Haomiao Ni, Bernhard Egger, Suhas Lohit, Anoop Cherian, Ye Wang, Toshiaki Koike-Akino, Sharon X. Huang, Tim K. Marks

To guide video generation with the additional image input, we propose a "repeat-and-slide" strategy that modulates the reverse denoising process, allowing the frozen diffusion model to synthesize a video frame-by-frame starting from the provided image.

Denoising Image to Video Generation

Multimodal 3D Object Detection on Unseen Domains

no code implementations17 Apr 2024 Deepti Hegde, Suhas Lohit, Kuan-Chuan Peng, Michael J. Jones, Vishal M. Patel

To this end, we propose CLIX$^\text{3D}$, a multimodal fusion and supervised contrastive learning framework for 3D object detection that performs alignment of object features from same-class samples of different domains while pushing the features from different classes apart.

3D Object Detection Autonomous Driving +5

G-RepsNet: A Fast and General Construction of Equivariant Networks for Arbitrary Matrix Groups

no code implementations23 Feb 2024 Sourya Basu, Suhas Lohit, Matthew Brand

Recent work by Finzi et al. (2021) directly solves the equivariance constraint for arbitrary matrix groups to obtain equivariant MLPs (EMLPs).

Image Classification Inductive Bias

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis

no code implementations ICCV 2023 Nithin Gopalakrishnan Nair, Anoop Cherian, Suhas Lohit, Ye Wang, Toshiaki Koike-Akino, Vishal M. Patel, Tim K. Marks

To this end, and capitalizing on the powerful fine-grained generative control offered by the recent diffusion-based generative models, we introduce Steered Diffusion, a generalized framework for photorealistic zero-shot conditional image generation using a diffusion model trained for unconditional generation.

Colorization Conditional Image Generation +2

Tensor Factorization for Leveraging Cross-Modal Knowledge in Data-Constrained Infrared Object Detection

no code implementations28 Sep 2023 Manish Sharma, Moitreya Chatterjee, Kuan-Chuan Peng, Suhas Lohit, Michael Jones

We first pretrain these factor matrices on the RGB modality, for which plenty of training data are assumed to exist and then augment only a few trainable parameters for training on the IR modality to avoid over-fitting, while encouraging them to capture complementary cues from those trained only on the RGB modality.

object-detection Object Detection +1

Pixel-Grounded Prototypical Part Networks

no code implementations25 Sep 2023 Zachariah Carmichael, Suhas Lohit, Anoop Cherian, Michael Jones, Walter Scheirer

Prototypical part neural networks (ProtoPartNNs), namely PROTOPNET and its derivatives, are an intrinsically interpretable approach to machine learning.

Object

Are Deep Neural Networks SMARTer than Second Graders?

1 code implementation CVPR 2023 Anoop Cherian, Kuan-Chuan Peng, Suhas Lohit, Kevin A. Smith, Joshua B. Tenenbaum

To answer this question, we propose SMART: a Simple Multimodal Algorithmic Reasoning Task and the associated SMART-101 dataset, for evaluating the abstraction, deduction, and generalization abilities of neural networks in solving visuo-linguistic puzzles designed specifically for children in the 6--8 age group.

Language Modelling Meta-Learning +1

Cross-Modal Knowledge Transfer Without Task-Relevant Source Data

no code implementations8 Sep 2022 Sk Miraj Ahmed, Suhas Lohit, Kuan-Chuan Peng, Michael J. Jones, Amit K. Roy-Chowdhury

In such cases, transferring knowledge from a neural network trained on a well-labeled large dataset in the source modality (RGB) to a neural network that works on a target modality (depth, infrared, etc.)

Autonomous Navigation Transfer Learning

Learning Partial Equivariances from Data

1 code implementation19 Oct 2021 David W. Romero, Suhas Lohit

Frequently, transformations occurring in data can be better represented by a subset of a group than by a group as a whole, e. g., rotations in $[-90^{\circ}, 90^{\circ}]$.

Image Classification Rotated MNIST

Understanding the Success of Knowledge Distillation -- A Data Augmentation Perspective

no code implementations29 Sep 2021 Huan Wang, Suhas Lohit, Michael Jeffrey Jones, Yun Fu

We achieve new state-of-the-art accuracy by using the original KD loss armed with stronger augmentation schemes, compared to existing state-of-the-art methods that employ more advanced distillation losses.

Active Learning Data Augmentation +1

Rotation-Invariant Autoencoders for Signals on Spheres

no code implementations8 Dec 2020 Suhas Lohit, Shubhendu Trivedi

These newly proposed convolutional layers naturally extend the notion of convolution to functions on the unit sphere $S^2$ and the group of rotations $SO(3)$ and these layers are equivariant to 3D rotations.

Clustering Retrieval

Model Compression Using Optimal Transport

no code implementations7 Dec 2020 Suhas Lohit, Michael Jones

Model compression methods are important to allow for easier deployment of deep learning models in compute, memory and energy-constrained environments such as mobile phones.

Image Classification Knowledge Distillation +1

Multi-head Knowledge Distillation for Model Compression

no code implementations5 Dec 2020 Huan Wang, Suhas Lohit, Michael Jones, Yun Fu

We add loss terms for training the student that measure the dissimilarity between student and teacher outputs of the auxiliary classifiers.

Image Classification Knowledge Distillation +1

Recovering Trajectories of Unmarked Joints in 3D Human Actions Using Latent Space Optimization

no code implementations3 Dec 2020 Suhas Lohit, Rushil Anirudh, Pavan Turaga

Motion capture (mocap) and time-of-flight based sensing of human actions are becoming increasingly popular modalities to perform robust activity analysis.

Action Recognition

Generative Patch Priors for Practical Compressive Image Recovery

1 code implementation18 Jun 2020 Rushil Anirudh, Suhas Lohit, Pavan Turaga

In this paper, we propose the generative patch prior (GPP) that defines a generative prior for compressive image recovery, based on patch-manifold models.

Compressive Sensing Image Reconstruction +1

Rate-Adaptive Neural Networks for Spatial Multiplexers

no code implementations8 Sep 2018 Suhas Lohit, Rajhans Singh, Kuldeep Kulkarni, Pavan Turaga

Using standard datasets, we demonstrate that, when tested over a range of MRs, a rate-adaptive network can provide high quality reconstruction over a the entire range, resulting in up to about 15 dB improvement over previous methods, where the network is valid for only one MR. We demonstrate the effectiveness of our approach for sample-efficient object tracking where video frames are acquired at dynamically varying MRs. We also extend this algorithm to learn the measurement operator in conjunction with image recognition networks.

Object Tracking valid

CS-VQA: Visual Question Answering with Compressively Sensed Images

no code implementations8 Jun 2018 Li-Chi Huang, Kuldeep Kulkarni, Anik Jha, Suhas Lohit, Suren Jayasuriya, Pavan Turaga

Visual Question Answering (VQA) is a complex semantic task requiring both natural language processing and visual recognition.

Question Answering Visual Question Answering

Learning Invariant Riemannian Geometric Representations Using Deep Nets

no code implementations30 Aug 2017 Suhas Lohit, Pavan Turaga

Non-Euclidean constraints are inherent in many kinds of data in computer vision and machine learning, typically as a result of specific invariance requirements that need to be respected during high-level inference.

BIG-bench Machine Learning Image Classification

Convolutional Neural Networks for Non-iterative Reconstruction of Compressively Sensed Images

no code implementations15 Aug 2017 Suhas Lohit, Kuldeep Kulkarni, Ronan Kerviche, Pavan Turaga, Amit Ashok

We show empirically that our algorithm yields reconstructions with higher PSNRs compared to iterative algorithms at low measurement rates and in presence of measurement noise.

Compressive Sensing Object Tracking

Cannot find the paper you are looking for? You can Submit a new open access paper.