Search Results for author: Animesh Sinha

Found 8 papers, 1 papers with code

Gen2Det: Generate to Detect

no code implementations7 Dec 2023 Saksham Suri, Fanyi Xiao, Animesh Sinha, Sean Chang Culatana, Raghuraman Krishnamoorthi, Chenchen Zhu, Abhinav Shrivastava

In the long-tailed detection setting on LVIS, Gen2Det improves the performance on rare categories by a large margin while also significantly improving the performance on other categories, e. g. we see an improvement of 2. 13 Box AP and 1. 84 Mask AP over just training on real data on LVIS with Mask R-CNN.

Image Generation Object +2

Context Diffusion: In-Context Aware Image Generation

no code implementations6 Dec 2023 Ivona Najdenkoska, Animesh Sinha, Abhimanyu Dubey, Dhruv Mahajan, Vignesh Ramanathan, Filip Radenovic

We propose Context Diffusion, a diffusion-based framework that enables image generation models to learn from visual examples presented in context.

Image Generation In-Context Learning

Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

no code implementations17 Nov 2023 Animesh Sinha, Bo Sun, Anmol Kalia, Arantxa Casanova, Elliot Blanchard, David Yan, Winnie Zhang, Tony Nelli, Jiahui Chen, Hardik Shah, Licheng Yu, Mitesh Kumar Singh, Ankit Ramchandani, Maziar Sanjabi, Sonal Gupta, Amy Bearman, Dhruv Mahajan

Evaluation results show our method improves visual quality by 14%, prompt alignment by 16. 2% and scene diversity by 15. 3%, compared to prompt engineering the base Emu model for stickers generation.

Image Generation Prompt Engineering

CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval

no code implementations15 Feb 2022 Licheng Yu, Jun Chen, Animesh Sinha, Mengjiao MJ Wang, Hugo Chen, Tamara L. Berg, Ning Zhang

We introduce CommerceMM - a multimodal model capable of providing a diverse and granular understanding of commerce topics associated to the given piece of content (image, text, image+text), and having the capability to generalize to a wide range of tasks, including Multimodal Categorization, Image-Text Retrieval, Query-to-Product Retrieval, Image-to-Product Retrieval, etc.

Representation Learning Retrieval +1

Large-Scale Attribute-Object Compositions

no code implementations24 May 2021 Filip Radenovic, Animesh Sinha, Albert Gordo, Tamara Berg, Dhruv Mahajan

We study the problem of learning how to predict attribute-object compositions from images, and its generalization to unseen compositions missing from the training data.

Attribute Object

Qubit Routing using Graph Neural Network aided Monte Carlo Tree Search

1 code implementation1 Apr 2021 Animesh Sinha, Utkarsh Azad, Harjinder Singh

Near-term quantum hardware can support two-qubit operations only on the qubits that can interact with each other.

Cannot find the paper you are looking for? You can Submit a new open access paper.