1 code implementation • 22 Aug 2023 • Silky Singh, Shripad Deshmukh, Mausoom Sarkar, Balaji Krishnamurthy
We demonstrate the effectiveness of our approach, named LOCATE, on multiple standard video object segmentation, image saliency detection, and object segmentation benchmarks, achieving results on par with and, in many cases surpassing state-of-the-art methods.
no code implementations • 10 Jul 2023 • Silky Singh, Shripad Deshmukh, Mausoom Sarkar, Rishabh Jain, Mayur Hemani, Balaji Krishnamurthy
Segmentation of objects in a video is challenging due to the nuances such as motion blurring, parallax, occlusions, changes in illumination, etc.
1 code implementation • 28 Jun 2023 • Sukriti Verma, Ayush Chopra, Jayakumar Subramanian, Mausoom Sarkar, Nikaash Puri, Piyush Gupta, Balaji Krishnamurthy
The two-time scale nature of SAC, which is an actor-critic algorithm, is characterised by the fact that the critic estimate has not converged for the actor at any given time, but since the critic learns faster than the actor, it ensures eventual consistency between the two.
no code implementations • CVPR 2023 • Mausoom Sarkar, Nikitha SR, Mayur Hemani, Rishabh Jain, Balaji Krishnamurthy
Face parsing is defined as the per-pixel labeling of images containing human faces.
no code implementations • 17 Jan 2023 • Tarun Ram Menta, Surgan Jandial, Akash Patil, Vimal KB, Saketh Bachu, Balaji Krishnamurthy, Vineeth N. Balasubramanian, Chirag Agarwal, Mausoom Sarkar
As transfer learning techniques are increasingly used to transfer knowledge from the source model to the target task, it becomes important to quantify which source models are suitable for a given target task without performing computationally expensive fine tuning.
no code implementations • ICCV 2023 • Rishabh Jain, Mayur Hemani, Duygu Ceylan, Krishna Kumar Singh, Jingwan Lu, Mausoom Sarkar, Balaji Krishnamurthy
Numerous pose-guided human editing methods have been explored by the vision community due to their extensive practical applications.
no code implementations • CVPR 2023 • Rishabh Jain, Krishna Kumar Singh, Mayur Hemani, Jingwan Lu, Mausoom Sarkar, Duygu Ceylan, Balaji Krishnamurthy
The task of human reposing involves generating a realistic image of a person standing in an arbitrary conceivable pose.
no code implementations • 12 Sep 2022 • Abhinav Java, Shripad Deshmukh, Milan Aggarwal, Surgan Jandial, Mausoom Sarkar, Balaji Krishnamurthy
MONOMER fuses context from visual, textual, and spatial modalities of snippets and documents to find query snippet in target documents.
no code implementations • 8 Sep 2021 • Sumedh A Sontakke, Sumegh Roychowdhury, Mausoom Sarkar, Nikaash Puri, Balaji Krishnamurthy, Laurent Itti
Humans excel at learning long-horizon tasks from demonstrations augmented with textual commentary, as evidenced by the burgeoning popularity of tutorial videos online.
1 code implementation • EMNLP 2020 • Milan Aggarwal, Hiresh Gupta, Mausoom Sarkar, Balaji Krishnamurthy
To mitigate this, we propose Form2Seq, a novel sequence-to-sequence (Seq2Seq) inspired framework for structure extraction using text, with a specific focus on forms, which leverages relative spatial arrangement of structures.
1 code implementation • 9 Jul 2021 • Milan Aggarwal, Mausoom Sarkar, Hiresh Gupta, Balaji Krishnamurthy
Experimental results show the effectiveness of our approach achieving a recall of 90. 29%, 73. 80%, 83. 12%, and 52. 72% for the above structures, respectively, outperforming semantic segmentation baselines significantly.
1 code implementation • 6 Oct 2020 • Sumegh Roychowdhury, Sumedh A. Sontakke, Nikaash Puri, Mausoom Sarkar, Milan Aggarwal, Pinkesh Badjatiya, Balaji Krishnamurthy, Laurent Itti
Also, they are believed to be arranged hierarchically, allowing for an efficient representation of complex long-horizon experiences.
no code implementations • 3 Sep 2020 • Surgan Jandial, Pinkesh Badjatiya, Pranit Chawla, Ayush Chopra, Mausoom Sarkar, Balaji Krishnamurthy
The ability to efficiently search for images is essential for improving the user experiences across various products.
no code implementations • 24 Jun 2020 • Surgan Jandial, Ayush Chopra, Mausoom Sarkar, Piyush Gupta, Balaji Krishnamurthy, Vineeth Balasubramanian
Deep neural networks (DNNs) are powerful learning machines that have enabled breakthroughs in several domains.
no code implementations • 15 Jan 2020 • Pinkesh Badjatiya, Mausoom Sarkar, Abhishek Sinha, Siddharth Singh, Nikaash Puri, Jayakumar Subramanian, Balaji Krishnamurthy
We show how agents trained with SQLoss evolve cooperative behavior in several social dilemma matrix games.
no code implementations • ECCV 2020 • Mausoom Sarkar, Milan Aggarwal, Arneh Jain, Hiresh Gupta, Balaji Krishnamurthy
We introduce our new human-annotated forms dataset and show that our method significantly outperforms different segmentation baselines on this dataset in extracting hierarchical structures.
no code implementations • 25 Sep 2019 • Ayush Chopra, Surgan Jandial, Mausoom Sarkar, Balaji Krishnamurthy, Vineeth Balasubramanian
Deep neural networks are powerful learning machines that have enabled breakthroughs in several domains.
1 code implementation • 23 Apr 2018 • Akilesh B, Abhishek Sinha, Mausoom Sarkar, Balaji Krishnamurthy
We develop an attention mechanism for multi-modal fusion of visual and textual modalities that allows the agent to learn to complete the task and achieve language grounding.
no code implementations • ICLR 2018 • Abhishek Sinha, Akilesh B, Mausoom Sarkar, Balaji Krishnamurthy
In this work, we focus on the problem of grounding language by training an agent to follow a set of natural language instructions and navigate to a target object in a 2D grid environment.
no code implementations • 17 Apr 2017 • Abhishek Sinha, Mausoom Sarkar, Aahitagni Mukherjee, Balaji Krishnamurthy
In this paper, we explore the idea of learning weight evolution pattern from a simple network for accelerating training of novel neural networks.