Search Results for author: Chancharik Mitra

Found 2 papers, 2 papers with code

Compositional Chain-of-Thought Prompting for Large Multimodal Models

1 code implementation27 Nov 2023 Chancharik Mitra, Brandon Huang, Trevor Darrell, Roei Herzig

The combination of strong visual backbones and Large Language Model (LLM) reasoning has led to Large Multimodal Models (LMMs) becoming the current standard for a wide range of vision and language (VL) tasks.

Language Modelling Large Language Model +1

Which One? Leveraging Context Between Objects and Multiple Views for Language Grounding

2 code implementations12 Nov 2023 Chancharik Mitra, Abrar Anwar, Rodolfo Corona, Dan Klein, Trevor Darrell, Jesse Thomason

When connecting objects and their language referents in an embodied 3D environment, it is important to note that: (1) an object can be better characterized by leveraging comparative information between itself and other objects, and (2) an object's appearance can vary with camera position.

Object Position

Cannot find the paper you are looking for? You can Submit a new open access paper.