1 code implementation • 4 Feb 2022 • Andrea Burns, Deniz Arsan, Sanjna Agrawal, Ranjitha Kumar, Kate Saenko, Bryan A. Plummer
To study VLN with unknown command feasibility, we introduce a new dataset Mobile app Tasks with Iterative Feedback (MoTIF), where the goal is to complete a natural language command in a mobile app.
1 code implementation • ICCV 2021 • Samarth Mishra, Zhongping Zhang, Yuan Shen, Ranjitha Kumar, Venkatesh Saligrama, Bryan Plummer
This enables our model to identify that two images contain the same attribute, but can have it deemed irrelevant (e. g., due to fine-grained differences between them) and ignored for measuring similarity between the two images.
1 code implementation • 17 Apr 2021 • Andrea Burns, Deniz Arsan, Sanjna Agrawal, Ranjitha Kumar, Kate Saenko, Bryan A. Plummer
In recent years, vision-language research has shifted to study tasks which require more complex reasoning, such as interactive question answering, visual common sense reasoning, and question-answer plausibility prediction.
no code implementations • 18 Mar 2020 • Yuan Shen, Shanduojiao Jiang, Muhammad Rizky Wellyanto, Ranjitha Kumar
Finally, we trained a deep learning model that can explicitly predict and explain high level fashion concepts in a product image with its low level and domain specific fashion features.
2 code implementations • ECCV 2018 • Mariya I. Vasileva, Bryan A. Plummer, Krishna Dusad, Shreya Rajpal, Ranjitha Kumar, David Forsyth
Outfits in online fashion data are composed of items of many different types (e. g. top, bottom, shoes) that share some stylistic relationship with one another.
no code implementations • 17 Jun 2016 • Haizi Yu, Lav R. Varshney, Guy E. Garnett, Ranjitha Kumar
Throughout music history, theorists have identified and documented interpretable rules that capture the decisions of composers.