no code implementations • 29 Apr 2022 • Mahdi M. Kalayeh, Shervin Ardeshir, Lingyi Liu, Nagendra Kamath, Ashok Chandrashekar
The abundance and ease of utilizing sound, along with the fact that auditory clues reveal a plethora of information about what happens in a scene, make the audio-visual space an intuitive choice for representation learning.
no code implementations • 13 Apr 2022 • Shervin Ardeshir, Nagendra Kamath, Hossein Taghavi
Prominence and interactions: Character(s) in the thumbnail should be important character(s) in the video, to prevent the algorithm from suggesting non-representative frames as candidates.
no code implementations • NeurIPS 2021 • Mahdi M. Kalayeh, Nagendra Kamath, Lingyi Liu, Ashok Chandrashekar
The abundance and ease of utilizing sound, along with the fact that auditory clues reveal so much about what happens in the scene, make the audio-visual space a perfectly intuitive choice for self-supervised representation learning.
no code implementations • 25 Jul 2019 • Dong Liu, Rohit Puri, Nagendra Kamath, Subhabrata Bhattachary
In this work, we propose to model the image composition information as the mutual dependency of its local regions, and design a novel architecture to leverage such information to boost the performance of aesthetics assessment.