no code implementations • 16 Sep 2020 • Sathyanarayanan N. Aakur, Sanjoy Kundu, Nikhil Gunti
Building upon the compositional representation offered by Grenander's Pattern Theory formalism, we show that attention and commonsense knowledge can be used to enable the self-supervised discovery of novel actions in egocentric videos in an open-world setting, where data from the observed environment (the target domain) is open i. e., the vocabulary is partially known and training examples (both labeled and unlabeled) are not available.