no code implementations • ICCV 2023 • Yujin Jeong, Wonjeong Ryoo, SeungHyun Lee, Dabin Seo, Wonmin Byeon, Sangpil Kim, Jinkyu Kim
Hence, we propose The Power of Sound (TPoS) model to incorporate audio input that includes both changeable temporal semantics and magnitude.
1 code implementation • 10 Nov 2022 • Yujin Jeong, Seongbeom Park, Suhong Moon, Jinkyu Kim
Here, we propose a model that predicts visual commonsense immorality in a zero-shot manner.