no code implementations • 13 Sep 2023 • Anith Selvakumar, Homa Fashandi
In this paper, we present a methodology for achieving robust multimodal person representations optimized for open-set audio-visual speaker verification.
no code implementations • CVPR 2021 • Shengdong Zhang, Ehsan Nezhadarya, Homa Fashandi, Jiayi Liu, Darin Graham, Mohak Shah
BN uses scaling and shifting to normalize activations of mini-batches to accelerate convergence and improve generalization.