no code implementations • 26 May 2023 • Yunhao Ge, Jie Ren, Jiaping Zhao, KaiFeng Chen, Andrew Gallagher, Laurent Itti, Balaji Lakshminarayanan
Despite considerable effort, the problem remains significantly challenging in deep learning models due to their propensity to output over-confident predictions for out-of-distribution (OOD) inputs.
1 code implementation • CVPR 2023 • Yunhao Ge, Jie Ren, Andrew Gallagher, Yuxiao Wang, Ming-Hsuan Yang, Hartwig Adam, Laurent Itti, Balaji Lakshminarayanan, Jiaping Zhao
We also show that our method improves across ImageNet shifted datasets, four other datasets, and other model architectures such as LiT.
no code implementations • 3 Mar 2020 • Warren R. Morningstar, Sharad M. Vikram, Cusuh Ham, Andrew Gallagher, Joshua V. Dillon
Automatic Differentiation Variational Inference (ADVI) is a useful tool for efficiently learning probabilistic models in machine learning.
1 code implementation • 5 Jan 2019 • Joseph Roth, Sourish Chaudhuri, Ondrej Klejch, Radhika Marvin, Andrew Gallagher, Liat Kaver, Sharadh Ramaswamy, Arkadiusz Stopczynski, Cordelia Schmid, Zhonghua Xi, Caroline Pantofaru
The dataset contains temporally labeled face tracks in video, where each face instance is labeled as speaking or not, and whether the speech is audible.
Audio-Visual Active Speaker Detection • Speaker Diarization
1 code implementation • 30 Sep 2018 • Seong Joon Oh, Kevin Murphy, Jiyan Pan, Joseph Roth, Florian Schroff, Andrew Gallagher
Instance embeddings are an efficient and versatile image representation that facilitates applications like recognition, verification, retrieval, and clustering.
1 code implementation • 2 Aug 2018 • Sourish Chaudhuri, Joseph Roth, Daniel P. W. Ellis, Andrew Gallagher, Liat Kaver, Radhika Marvin, Caroline Pantofaru, Nathan Reale, Loretta Guarino Reid, Kevin Wilson, Zhonghua Xi
Speech activity detection (or endpointing) is an important processing step for applications such as speech recognition, language identification and speaker diarization.
Sound • Audio and Speech Processing
no code implementations • 13 Jun 2018 • Amir Sadovnik, Wassim Gharbi, Thanh Vu, Andrew Gallagher
In this work we propose the new, subjective task of quantifying perceived face similarity between a pair of faces.
no code implementations • CVPR 2013 • Zhaoyin Jia, Andrew Gallagher, Ashutosh Saxena, Tsuhan Chen
Our algorithm incorporates the intuition that a good 3D representation of the scene is the one that fits the data well, and is a stable, self-supporting (i.e., one that does not topple) arrangement of objects.
no code implementations • CVPR 2013 • Amir Sadovnik, Andrew Gallagher, Tsuhan Chen
However, this is not a trivial task.
no code implementations • CVPR 2013 • Adarsh Kowdle, Andrew Gallagher, Tsuhan Chen
We cast the problem of depth-layer segmentation as a discrete labeling problem on a spatiotemporal Markov Random Field (MRF) that uses the motion occlusion cues along with monocular cues and a smooth motion prior for the moving object.
no code implementations • ECCV 2012 • Huizhong Chen, Andrew Gallagher, Bernd Girod
Describing clothing appearance with semantic attributes is an appealing technique for many important applications.