1 code implementation • 10 Dec 2023 • Mike Ranzinger, Greg Heinrich, Jan Kautz, Pavlo Molchanov
A handful of visual foundation models (VFMs) have recently emerged as the backbones for numerous downstream tasks.
2 code implementations • 9 Jun 2023 • Ali Hatamizadeh, Greg Heinrich, Hongxu Yin, Andrew Tao, Jose M. Alvarez, Jan Kautz, Pavlo Molchanov
At a high level, global self-attentions enable the efficient cross-window communication at lower costs.
8 code implementations • 20 Jun 2022 • Ali Hatamizadeh, Hongxu Yin, Greg Heinrich, Jan Kautz, Pavlo Molchanov
Pre-trained GC ViT backbones in downstream tasks of object detection, instance segmentation, and semantic segmentation using MS COCO and ADE20K datasets outperform prior work consistently.
Ranked #132 on Semantic Segmentation on ADE20K
no code implementations • 7 Feb 2019 • Greg Heinrich, Iuri Frosio
Training intelligent agents through reinforcement learning is a notoriously unstable procedure.