2 code implementations • CVPR 2022 • Brady Zhou, Philipp Krähenbühl
The architecture consists of a convolutional image encoder for each view and cross-view transformer layers to infer a map-view semantic segmentation.
Ranked #8 on Bird's-Eye View Semantic Segmentation on nuScenes
1 code implementation • 27 Aug 2020 • Brady Zhou, Nimit Kalra, Philipp Krähenbühl
We use these recognition datasets to link up a source and target domain to transfer models between them in a task distillation framework.
9 code implementations • 27 Dec 2019 • Dian Chen, Brady Zhou, Vladlen Koltun, Philipp Krähenbühl
We first train an agent that has access to privileged information.
Ranked #16 on Autonomous Driving on CARLA Leaderboard
no code implementations • 30 May 2019 • Brady Zhou, Philipp Krähenbühl, Vladlen Koltun
Thus the central question of our work: Does computer vision matter for action?
no code implementations • ICLR 2019 • Brady Zhou, Philipp Krähenbühl
We experimentally show that any GAN objective, including that of Wasserstein GANs, benefits from adversarial robustness both quantitatively and qualitatively.