Search Results for author: Amir Zamir

Found 25 papers, 10 papers with code

BRAVE: Broadening the visual encoding of vision-language models

no code implementations10 Apr 2024 Oğuzhan Fatih Kar, Alessio Tonioni, Petra Poklukar, Achin Kulshrestha, Amir Zamir, Federico Tombari

Our results highlight the potential of incorporating different visual biases for a more broad and contextualized visual understanding of VLMs.

Hallucination Language Modelling +1

Controlled Training Data Generation with Diffusion Models

no code implementations22 Mar 2024 Teresa Yeo, Andrei Atanov, Harold Benoit, Aleksandr Alekseev, Ruchira Ray, Pooya Esmaeil Akhoondi, Amir Zamir

In this work, we present a method to control a text-to-image generative model to produce training data specifically "useful" for supervised learning.

Language Modelling

Unraveling the Key Components of OOD Generalization via Diversification

no code implementations26 Dec 2023 Harold Benoit, Liangze Jiang, Andrei Atanov, Oğuzhan Fatih Kar, Mattia Rigotti, Amir Zamir

We show that (1) diversification methods are highly sensitive to the distribution of the unlabeled data used for diversification and can underperform significantly when away from a method-specific sweet spot.

4M: Massively Multimodal Masked Modeling

no code implementations NeurIPS 2023 David Mizrahi, Roman Bachmann, Oğuzhan Fatih Kar, Teresa Yeo, Mingfei Gao, Afshin Dehghan, Amir Zamir

Current machine learning models for vision are often highly specialized and limited to a single modality and task.

An Information-Theoretic Approach to Transferability in Task Transfer Learning

no code implementations20 Dec 2022 Yajie Bao, Yang Li, Shao-Lun Huang, Lin Zhang, Lizhong Zheng, Amir Zamir, Leonidas Guibas

Task transfer learning is a popular technique in image processing applications that uses pre-trained models to reduce the supervision cost of related tasks.

Model Selection Transfer Learning

PALMER: Perception-Action Loop with Memory for Long-Horizon Planning

no code implementations8 Dec 2022 Onur Beker, Mohammad Mohammadi, Amir Zamir

For training these perceptual representations, we combine Q-learning with contrastive representation learning to create a latent space where the distance between the embeddings of two states captures how easily an optimal policy can traverse between them.

Q-Learning Representation Learning

Task Discovery: Finding the Tasks that Neural Networks Generalize on

no code implementations1 Dec 2022 Andrei Atanov, Andrei Filatov, Teresa Yeo, Ajay Sohmshetty, Amir Zamir

An intriguing question would be: what if, instead of fixing the task and searching in the model space, we fix the model and search in the task space?

MultiMAE: Multi-modal Multi-task Masked Autoencoders

1 code implementation4 Apr 2022 Roman Bachmann, David Mizrahi, Andrei Atanov, Amir Zamir

We show this pre-training strategy leads to a flexible, simple, and efficient framework with improved transfer results to downstream tasks.

Depth Estimation Image Classification +1

3D Common Corruptions and Data Augmentation

1 code implementation CVPR 2022 Oğuzhan Fatih Kar, Teresa Yeo, Andrei Atanov, Amir Zamir

We introduce a set of image transformations that can be used as corruptions to evaluate the robustness of models as well as data augmentation mechanisms for training neural networks.

Benchmarking Data Augmentation

Simple Control Baselines for Evaluating Transfer Learning

no code implementations7 Feb 2022 Andrei Atanov, Shijian Xu, Onur Beker, Andrei Filatov, Amir Zamir

Transfer learning has witnessed remarkable progress in recent years, for example, with the introduction of augmentation-based contrastive self-supervised learning methods.

Image Classification Self-Supervised Learning +1

Measuring the Effectiveness of Self-Supervised Learning using Calibrated Learning Curves

no code implementations29 Sep 2021 Andrei Atanov, Shijian Xu, Onur Beker, Andrey Filatov, Amir Zamir

Self-supervised learning has witnessed remarkable progress in recent years, in particular with the introduction of augmentation-based contrastive methods.

Image Classification Self-Supervised Learning +1

Robustness via Cross-Domain Ensembles

no code implementations ICCV 2021 Teresa Yeo, Oğuzhan Fatih Kar, Alexander Sax, Amir Zamir

We present a method for making neural network predictions robust to shifts from the training data distribution.

Robustness via Probabilistic Cross-Task Ensembles

no code implementations1 Jan 2021 Teresa Yeo, Oguzhan Fatih Kar, Amir Zamir

We present a method for making predictions using neural networks that, at the test time, is robust against shifts from the training data distribution.

Side-Tuning: A Baseline for Network Adaptation via Additive Side Networks

2 code implementations ECCV 2020 Jeffrey O. Zhang, Alexander Sax, Amir Zamir, Leonidas Guibas, Jitendra Malik

When training a neural network for a desired task, one may prefer to adapt a pre-trained network rather than starting from randomly initialized weights.

Imitation Learning Incremental Learning +3

Learning to Navigate Using Mid-Level Visual Priors

1 code implementation23 Dec 2019 Alexander Sax, Jeffrey O. Zhang, Bradley Emi, Amir Zamir, Silvio Savarese, Leonidas Guibas, Jitendra Malik

How much does having visual priors about the world (e. g. the fact that the world is 3D) assist in learning to perform downstream motor tasks (e. g. navigating a complex environment)?

Navigate reinforcement-learning +2

Gibson Env: Real-World Perception for Embodied Agents

5 code implementations CVPR 2018 Fei Xia, Amir Zamir, Zhi-Yang He, Alexander Sax, Jitendra Malik, Silvio Savarese

Developing visual perception models for active agents and sensorimotor control are cumbersome to be done in the physical world, as existing algorithms are too slow to efficiently learn in real-time and robots are fragile and costly.

Domain Adaptation General Reinforcement Learning +1

Semantic Cross-View Matching

no code implementations31 Oct 2015 Francesco Castaldo, Amir Zamir, Roland Angst, Francesco Palmieri, Silvio Savarese

In this paper, we therefore explore this idea and propose an automatic method for detecting and representing the semantic information of an RGB image with the goal of performing cross-view matching with a (non-RGB) geographic information system (GIS).

Unsupervised Semantic Parsing of Video Collections

no code implementations ICCV 2015 Ozan Sener, Amir Zamir, Silvio Savarese, Ashutosh Saxena

The proposed method is capable of providing a semantic "storyline" of the video composed of its objective steps.

Unsupervised semantic parsing

Cannot find the paper you are looking for? You can Submit a new open access paper.