no code implementations • ECCV 2020 • Shen Sang, Manmohan Chandraker
We present a novel physically-motivated deep network for joint shape and material estimation, as well as relighting under novel illumination conditions, using a single image captured by a mobile phone camera.
no code implementations • 5 May 2024 • Di Liu, Bingbing Zhuang, Dimitris N. Metaxas, Manmohan Chandraker
Specifically, due to the lack of correspondences between consecutive frames of sparse Lidar point clouds, static objects might appear to be moving: the so-called swimming effect.
no code implementations • 1 May 2024 • Shanlin Sun, Bingbing Zhuang, Ziyu Jiang, Buyu Liu, Xiaohui Xie, Manmohan Chandraker
In this paper, we propose several insights that allow a better utilization of Lidar data to improve NeRF quality on street scenes.
no code implementations • 23 Apr 2024 • Manyi Yao, Abhishek Aich, Yumin Suh, Amit Roy-Chowdhury, Christian Shelton, Manmohan Chandraker
The third step is to use the aforementioned derived dataset to train a gating network that predicts the number of encoder layers to be used, conditioned on the input image.
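A minimal sketch of such a gating head, assuming a pooled image feature and a small set of candidate encoder depths (all names, shapes, and the candidate depths here are hypothetical, not taken from the paper):

```python
import numpy as np

def gate_num_layers(feat, W, b, layer_options=(2, 4, 6)):
    """Hypothetical gating head: pick an encoder depth per input image.

    feat: (D,) pooled image feature.
    W:    (K, D) gating weights, b: (K,) biases, K = len(layer_options).
    Returns the chosen number of encoder layers for this input.
    """
    logits = W @ feat + b
    # Softmax over depth options; training would use a differentiable
    # selection (e.g. soft weighting), inference takes the argmax.
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    return layer_options[int(np.argmax(probs))]
```

At inference the chosen depth decides how many encoder layers actually run, so easy inputs can exit with less compute.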
no code implementations • 23 Apr 2024 • Abhishek Aich, Yumin Suh, Samuel Schulter, Manmohan Chandraker
With efficiency being a high priority for scaling such models, we observed that the state-of-the-art method Mask2Former uses ~50% of its compute only on the transformer encoder.
no code implementations • 6 Apr 2024 • Zaid Khan, Vijay Kumar BG, Samuel Schulter, Yun Fu, Manmohan Chandraker
We propose a method where we exploit existing annotations for a vision-language task to improvise a coarse reward signal for that task, treat the LLM as a policy, and apply reinforced self-training to improve the visual program synthesis ability of the LLM for that task.
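The filter-then-finetune step at the core of reinforced self-training can be sketched generically; the reward function and threshold below are illustrative assumptions, not the paper's exact recipe:

```python
def reinforced_selftrain_filter(candidates, reward_fn, threshold=1.0):
    """One reinforced self-training iteration, sketched.

    candidates: list of (prompt, program) pairs sampled from the LLM policy.
    reward_fn:  coarse reward derived from existing task annotations,
                e.g. 1.0 if executing `program` reproduces the annotated
                answer for `prompt`, else 0.0.
    Returns the high-reward pairs, which become fine-tuning data for
    the next round of the policy.
    """
    return [(p, g) for p, g in candidates if reward_fn(p, g) >= threshold]
```

Iterating sample-score-filter-finetune lets the LLM improve its program synthesis without any new human labels.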
no code implementations • 26 Mar 2024 • Mingfu Liang, Jong-Chyi Su, Samuel Schulter, Sparsh Garg, Shiyu Zhao, Ying Wu, Manmohan Chandraker
This necessitates an expensive process of continuously curating and annotating data with significant human effort.
no code implementations • 8 Mar 2024 • Tarun Kalluri, Bodhisattwa Prasad Majumder, Manmohan Chandraker
We introduce LaGTran, a novel framework that utilizes readily available or easily acquired text descriptions to guide robust transfer of discriminative knowledge from labeled source to unlabeled target data with domain shifts.
no code implementations • 17 Jan 2024 • Yu-Ying Yeh, Jia-Bin Huang, Changil Kim, Lei Xiao, Thu Nguyen-Phuoc, Numair Khan, Cheng Zhang, Manmohan Chandraker, Carl S Marshall, Zhao Dong, Zhengqin Li
In contrast, TextureDreamer can transfer highly detailed, intricate textures from real-world environments to arbitrary objects with only a few casually captured images, which could significantly democratize texture creation.
no code implementations • 4 Jan 2024 • Alex Trevithick, Matthew Chan, Towaki Takikawa, Umar Iqbal, Shalini De Mello, Manmohan Chandraker, Ravi Ramamoorthi, Koki Nagano
3D-aware Generative Adversarial Networks (GANs) have shown remarkable progress in learning to generate multi-view-consistent images and 3D geometries of scenes from collections of 2D images via neural volume rendering.
no code implementations • 31 Dec 2023 • Wei-Jer Chang, Francesco Pittaluga, Masayoshi Tomizuka, Wei Zhan, Manmohan Chandraker
These findings affirm that guided diffusion models provide a robust and versatile foundation for safety-critical, interactive traffic simulation, extending their utility across the broader landscape of autonomous driving.
no code implementations • 30 Dec 2023 • S P Sharan, Francesco Pittaluga, Vijay Kumar B G, Manmohan Chandraker
Although planning is a crucial component of the autonomous driving stack, researchers have yet to develop robust planning algorithms that are capable of safely handling the diverse range of possible driving scenarios.
1 code implementation • 29 Dec 2023 • Shiyu Zhao, Long Zhao, Vijay Kumar B. G, Yumin Suh, Dimitris N. Metaxas, Manmohan Chandraker, Samuel Schulter
The recent progress in language-based open-vocabulary object detection can be largely attributed to finding better ways of leveraging large-scale data with free-form text annotations.
no code implementations • 2 Dec 2023 • Salman S. Khan, Xiang Yu, Kaushik Mitra, Manmohan Chandraker, Francesco Pittaluga
OpEnCam encrypts the incoming light before capturing it using the modulating ability of optical masks.
no code implementations • ICCV 2023 • Mateusz Michalkiewicz, Masoud Faraki, Xiang Yu, Manmohan Chandraker, Mahsa Baktashmotlagh
Overfitting to the source domain is a common issue in gradient-based training of deep neural networks.
no code implementations • ICCV 2023 • Abhishek Aich, Samuel Schulter, Amit K. Roy-Chowdhury, Manmohan Chandraker, Yumin Suh
Further, we present a simple but effective search algorithm that translates user constraints to runtime width configurations of both the shared encoder and task decoders, for sampling the sub-architectures.
no code implementations • ICCV 2023 • Ishit Mehta, Manmohan Chandraker, Ravi Ramamoorthi
We introduce a theoretical framework for differentiable surface evolution that allows discrete topology changes through the use of topological derivatives for variational optimization of image functionals.
2 code implementations • 11 Aug 2023 • Shiyu Zhao, Samuel Schulter, Long Zhao, Zhixing Zhang, Vijay Kumar B. G, Yumin Suh, Manmohan Chandraker, Dimitris N. Metaxas
This work identifies two challenges of using self-training in OVD: noisy PLs from VLMs and frequent distribution changes of PLs.
1 code implementation • CVPR 2023 • Zaid Khan, Vijay Kumar BG, Samuel Schulter, Xiang Yu, Yun Fu, Manmohan Chandraker
We introduce SelTDA (Self-Taught Data Augmentation), a strategy for finetuning large VLMs on small-scale VQA datasets.
no code implementations • CVPR 2023 • Zhixiang Min, Bingbing Zhuang, Samuel Schulter, Buyu Liu, Enrique Dunn, Manmohan Chandraker
Monocular 3D object localization in driving scenes is a crucial task, but challenging due to its ill-posed nature.
no code implementations • 18 May 2023 • Chaitanya Animesh, Manmohan Chandraker
The recent state-of-the-art supervised contrastive (SupCon) loss extends self-supervised contrastive learning to the supervised setting by generalizing to multiple positives and negatives in a batch, and improves upon the cross-entropy loss.
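As an illustration, the batch form of the SupCon loss can be sketched in numpy (temperature and batch contents below are illustrative):

```python
import numpy as np

def supcon_loss(features, labels, temperature=0.1):
    """Supervised contrastive (SupCon) loss, batch form.

    features: (N, D) embeddings (L2-normalized inside).
    labels:   (N,) integer class labels.
    """
    features = features / np.linalg.norm(features, axis=1, keepdims=True)
    sim = features @ features.T / temperature      # pairwise similarities
    n = len(labels)
    logits_mask = ~np.eye(n, dtype=bool)           # exclude self-pairs
    # Positives: all other samples sharing the anchor's label.
    pos_mask = (labels[:, None] == labels[None, :]) & logits_mask

    # Log-softmax over all non-self samples (numerically stabilized).
    sim_max = sim.max(axis=1, keepdims=True)
    exp_sim = np.exp(sim - sim_max) * logits_mask
    log_prob = (sim - sim_max) - np.log(exp_sim.sum(axis=1, keepdims=True))

    # Average log-probability over positives, per anchor that has any.
    has_pos = pos_mask.sum(axis=1) > 0
    mean_log_prob_pos = (pos_mask * log_prob).sum(axis=1)[has_pos] \
        / pos_mask.sum(axis=1)[has_pos]
    return -mean_log_prob_pos.mean()
```

Embeddings that cluster by class drive the loss toward zero, while positives that land far apart are penalized.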
no code implementations • 7 May 2023 • Zhengqin Li, Li Yu, Mikhail Okunev, Manmohan Chandraker, Zhao Dong
For training, we significantly enhance the OpenRooms public dataset of photorealistic synthetic indoor scenes with around 360K HDR environment maps of much higher resolution and 38K video sequences, rendered with GPU-based path tracing.
no code implementations • 3 May 2023 • Alex Trevithick, Matthew Chan, Michael Stengel, Eric R. Chan, Chao Liu, Zhiding Yu, Sameh Khamis, Manmohan Chandraker, Ravi Ramamoorthi, Koki Nagano
We present a one-shot method to infer and render a photorealistic 3D representation from a single unposed image (e.g., face portrait) in real-time.
1 code implementation • ICCV 2023 • Liwen Wu, Rui Zhu, Mustafa B. Yaldiz, Yinhao Zhu, Hong Cai, Janarbek Matai, Fatih Porikli, Tzu-Mao Li, Manmohan Chandraker, Ravi Ramamoorthi
Inverse path tracing has recently been applied to joint material and lighting estimation, given geometry and multi-view HDR observations of an indoor scene.
no code implementations • CVPR 2023 • Tarun Kalluri, Wangdong Xu, Manmohan Chandraker
In recent years, several efforts have been aimed at improving the robustness of vision models to domains and environments unseen during training.
no code implementations • 9 Mar 2023 • Tarun Kalluri, Weiyao Wang, Heng Wang, Manmohan Chandraker, Lorenzo Torresani, Du Tran
Many top-down architectures for instance segmentation achieve significant success when trained and tested on a pre-defined closed-world taxonomy.
no code implementations • European Conference on Computer Vision (ECCV) 2022 • Zaid Tasneem, Giovanni Milione, Yi-Hsuan Tsai, Xiang Yu, Ashok Veeraraghavan, Manmohan Chandraker, Francesco Pittaluga
With over a billion sold each year, cameras are not only becoming ubiquitous and omnipresent, but are driving progress in a wide range of applications such as augmented/virtual reality, robotics, surveillance, security, autonomous navigation and many others.
no code implementations • 28 Oct 2022 • Sriram Narayanan, Dinesh Jayaraman, Manmohan Chandraker
We address key challenges in long-horizon embodied exploration and navigation by proposing a new object transport task and a novel modular framework for temporally extended navigation.
no code implementations • 23 Oct 2022 • Shubham Dokania, A. H. Abdul Hafez, Anbumani Subramanian, Manmohan Chandraker, C. V. Jawahar
Autonomous driving and assistance systems rely on annotated data from traffic and road scenarios to model and learn the various object relations in complex real-world scenarios.
1 code implementation • 16 Aug 2022 • Shubham Dokania, Anbumani Subramanian, Manmohan Chandraker, C. V. Jawahar
We show that using annotations and visual cues from existing datasets, we can facilitate automated multi-modal data generation, mimicking real scene properties with high fidelity, along with mechanisms to diversify samples in a physically meaningful way.
no code implementations • 4 Aug 2022 • Tarun Kalluri, Manmohan Chandraker
Domain adaptation for semantic segmentation across datasets consisting of the same categories has seen several recent successes.
1 code implementation • 27 Jul 2022 • Zhanpeng Feng, Shiliang Zhang, Rinyoichi Takezoe, Wenze Hu, Manmohan Chandraker, Li-Jia Li, Vijay K. Narayanan, Xiaoyu Wang
To facilitate the research in this field, this paper contributes an active learning benchmark framework named as ALBench for evaluating active learning in object detection.
1 code implementation • 25 Jul 2022 • Tarun Kalluri, Astuti Sharma, Manmohan Chandraker
Practical real world datasets with plentiful categories introduce new challenges for unsupervised domain adaptation, such as small inter-class discriminability, which existing approaches relying on domain invariance alone cannot handle sufficiently well.
Fine-Grained Visual Recognition, Unsupervised Domain Adaptation
1 code implementation • 18 Jul 2022 • Shiyu Zhao, Zhixing Zhang, Samuel Schulter, Long Zhao, Vijay Kumar B. G, Anastasis Stathopoulos, Manmohan Chandraker, Dimitris Metaxas
We propose a novel method that leverages the rich semantics available in recent vision and language models to localize and classify objects in unlabeled images, effectively generating pseudo labels for object detection.
Ranked #15 on Open Vocabulary Object Detection on MSCOCO (using extra training data)
1 code implementation • CVPR 2022 • Yu-Ying Yeh, Zhengqin Li, Yannick Hold-Geoffroy, Rui Zhu, Zexiang Xu, Miloš Hašan, Kalyan Sunkavalli, Manmohan Chandraker
Most indoor 3D scene reconstruction methods focus on recovering 3D geometry and scene layout.
no code implementations • CVPR 2022 • Rui Zhu, Zhengqin Li, Janarbek Matai, Fatih Porikli, Manmohan Chandraker
Indoor scenes exhibit significant appearance variations due to myriad interactions between arbitrarily diverse object shapes, spatially-changing materials, and complex lighting.
no code implementations • 19 May 2022 • Zhengqin Li, Jia Shi, Sai Bi, Rui Zhu, Kalyan Sunkavalli, Miloš Hašan, Zexiang Xu, Ravi Ramamoorthi, Manmohan Chandraker
We tackle this problem using two novel components: 1) a holistic scene reconstruction method that estimates scene reflectance and parametric 3D lighting, and 2) a neural rendering framework that re-renders the scene from our predictions.
no code implementations • 14 Apr 2022 • Ishit Mehta, Manmohan Chandraker, Ravi Ramamoorthi
Our method uses the flow field to deform parametric implicit surfaces by extending the classical theory of level sets.
no code implementations • CVPR 2022 • Dripta S. Raychaudhuri, Yumin Suh, Samuel Schulter, Xiang Yu, Masoud Faraki, Amit K. Roy-Chowdhury, Manmohan Chandraker
In contrast to the existing dynamic multi-task approaches that adjust only the weights within a fixed architecture, our approach affords the flexibility to dynamically control the total computational cost and match the user-preferred task importance better.
1 code implementation • 27 Mar 2022 • Zaid Khan, Vijay Kumar BG, Xiang Yu, Samuel Schulter, Manmohan Chandraker, Yun Fu
Self-supervised vision-language pretraining from pure images and text with a contrastive loss is effective, but ignores fine-grained alignment due to a dual-stream architecture that aligns image and text representations only on a global level.
no code implementations • CVPR 2022 • Christian Simon, Masoud Faraki, Yi-Hsuan Tsai, Xiang Yu, Samuel Schulter, Yumin Suh, Mehrtash Harandi, Manmohan Chandraker
Humans have the ability to accumulate knowledge of new tasks in varying conditions, but deep neural networks often suffer from catastrophic forgetting of previously learned knowledge after learning a new task.
no code implementations • 28 Feb 2022 • Dongwan Kim, Yi-Hsuan Tsai, Yumin Suh, Masoud Faraki, Sparsh Garg, Manmohan Chandraker, Bohyung Han
First, a gradient conflict in training due to mismatched label spaces is identified and a class-independent binary cross-entropy loss is proposed to alleviate such label conflicts.
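A hedged sketch of a class-independent binary cross-entropy with a per-dataset validity mask (the masking scheme here is an illustrative assumption about how unannotated classes are excluded):

```python
import numpy as np

def class_independent_bce(logits, targets, valid_mask):
    """Per-class binary cross-entropy across datasets with mismatched
    label spaces, sketched.

    logits:     (N, C) raw per-class scores (independent sigmoids,
                so classes do not compete as in a softmax).
    targets:    (N, C) 0/1 labels.
    valid_mask: (N, C) 1 where the sample's source dataset actually
                annotates class c; unannotated classes contribute no
                gradient, avoiding label conflicts across datasets.
    """
    p = 1.0 / (1.0 + np.exp(-logits))
    eps = 1e-12
    bce = -(targets * np.log(p + eps) + (1 - targets) * np.log(1 - p + eps))
    return (bce * valid_mask).sum() / max(valid_mask.sum(), 1)
```

Because each class has its own sigmoid, a class missing from one dataset's label space simply gets masked out rather than being pushed toward "background".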
1 code implementation • 19 Nov 2021 • Phoenix X. Huang, Wenze Hu, William Brendel, Manmohan Chandraker, Li-Jia Li, Xiaoyu Wang
This paper introduces an open source platform to support the rapid development of computer vision applications at scale.
no code implementations • CVPR 2022 • Chang Liu, Xiang Yu, Yi-Hsuan Tsai, Ramin Moslemi, Masoud Faraki, Manmohan Chandraker, Yun Fu
Convolutional Neural Networks have achieved remarkable success in face recognition, in part due to the abundant availability of data.
no code implementations • ICCV 2021 • Donghyun Kim, Yi-Hsuan Tsai, Bingbing Zhuang, Xiang Yu, Stan Sclaroff, Kate Saenko, Manmohan Chandraker
Learning transferable and domain adaptive feature representations from videos is important for video-relevant tasks such as action recognition.
no code implementations • CVPR 2021 • Zhengqin Li, Ting-Wei Yu, Shen Sang, Sarah Wang, Meng Song, YuHan Liu, Yu-Ying Yeh, Rui Zhu, Nitesh Gundavarapu, Jia Shi, Sai Bi, Hong-Xing Yu, Zexiang Xu, Kalyan Sunkavalli, Milos Hasan, Ravi Ramamoorthi, Manmohan Chandraker
Finally, we demonstrate that our framework may also be integrated with physics engines, to create virtual robotics environments with unique ground truth such as friction coefficients and correspondence to real scenes.
no code implementations • CVPR 2021 • Sriram Narayanan, Ramin Moslemi, Francesco Pittaluga, Buyu Liu, Manmohan Chandraker
Our second contribution is a novel trajectory prediction framework called ALAN that uses existing lane centerlines as anchors to provide trajectories constrained to the input lanes.
no code implementations • CVPR 2021 • Bingbing Zhuang, Manmohan Chandraker
While we focus on relative pose, we envision that our pipeline is broadly applicable for fusing classical geometry and deep learning.
no code implementations • CVPR 2022 • Buyu Liu, Bingbing Zhuang, Manmohan Chandraker
We propose an end-to-end network that takes a single perspective RGB image of a complex road scene as input, to produce occlusion-reasoned layouts in perspective space as well as a parametric bird's-eye-view (BEV) space.
2 code implementations • ICCV 2021 • Ishit Mehta, Michaël Gharbi, Connelly Barnes, Eli Shechtman, Ravi Ramamoorthi, Manmohan Chandraker
Our approach produces generalizable functional representations of images, videos and shapes, and achieves higher reconstruction quality than prior works that are optimized for a single signal.
1 code implementation • CVPR 2021 • Astuti Sharma, Tarun Kalluri, Manmohan Chandraker
Domain adaptation deals with training models using large scale labeled data from a specific source domain and then adapting the knowledge to certain target domains that have few or no labels.
no code implementations • CVPR 2021 • Masoud Faraki, Xiang Yu, Yi-Hsuan Tsai, Yumin Suh, Manmohan Chandraker
Intuitively, it discriminatively correlates explicit metrics derived from one domain with triplet samples from another domain, in a unified loss function to be minimized within a network, which leads to better alignment of the training domains.
1 code implementation • 15 Dec 2020 • Tarun Kalluri, Deepak Pathak, Manmohan Chandraker, Du Tran
A majority of methods for video frame interpolation compute bidirectional optical flow between adjacent frames of a video, followed by a suitable warping algorithm to generate the output frames.
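The flow-then-warp pipeline this describes can be sketched with a nearest-neighbor backward warp; real interpolators use bilinear sampling, occlusion reasoning, and learned refinement, so this is only a minimal illustration:

```python
import numpy as np

def backward_warp(frame, flow, t=0.5):
    """Synthesize an intermediate frame by sampling `frame` along a
    scaled optical flow (nearest-neighbor backward warping, sketched).

    frame: (H, W) image.
    flow:  (H, W, 2) per-pixel flow as (dy, dx).
    t:     temporal position of the target frame in [0, 1].
    """
    H, W = frame.shape
    ys, xs = np.mgrid[0:H, 0:W]
    # Sample source coordinates x + t * flow, clipped to the image.
    sy = np.clip(np.rint(ys + t * flow[..., 0]), 0, H - 1).astype(int)
    sx = np.clip(np.rint(xs + t * flow[..., 1]), 0, W - 1).astype(int)
    return frame[sy, sx]
```

With zero flow the warp is the identity; with a constant flow the intermediate frame is the input shifted by half the displacement.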
Ranked #2 on Video Frame Interpolation on GoPro
no code implementations • 28 Nov 2020 • Junru Wu, Xiang Yu, Buyu Liu, Zhangyang Wang, Manmohan Chandraker
Face anti-spoofing (FAS) seeks to discriminate genuine faces from fake ones arising from any type of spoofing attack.
no code implementations • 9 Oct 2020 • Yuqing Zhu, Xiang Yu, Yi-Hsuan Tsai, Francesco Pittaluga, Masoud Faraki, Manmohan Chandraker, Yu-Xiang Wang
Differentially Private Federated Learning (DPFL) is an emerging field with many applications.
no code implementations • ECCV 2020 • Xiangyun Zhao, Samuel Schulter, Gaurav Sharma, Yi-Hsuan Tsai, Manmohan Chandraker, Ying Wu
To address this challenge, we design a framework which works with such partial annotations, and we exploit a pseudo labeling approach that we adapt for our specific case.
no code implementations • ECCV 2020 • Sujoy Paul, Yi-Hsuan Tsai, Samuel Schulter, Amit K. Roy-Chowdhury, Manmohan Chandraker
In this work, we propose a novel framework for domain adaptation in semantic segmentation with image-level weak labels in the target domain.
1 code implementation • 29 Jul 2020 • You-Yi Jau, Rui Zhu, Hao Su, Manmohan Chandraker
Estimating relative camera poses from consecutive frames is a fundamental problem in visual odometry (VO) and simultaneous localization and mapping (SLAM), where classic methods consisting of hand-crafted features and sampling-based outlier rejection have been a dominant choice for over a decade.
no code implementations • ECCV 2020 • Sriram N. N, Buyu Liu, Francesco Pittaluga, Manmohan Chandraker
Our second contribution is a novel method that generates diverse predictions while accounting for scene semantics and multi-agent interactions, with constant-time inference independent of the number of agents.
no code implementations • 25 Jul 2020 • Zhengqin Li, Ting-Wei Yu, Shen Sang, Sarah Wang, Meng Song, YuHan Liu, Yu-Ying Yeh, Rui Zhu, Nitesh Gundavarapu, Jia Shi, Sai Bi, Zexiang Xu, Hong-Xing Yu, Kalyan Sunkavalli, Miloš Hašan, Ravi Ramamoorthi, Manmohan Chandraker
Finally, we demonstrate that our framework may also be integrated with physics engines, to create virtual robotics environments with unique ground truth such as friction coefficients and correspondence to real scenes.
1 code implementation • NeurIPS 2020 • Kunal Gupta, Manmohan Chandraker
Applications like rendering, simulations and 3D printing require meshes to be manifold so that they can interact with the world like the real objects they represent.
no code implementations • ECCV 2020 • Yuliang Zou, Pan Ji, Quoc-Huy Tran, Jia-Bin Huang, Manmohan Chandraker
Monocular visual odometry (VO) suffers severely from error accumulation during frame-to-frame pose estimation.
1 code implementation • ECCV 2020 • Rui Zhu, Xingyi Yang, Yannick Hold-Geoffroy, Federico Perazzi, Jonathan Eisenmann, Kalyan Sunkavalli, Manmohan Chandraker
Most 3D reconstruction methods may only recover scene properties up to a global scale ambiguity.
no code implementations • ECCV 2020 • Aruni RoyChowdhury, Xiang Yu, Kihyuk Sohn, Erik Learned-Miller, Manmohan Chandraker
While deep face recognition has benefited significantly from large-scale labeled data, current research is focused on leveraging unlabeled data to further boost performance, reducing the cost of human annotation.
no code implementations • CVPR 2020 • Buyu Liu, Bingbing Zhuang, Samuel Schulter, Pan Ji, Manmohan Chandraker
(2) Introducing the LSTM and FTM modules improves the prediction consistency in videos.
1 code implementation • ECCV 2020 • Lokender Tiwari, Pan Ji, Quoc-Huy Tran, Bingbing Zhuang, Saket Anand, Manmohan Chandraker
Classical monocular Simultaneous Localization And Mapping (SLAM) and the recently emerging convolutional neural networks (CNNs) for monocular depth prediction represent two largely disjoint approaches towards building a 3D map of the surrounding environment.
1 code implementation • CVPR 2020 • Zhengqin Li, Yu-Ying Yeh, Manmohan Chandraker
Recovering the 3D shape of transparent objects using a small number of unconstrained natural images is an ill-posed problem.
no code implementations • CVPR 2020 • Yichun Shi, Xiang Yu, Kihyuk Sohn, Manmohan Chandraker, Anil K. Jain
Recognizing wild faces is extremely hard as they appear with all kinds of variations.
no code implementations • 7 Dec 2019 • Junru Wu, Xiang Yu, Ding Liu, Manmohan Chandraker, Zhangyang Wang
To train and evaluate on more diverse blur severity levels, we propose a Challenging DVD dataset generated from the raw DVD video set by pooling frames with different temporal windows.
no code implementations • 22 Nov 2019 • Taihong Xiao, Yi-Hsuan Tsai, Kihyuk Sohn, Manmohan Chandraker, Ming-Hsuan Yang
For instance, there could be a potential privacy risk of machine learning systems via the model inversion attack, whose goal is to reconstruct the input data from the latent representation of deep networks.
no code implementations • 30 Jul 2019 • Bingbing Zhuang, Quoc-Huy Tran, Pan Ji, Gim Hee Lee, Loong Fah Cheong, Manmohan Chandraker
Self-calibration of camera intrinsics and radial distortion has a long history of research in the computer vision community.
no code implementations • 24 Jul 2019 • Feng-Ju Chang, Xiang Yu, Ram Nevatia, Manmohan Chandraker
We address the challenging problem of generating facial attributes using a single image in an unconstrained pose.
no code implementations • 5 Jun 2019 • Shuyang Dai, Kihyuk Sohn, Yi-Hsuan Tsai, Lawrence Carin, Manmohan Chandraker
We tackle an unsupervised domain adaptation problem for which the domain discrepancy between labeled source and unlabeled target domains is large, due to many factors of inter- and intra-domain variation.
1 code implementation • CVPR 2020 • Zhengqin Li, Mohammad Shafiei, Ravi Ramamoorthi, Kalyan Sunkavalli, Manmohan Chandraker
Our inverse rendering network incorporates physical insights, including a spatially-varying spherical Gaussian lighting representation, a differentiable rendering layer to model scene appearance, a cascade structure to iteratively refine the predictions, and a bilateral solver for refinement, allowing us to jointly reason about shape, lighting, and reflectance.
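A spherical Gaussian lobe evaluates to mu * exp(lambda * (v . xi - 1)); a minimal sketch of querying such a lighting representation (lobe counts and parameters below are illustrative, not the network's):

```python
import numpy as np

def eval_spherical_gaussians(v, axes, sharpness, amplitudes):
    """Evaluate radiance along a direction from a mixture of
    spherical Gaussian (SG) lobes.

    v:          (3,) unit query direction.
    axes:       (K, 3) unit lobe axes xi_k.
    sharpness:  (K,) lobe sharpness lambda_k (larger = narrower lobe).
    amplitudes: (K, 3) RGB amplitudes mu_k.
    Returns (3,) RGB radiance L(v) = sum_k mu_k * exp(lambda_k * (v.xi_k - 1)).
    """
    cos = axes @ v                               # (K,) cosine to each axis
    weights = np.exp(sharpness * (cos - 1.0))    # (K,) lobe falloff
    return weights @ amplitudes                  # (3,) summed RGB
```

Querying along a lobe axis returns its full amplitude, and the response decays smoothly away from the axis, which is what makes SGs convenient for differentiable lighting estimation.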
no code implementations • ICLR 2019 • Yi-Hsuan Tsai, Kihyuk Sohn, Samuel Schulter, Manmohan Chandraker
To this end, we propose to learn discriminative feature representations of patches based on label histograms in the source domain, through the construction of a disentangled space.
no code implementations • ICLR 2019 • Kihyuk Sohn, Wenling Shang, Xiang Yu, Manmohan Chandraker
Unsupervised domain adaptation is a promising avenue to enhance the performance of deep neural networks on a target domain, using labels only from a source domain.
no code implementations • 16 Apr 2019 • Jong-Chyi Su, Yi-Hsuan Tsai, Kihyuk Sohn, Buyu Liu, Subhransu Maji, Manmohan Chandraker
Our approach, active adversarial domain adaptation (AADA), explores a duality between two related problems: adversarial domain alignment and importance sampling for adapting models across domains.
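A hedged sketch of an AADA-style acquisition score, combining a discriminator-derived importance weight with the task model's predictive entropy (the exact weighting in the paper may differ):

```python
import numpy as np

def aada_score(p_source, class_probs):
    """Acquisition score for active domain adaptation, sketched.

    p_source:    domain discriminator's probability that the sample
                 comes from the source domain.
    class_probs: (C,) task-model class posterior for the sample.
    Higher scores go to samples that look target-like (importance
    weight) and on which the model is uncertain (entropy).
    """
    w = (1.0 - p_source) / max(p_source, 1e-12)   # "targetness" weight
    entropy = -np.sum(class_probs * np.log(class_probs + 1e-12))
    return w * entropy
```

Ranking unlabeled target samples by this score and annotating the top ones couples adversarial alignment with importance sampling, as the abstract describes.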
8 code implementations • ICCV 2019 • Yi-Hsuan Tsai, Kihyuk Sohn, Samuel Schulter, Manmohan Chandraker
Predicting structured outputs such as semantic segmentation relies on expensive per-pixel annotations to learn supervised models like convolutional neural networks.
Ranked #22 on Image-to-Image Translation on SYNTHIA-to-Cityscapes
no code implementations • CVPR 2019 • Ziyan Wang, Buyu Liu, Samuel Schulter, Manmohan Chandraker
In this paper, we address the problem of inferring the layout of complex road scenes given a single camera as input.
2 code implementations • 26 Nov 2018 • Girish Varma, Anbumani Subramanian, Anoop Namboodiri, Manmohan Chandraker, C. V. Jawahar
It also reflects label distributions of road scenes significantly different from existing datasets, with most classes displaying greater within-class diversity.
1 code implementation • ICCV 2019 • Tarun Kalluri, Girish Varma, Manmohan Chandraker, C. V. Jawahar
In recent years, the need for semantic segmentation has arisen across several different applications and environments.
Ranked #27 on Semantic Segmentation on DensePASS (using extra training data)
no code implementations • ICLR 2019 • Nataniel Ruiz, Samuel Schulter, Manmohan Chandraker
Simulation is a useful tool in situations where training data for machine learning models is costly to annotate or even hard to acquire.
no code implementations • ECCV 2018 • Zhengqin Li, Kalyan Sunkavalli, Manmohan Chandraker
We propose a material acquisition approach to recover the spatially-varying BRDF and normal map of a near-planar surface from a single image captured by a handheld mobile phone camera.
no code implementations • 28 Mar 2018 • Tuan-Hung Vu, Wongun Choi, Samuel Schulter, Manmohan Chandraker
This paper proposes a novel memory-based online video representation that is efficient, accurate and predictive.
no code implementations • ECCV 2018 • Samuel Schulter, Menghua Zhai, Nathan Jacobs, Manmohan Chandraker
Given a single RGB image of a complex outdoor road scene in the perspective view, we address the novel problem of estimating an occlusion-reasoned semantic scene layout in the top-view.
no code implementations • 23 Mar 2018 • Xi Yin, Xiang Yu, Kihyuk Sohn, Xiaoming Liu, Manmohan Chandraker
In this paper, we propose a center-based feature transfer framework to augment the feature space of under-represented subjects from the regular subjects that have sufficiently diverse samples.
no code implementations • ECCV 2018 • Mohammed E. Fathy, Quoc-Huy Tran, M. Zeeshan Zia, Paul Vernaza, Manmohan Chandraker
Further, we propose to use activation maps at different layers of a CNN, as an effective and principled replacement for the multi-resolution image pyramids often used for matching tasks.
1 code implementation • CVPR 2019 • Luan Tran, Kihyuk Sohn, Xiang Yu, Xiaoming Liu, Manmohan Chandraker
Recent developments in deep domain adaptation have allowed knowledge transfer from a labeled source domain to an unlabeled target domain at the level of intermediate features or input pixels.
12 code implementations • CVPR 2018 • Yi-Hsuan Tsai, Wei-Chih Hung, Samuel Schulter, Kihyuk Sohn, Ming-Hsuan Yang, Manmohan Chandraker
In this paper, we propose an adversarial learning method for domain adaptation in the context of semantic segmentation.
Ranked #3 on Domain Adaptation on Synscapes-to-Cityscapes
no code implementations • CVPR 2017 • Paul Vernaza, Manmohan Chandraker
Large-scale training for semantic segmentation is challenging due to the expense of obtaining training data for this task relative to other vision tasks.
no code implementations • 8 Jan 2018 • Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, Gregory D. Hager, Manmohan Chandraker
In this work, we explore an approach for injecting prior domain structure into neural network training by supervising hidden layers of a CNN with intermediate concepts that normally are not observed in practice.
no code implementations • NeurIPS 2017 • Guobin Chen, Wongun Choi, Xiang Yu, Tony Han, Manmohan Chandraker
In this work, we propose a new framework to learn compact and fast object detection networks with improved accuracy using knowledge distillation [20] and hint learning [34].
no code implementations • ICCV 2017 • Kihyuk Sohn, Sifei Liu, Guangyu Zhong, Xiang Yu, Ming-Hsuan Yang, Manmohan Chandraker
Despite rapid advances in face recognition, there remains a clear gap between the performance of still image-based face recognition and video-based face recognition, due to the vast difference in visual quality between the domains and the difficulty of curating diverse large-scale video datasets.
no code implementations • CVPR 2017 • Zhengqin Li, Zexiang Xu, Ravi Ramamoorthi, Manmohan Chandraker
On the other hand, recent works have explored PDE invariants for shape recovery with complex BRDFs, but they have not been incorporated into robust numerical optimization frameworks.
no code implementations • CVPR 2017 • Samuel Schulter, Paul Vernaza, Wongun Choi, Manmohan Chandraker
In this work, we demonstrate that it is possible to learn features for network-flow-based data association via backpropagation, by expressing the optimum of a smoothed network flow problem as a differentiable function of the pairwise association costs.
2 code implementations • 31 May 2017 • JunYoung Gwak, Christopher B. Choy, Animesh Garg, Manmohan Chandraker, Silvio Savarese
Supervised 3D reconstruction has witnessed a significant progress through the use of deep neural networks.
no code implementations • ICCV 2017 • Xi Yin, Xiang Yu, Kihyuk Sohn, Xiaoming Liu, Manmohan Chandraker
Despite recent advances in face recognition using deep learning, severe accuracy drops are observed for large pose variations in unconstrained environments.
3 code implementations • CVPR 2017 • Namhoon Lee, Wongun Choi, Paul Vernaza, Christopher B. Choy, Philip H. S. Torr, Manmohan Chandraker
DESIRE effectively predicts future locations of objects in multiple scenes by 1) accounting for the multi-modal nature of the future prediction (i.e., given the same context, the future may vary), 2) foreseeing the potential future outcomes and making a strategic prediction based on them, and 3) reasoning not only from the past motion history, but also from the scene context as well as the interactions among the agents.
Ranked #1 on Trajectory Prediction on PAID
no code implementations • ICCV 2017 • Xi Peng, Xiang Yu, Kihyuk Sohn, Dimitris Metaxas, Manmohan Chandraker
Finally, we propose a new feature reconstruction metric learning to explicitly disentangle identity and pose, by demanding alignment between feature reconstructions through various combinations of identity and pose features obtained from two images of the same subject.
no code implementations • CVPR 2017 • Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, Gregory D. Hager, Manmohan Chandraker
Monocular 3D object parsing is highly desirable in various scenarios including occlusion reasoning and holistic scene interpretation.
no code implementations • 24 Aug 2016 • Ting-Chun Wang, Jun-Yan Zhu, Ebi Hiroaki, Manmohan Chandraker, Alexei A. Efros, Ravi Ramamoorthi
We introduce a new light-field dataset of materials, and take advantage of the recent success of deep learning to perform material recognition on the 4D light-field.
no code implementations • NeurIPS 2016 • Christopher B. Choy, JunYoung Gwak, Silvio Savarese, Manmohan Chandraker
We present a deep learning framework for accurate visual correspondences and demonstrate its effectiveness for both geometric and semantic matching, spanning across rigid motions to intra-class shape or appearance variations.
no code implementations • CVPR 2016 • Ting-Chun Wang, Manmohan Chandraker, Alexei A. Efros, Ravi Ramamoorthi
Light-field cameras have recently emerged as a powerful tool for one-shot passive 3D shape capture.
no code implementations • CVPR 2016 • Vikas Dhiman, Quoc-Huy Tran, Jason J. Corso, Manmohan Chandraker
We present a physically interpretable, continuous 3D model for handling occlusions with applications to road scene understanding.
no code implementations • 3 May 2016 • Xiang Yu, Feng Zhou, Manmohan Chandraker
We propose a novel cascaded framework, namely deep deformation network (DDN), for localizing landmarks in non-rigid objects.
no code implementations • CVPR 2016 • Angjoo Kanazawa, David W. Jacobs, Manmohan Chandraker
This is in contrast to prior works that require part annotations, since matching objects across class and pose variations is challenging with appearance features alone.
no code implementations • CVPR 2017 • Liang Zheng, Hengheng Zhang, Shaoyan Sun, Manmohan Chandraker, Yi Yang, Qi Tian
Our baselines address three issues: the performance of various combinations of detectors and recognizers, mechanisms by which pedestrian detection helps improve overall re-identification accuracy, and the effectiveness of different detectors for re-identification.
no code implementations • CVPR 2015 • Shiyu Song, Manmohan Chandraker
Experiments on the KITTI dataset show the efficacy of our cues, as well as the accuracy and robustness of our 3D object localization relative to ground truth and prior works.
no code implementations • CVPR 2014 • Shiyu Song, Manmohan Chandraker
Experiments on the KITTI dataset demonstrate the accuracy of our ground plane estimation, monocular SFM and object localization relative to ground truth, with detailed comparisons to prior art.
no code implementations • CVPR 2014 • Manmohan Chandraker
For the perspective case, we show that three differential motions suffice to yield surface depth for unknown isotropic BRDF and unknown directional lighting, while additional constraints are obtained with restrictions on BRDF or lighting.
no code implementations • CVPR 2013 • Sid Yingze Bao, Manmohan Chandraker, Yuanqing Lin, Silvio Savarese
Given multiple images of an unseen instance, we collate information from 2D object detectors to align the structure from motion point cloud with the mean shape, which is subsequently warped and refined to approach the actual shape.
no code implementations • CVPR 2013 • Manmohan Chandraker, Dikpal Reddy, Yizhou Wang, Ravi Ramamoorthi
Under orthographic projection, we prove that three differential motions suffice to yield an invariant that relates shape to image derivatives, regardless of BRDF and illumination.