Search Results for author: Shenlong Wang

Found 56 papers, 10 papers with code

Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video

no code implementations • 15 Apr 2024 • Hongchi Xia, Zhi-Hao Lin, Wei-Chiu Ma, Shenlong Wang

Creating high-quality and interactive virtual environments, such as games and simulators, often involves complex and costly manual modeling processes.

Paper
Add Code

MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance

no code implementations • 12 Apr 2024 • Yuqun Wu, Jae Yong Lee, Chuhang Zou, Shenlong Wang, Derek Hoiem

Our experiments show 4x the performance of RegNeRF and 8x that of FreeNeRF on average F1@2cm for ETH3D MVS benchmark, suggesting a fruitful research direction to improve the geometric accuracy of NeRF-based models, and sheds light on a potential future approach to enable NeRF-based optimization to eventually outperform traditional MVS.

Novel View Synthesis SSIM

Paper
Add Code

GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh

no code implementations • 11 Apr 2024 • Jing Wen, Xiaoming Zhao, Zhongzheng Ren, Alexander G. Schwing, Shenlong Wang

We introduce GoMAvatar, a novel approach for real-time, memory-efficient, high-quality animatable human modeling.

Computational Efficiency

Paper
Add Code

Physical Property Understanding from Language-Embedded Feature Fields

no code implementations • 5 Apr 2024 • Albert J. Zhai, Yuan Shen, Emily Y. Chen, Gloria X. Wang, Xinlei Wang, Sheng Wang, Kaiyu Guan, Shenlong Wang

Can computers perceive the physical properties of objects solely through vision?

Friction

Paper
Add Code

LidarDM: Generative LiDAR Simulation in a Generated World

1 code implementation • 3 Apr 2024 • Vlas Zyrianov, Henry Che, Zhijian Liu, Shenlong Wang

We present LidarDM, a novel LiDAR generative model capable of producing realistic, layout-aware, physically plausible, and temporally coherent LiDAR videos.

Autonomous Driving Point Cloud Generation

Paper
Code

RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation

1 code implementation • 23 Feb 2024 • Hanxiao Jiang, Binghao Huang, Ruihai Wu, Zhuoran Li, Shubham Garg, Hooshang Nayyeri, Shenlong Wang, Yunzhu Li

Robots need to explore their surroundings to adapt to and tackle tasks in unknown environments.

Paper
Code

IRIS: Inverse Rendering of Indoor Scenes from Low Dynamic Range Images

no code implementations • 23 Jan 2024 • Zhi-Hao Lin, Jia-Bin Huang, Zhengqin Li, Zhao Dong, Christian Richardt, Tuotuo Li, Michael Zollhöfer, Johannes Kopf, Shenlong Wang, Changil Kim

While numerous 3D reconstruction and novel-view synthesis methods allow for photorealistic rendering of a scene from multi-view images easily captured with consumer cameras, they bake illumination in their representations and fall short of supporting advanced applications like material editing, relighting, and virtual object insertion.

3D Reconstruction Inverse Rendering +1

Paper
Add Code

Objects With Lighting: A Real-World Dataset for Evaluating Reconstruction and Rendering for Object Relighting

1 code implementation • 17 Jan 2024 • Benjamin Ummenhofer, Sanskar Agrawal, Rene Sepulveda, Yixing Lao, Kai Zhang, Tianhang Cheng, Stephan Richter, Shenlong Wang, German Ros

Reconstructing an object from photos and placing it virtually in a new environment goes beyond the standard novel view synthesis task as the appearance of the object has to not only adapt to the novel viewpoint but also to the new lighting conditions and yet evaluations of inverse rendering methods rely on novel view synthesis data or simplistic synthetic datasets for quantitative analysis.

Inverse Rendering Novel View Synthesis

Paper
Code

Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects

1 code implementation • NeurIPS 2023 • Tianhang Cheng, Wei-Chiu Ma, Kaiyu Guan, Antonio Torralba, Shenlong Wang

Our world is full of identical objects (\emphe. g., cans of coke, cars of same model).

Image Reconstruction Object +1

Paper
Code

On the Overconfidence Problem in Semantic 3D Mapping

no code implementations • 16 Nov 2023 • Joao Marcos Correia Marques, Albert Zhai, Shenlong Wang, Kris Hauser

Semantic 3D mapping, the process of fusing depth and image segmentation information between multiple views to build 3D maps annotated with object classes in real-time, is a recent topic of interest.

Image Segmentation Semantic Segmentation

Paper
Add Code

ContactGen: Generative Contact Modeling for Grasp Generation

no code implementations • ICCV 2023 • Shaowei Liu, Yang Zhou, Jimei Yang, Saurabh Gupta, Shenlong Wang

This paper presents a novel object-centric contact representation ContactGen for hand-object interaction.

Grasp Generation Object

Paper
Add Code

MapPrior: Bird's-Eye View Map Layout Estimation with Generative Models

no code implementations • ICCV 2023 • Xiyue Zhu, Vlas Zyrianov, Zhijian Liu, Shenlong Wang

Despite tremendous advancements in bird's-eye view (BEV) perception, existing models fall short in generating realistic and coherent semantic map layouts, and they fail to account for uncertainties arising from partial sensor information (such as occlusion or limited coverage).

Paper
Add Code

UrbanIR: Large-Scale Urban Scene Inverse Rendering from a Single Video

no code implementations • 15 Jun 2023 • Zhi-Hao Lin, Bohan Liu, Yi-Ting Chen, David Forsyth, Jia-Bin Huang, Anand Bhattad, Shenlong Wang

UrbanIR uses a novel loss to make very good estimates of shadow volumes in the original scene.

Inverse Rendering

Paper
Add Code

Building Rearticulable Models for Arbitrary 3D Objects from 4D Point Clouds

no code implementations • CVPR 2023 • Shaowei Liu, Saurabh Gupta, Shenlong Wang

We build rearticulable models for arbitrary everyday man-made objects containing an arbitrary number of parts that are connected together in arbitrary ways via 1 degree-of-freedom joints.

Paper
Add Code

PEANUT: Predicting and Navigating to Unseen Targets

no code implementations • ICCV 2023 • Albert J. Zhai, Shenlong Wang

In this work, we present a straightforward method for learning these regularities by predicting the locations of unobserved objects from incomplete semantic maps.

Paper
Add Code

QFF: Quantized Fourier Features for Neural Field Representations

no code implementations • 2 Dec 2022 • Jae Yong Lee, Yuqun Wu, Chuhang Zou, Shenlong Wang, Derek Hoiem

Instead, we propose to encode features in bins of Fourier features that are commonly used for positional encoding.

Paper
Add Code

ClimateNeRF: Extreme Weather Synthesis in Neural Radiance Field

no code implementations • ICCV 2023 • Yuan Li, Zhi-Hao Lin, David Forsyth, Jia-Bin Huang, Shenlong Wang

Physical simulations produce excellent predictions of weather effects.

Neural Rendering Physical Simulations

Paper
Add Code

CASA: Category-agnostic Skeletal Animal Reconstruction

no code implementations • 4 Nov 2022 • Yuefan Wu, Zeyuan Chen, Shaowei Liu, Zhongzheng Ren, Shenlong Wang

Recovering the skeletal shape of an animal from a monocular video is a longstanding challenge.

Retrieval

Paper
Add Code

Learning to Generate Realistic LiDAR Point Clouds

1 code implementation • 8 Sep 2022 • Vlas Zyrianov, Xiyue Zhu, Shenlong Wang

We present LiDARGen, a novel, effective, and controllable generative model that produces realistic LiDAR point cloud sensory readings.

Denoising Point Cloud Generation

118

Paper
Code

Virtual Correspondence: Humans as a Cue for Extreme-View Geometry

no code implementations • CVPR 2022 • Wei-Chiu Ma, Anqi Joyce Yang, Shenlong Wang, Raquel Urtasun, Antonio Torralba

Similar to classic correspondences, VCs conform with epipolar geometry; unlike classic correspondences, VCs do not need to be co-visible across views.

3D Reconstruction Novel View Synthesis +1

Paper
Add Code

NeurMiPs: Neural Mixture of Planar Experts for View Synthesis

1 code implementation • CVPR 2022 • Zhi-Hao Lin, Wei-Chiu Ma, Hao-Yu Hsu, Yu-Chiang Frank Wang, Shenlong Wang

We present Neural Mixtures of Planar Experts (NeurMiPs), a novel planar-based scene representation for modeling geometry and appearance.

Novel View Synthesis

113

Paper
Code

Deep Feedback Inverse Problem Solver

no code implementations • ECCV 2020 • Wei-Chiu Ma, Shenlong Wang, Jiayuan Gu, Sivabalan Manivasagam, Antonio Torralba, Raquel Urtasun

Specifically, at each iteration, the neural network takes the feedback as input and outputs an update on the current estimation.

Pose Estimation

Paper
Add Code

Mending Neural Implicit Modeling for 3D Vehicle Reconstruction in the Wild

no code implementations • 18 Jan 2021 • Shivam Duggal, ZiHao Wang, Wei-Chiu Ma, Sivabalan Manivasagam, Justin Liang, Shenlong Wang, Raquel Urtasun

Reconstructing high-quality 3D objects from sparse, partial observations from a single view is of crucial importance for various applications in computer vision, robotics, and graphics.

3D Object Reconstruction

Paper
Add Code

Non-parametric Memory for Spatio-Temporal Segmentation of Construction Zones for Self-Driving

no code implementations • 18 Jan 2021 • Min Bai, Shenlong Wang, Kelvin Wong, Ersin Yumer, Raquel Urtasun

In this paper, we introduce a non-parametric memory representation for spatio-temporal segmentation that captures the local space and time around an autonomous vehicle (AV).

Paper
Add Code

Deep Parametric Continuous Convolutional Neural Networks

no code implementations • CVPR 2018 • Shenlong Wang, Simon Suo, Wei-Chiu Ma, Andrei Pokrovsky, Raquel Urtasun

Standard convolutional neural networks assume a grid structured input is available and exploit discrete convolutions as their fundamental building blocks.

Ranked #2 on Semantic Segmentation on S3DIS Area5 (Number of params metric)

Motion Estimation Point Cloud Segmentation +1

Paper
Add Code

S3: Neural Shape, Skeleton, and Skinning Fields for 3D Human Modeling

no code implementations • CVPR 2021 • Ze Yang, Shenlong Wang, Sivabalan Manivasagam, Zeng Huang, Wei-Chiu Ma, Xinchen Yan, Ersin Yumer, Raquel Urtasun

Constructing and animating humans is an important component for building virtual worlds in a wide variety of applications such as virtual reality or robotics testing in simulation.

Paper
Add Code

Asynchronous Multi-View SLAM

no code implementations • 17 Jan 2021 • Anqi Joyce Yang, Can Cui, Ioan Andrei Bârsan, Raquel Urtasun, Shenlong Wang

Existing multi-camera SLAM systems assume synchronized shutters for all cameras, which is often not the case in practice.

Sensor Modeling

Paper
Add Code

GeoSim: Realistic Video Simulation via Geometry-Aware Composition for Self-Driving

no code implementations • CVPR 2021 • Yun Chen, Frieda Rong, Shivam Duggal, Shenlong Wang, Xinchen Yan, Sivabalan Manivasagam, Shangjie Xue, Ersin Yumer, Raquel Urtasun

Scalable sensor simulation is an important yet challenging open problem for safety-critical domains such as self-driving.

Data Augmentation Synthetic Data Generation

Paper
Add Code

SceneGen: Learning to Generate Realistic Traffic Scenes

no code implementations • CVPR 2021 • Shuhan Tan, Kelvin Wong, Shenlong Wang, Sivabalan Manivasagam, Mengye Ren, Raquel Urtasun

Existing methods typically insert actors into the scene according to a set of hand-crafted heuristics and are limited in their ability to model the true complexity and diversity of real traffic scenes, thus inducing a content gap between synthesized traffic scenes versus real ones.

Paper
Add Code

Pit30M: A Benchmark for Global Localization in the Age of Self-Driving Cars

no code implementations • 23 Dec 2020 • Julieta Martinez, Sasha Doubov, Jack Fan, Ioan Andrei Bârsan, Shenlong Wang, Gellért Máttyus, Raquel Urtasun

We are interested in understanding whether retrieval-based localization approaches are good enough in the context of self-driving vehicles.

LIDAR Semantic Segmentation Retrieval +2

Paper
Add Code

Convolutional Recurrent Network for Road Boundary Extraction

no code implementations • CVPR 2019 • Justin Liang, Namdar Homayounfar, Wei-Chiu Ma, Shenlong Wang, Raquel Urtasun

Creating high definition maps that contain precise information of static elements of the scene is of utmost importance for enabling self driving cars to drive safely.

Self-Driving Cars

Paper
Add Code

Deep Continuous Fusion for Multi-Sensor 3D Object Detection

no code implementations • ECCV 2018 • Ming Liang, Bin Yang, Shenlong Wang, Raquel Urtasun

In this paper, we propose a novel 3D object detector that can exploit both LIDAR as well as cameras to perform very accurate localization.

3D Object Detection Object +1

Paper
Add Code

Learning to Localize Using a LiDAR Intensity Map

no code implementations • 20 Dec 2020 • Ioan Andrei Bârsan, Shenlong Wang, Andrei Pokrovsky, Raquel Urtasun

In this paper we propose a real-time, calibration-agnostic and effective localization system for self-driving cars.

Self-Driving Cars

Paper
Add Code

Learning to Localize Through Compressed Binary Maps

no code implementations • CVPR 2019 • Xinkai Wei, Ioan Andrei Bârsan, Shenlong Wang, Julieta Martinez, Raquel Urtasun

One of the main difficulties of scaling current localization systems to large environments is the on-board storage required for the maps.

Paper
Add Code

MuSCLE: Multi Sweep Compression of LiDAR using Deep Entropy Models

no code implementations • NeurIPS 2020 • Sourav Biswas, Jerry Liu, Kelvin Wong, Shenlong Wang, Raquel Urtasun

Our model exploits spatio-temporal relationships across multiple LiDAR sweeps to reduce the bitrate of both geometry and intensity values.

Paper
Add Code

Conditional Entropy Coding for Efficient Video Compression

no code implementations • ECCV 2020 • Jerry Liu, Shenlong Wang, Wei-Chiu Ma, Meet Shah, Rui Hu, Pranaab Dhawan, Raquel Urtasun

We propose a very simple and efficient video compression framework that only focuses on modeling the conditional entropy between frames.

MS-SSIM SSIM +1

Paper
Add Code

DSDNet: Deep Structured self-Driving Network

no code implementations • ECCV 2020 • Wenyuan Zeng, Shenlong Wang, Renjie Liao, Yun Chen, Bin Yang, Raquel Urtasun

In this paper, we propose the Deep Structured self-Driving Network (DSDNet), which performs object detection, motion prediction, and motion planning with a single neural network.

Motion Planning motion prediction +2

Paper
Add Code

LiDARsim: Realistic LiDAR Simulation by Leveraging the Real World

no code implementations • CVPR 2020 • Sivabalan Manivasagam, Shenlong Wang, Kelvin Wong, Wenyuan Zeng, Mikita Sazanovich, Shuhan Tan, Bin Yang, Wei-Chiu Ma, Raquel Urtasun

We first utilize ray casting over the 3D scene and then use a deep neural network to produce deviations from the physics-based simulation, producing realistic LiDAR point clouds.

Paper
Add Code

OctSqueeze: Octree-Structured Entropy Model for LiDAR Compression

6 code implementations • CVPR 2020 • Lila Huang, Shenlong Wang, Kelvin Wong, Jerry Liu, Raquel Urtasun

We present a novel deep compression algorithm to reduce the memory footprint of LiDAR point clouds.

Self-Driving Cars

219

Paper
Code

Identifying Unknown Instances for Autonomous Driving

no code implementations • 24 Oct 2019 • Kelvin Wong, Shenlong Wang, Mengye Ren, Ming Liang, Raquel Urtasun

In the past few years, we have seen great progress in perception algorithms, particular through the use of deep learning.

Autonomous Driving Instance Segmentation +1

Paper
Add Code

Efficient Graph Generation with Graph Recurrent Attention Networks

2 code implementations • NeurIPS 2019 • Renjie Liao, Yujia Li, Yang Song, Shenlong Wang, Charlie Nash, William L. Hamilton, David Duvenaud, Raquel Urtasun, Richard S. Zemel

Our model generates graphs one block of nodes and associated edges at a time.

Graph Generation

451

Paper
Code

DeepPruner: Learning Efficient Stereo Matching via Differentiable PatchMatch

1 code implementation • ICCV 2019 • Shivam Duggal, Shenlong Wang, Wei-Chiu Ma, Rui Hu, Raquel Urtasun

Our goal is to significantly speed up the runtime of current state-of-the-art stereo algorithms to enable real-time inference.

Stereo Matching Stereo Matching Hand

345

Paper
Code

DSIC: Deep Stereo Image Compression

1 code implementation • ICCV 2019 • Jerry Liu, Shenlong Wang, Raquel Urtasun

In this paper we tackle the problem of stereo image compression, and leverage the fact that the two images have overlapping fields of view to further compress the representations.

Image Compression

Paper
Code

Exploiting Sparse Semantic HD Maps for Self-Driving Vehicle Localization

no code implementations • 8 Aug 2019 • Wei-Chiu Ma, Ignacio Tartavull, Ioan Andrei Bârsan, Shenlong Wang, Min Bai, Gellert Mattyus, Namdar Homayounfar, Shrinidhi Kowshika Lakshmikanth, Andrei Pokrovsky, Raquel Urtasun

In this paper we propose a novel semantic localization algorithm that exploits multiple sensors and has precision on the order of a few centimeters.

Self-Driving Cars

Paper
Add Code

Deep Multi-Sensor Lane Detection

no code implementations • 4 May 2019 • Min Bai, Gellert Mattyus, Namdar Homayounfar, Shenlong Wang, Shrinidhi Kowshika Lakshmikanth, Raquel Urtasun

Reliable and accurate lane detection has been a long-standing problem in the field of autonomous driving.

Autonomous Driving Lane Detection +1

Paper
Add Code

Deep Rigid Instance Scene Flow

no code implementations • CVPR 2019 • Wei-Chiu Ma, Shenlong Wang, Rui Hu, Yuwen Xiong, Raquel Urtasun

In this paper we tackle the problem of scene flow estimation in the context of self-driving.

Rolling Shutter Correction Scene Flow Estimation

Paper
Add Code

Proximal Deep Structured Models

no code implementations • NeurIPS 2016 • Shenlong Wang, Sanja Fidler, Raquel Urtasun

Many problems in real-world applications involve predicting continuous-valued random variables that are statistically related.

Image Denoising Optical Flow Estimation

Paper
Add Code

TorontoCity: Seeing the World with a Million Eyes

no code implementations • ICCV 2017 • Shenlong Wang, Min Bai, Gellert Mattyus, Hang Chu, Wenjie Luo, Bin Yang, Justin Liang, Joel Cheverie, Sanja Fidler, Raquel Urtasun

In this paper we introduce the TorontoCity benchmark, which covers the full greater Toronto area (GTA) with 712. 5 $km^2$ of land, 8439 $km$ of road and around 400, 000 buildings.

Instance Segmentation Semantic Segmentation

Paper
Add Code

AutoScaler: Scale-Attention Networks for Visual Correspondence

no code implementations • 17 Nov 2016 • Shenlong Wang, Linjie Luo, Ning Zhang, Jia Li

We propose AutoScaler, a scale-attention network to explicitly optimize this trade-off in visual correspondence tasks.

Optical Flow Estimation

Paper
Add Code

Find your Way by Observing the Sun and Other Semantic Cues

no code implementations • 23 Jun 2016 • Wei-Chiu Ma, Shenlong Wang, Marcus A. Brubaker, Sanja Fidler, Raquel Urtasun

In this paper we present a robust, efficient and affordable approach to self-localization which does not require neither GPS nor knowledge about the appearance of the world.

Paper
Add Code

The Global Patch Collider

no code implementations • CVPR 2016 • Shenlong Wang, Sean Ryan Fanello, Christoph Rhemann, Shahram Izadi, Pushmeet Kohli

In contrast to conventional approaches that rely on pairwise distance computation, our algorithm isolates distinctive pixel pairs that hit the same leaf during traversal through multiple learned tree structures.

Optical Flow Estimation Stereo Matching +1

Paper
Add Code

HD Maps: Fine-Grained Road Segmentation by Parsing Ground and Aerial Images

no code implementations • CVPR 2016 • Gellert Mattyus, Shenlong Wang, Sanja Fidler, Raquel Urtasun

In this paper we present an approach to enhance existing maps with fine grained segmentation categories such as parking spots and sidewalk, as well as the number and location of road lanes.

Road Segmentation

Paper
Add Code

Enhancing Road Maps by Parsing Aerial Images Around the World

no code implementations • ICCV 2015 • Gellert Mattyus, Shenlong Wang, Sanja Fidler, Raquel Urtasun

In recent years, contextual models that exploit maps have been shown to be very effective for many recognition and localization tasks.

Semantic Segmentation

Paper
Add Code

Lost Shopping! Monocular Localization in Large Indoor Spaces

no code implementations • ICCV 2015 • Shenlong Wang, Sanja Fidler, Raquel Urtasun

In this paper we propose a novel approach to localization in very large indoor spaces (i. e., 200+ store shopping malls) that takes a single image and a floor plan of the environment as input.

Text Detection Translation

Paper
Add Code

Holistic 3D Scene Understanding From a Single Geo-Tagged Image

no code implementations • CVPR 2015 • Shenlong Wang, Sanja Fidler, Raquel Urtasun

In this paper we are interested in exploiting geographic priors to help outdoor scene understanding.

3D Object Detection object-detection +3

Paper
Add Code

Efficient Inference of Continuous Markov Random Fields with Polynomial Potentials

no code implementations • NeurIPS 2014 • Shenlong Wang, Alex Schwing, Raquel Urtasun

In this paper, we prove that every multivariate polynomial with even degree can be decomposed into a sum of convex and concave polynomials.

3D Reconstruction Image Denoising

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.