Search Results for author: Fuyang Zhang

Found 5 papers, 3 papers with code

SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model

no code implementations • 19 Mar 2024 • Armen Avetisyan, Christopher Xie, Henry Howard-Jenkins, Tsun-Yi Yang, Samir Aroudj, Suvam Patra, Fuyang Zhang, Duncan Frost, Luke Holland, Campbell Orme, Jakob Engel, Edward Miller, Richard Newcombe, Vasileios Balntas

We introduce SceneScript, a method that directly produces full scene models as a sequence of structured language commands using an autoregressive, token-based approach.

3D Object Detection Decoder +2

Paper
Add Code

MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction

no code implementations • 20 Feb 2024 • Shitao Tang, Jiacheng Chen, Dilin Wang, Chengzhou Tang, Fuyang Zhang, Yuchen Fan, Vikas Chandra, Yasutaka Furukawa, Rakesh Ranjan

MVDiffusion++ achieves superior flexibility and scalability with two surprisingly simple ideas: 1) A ``pose-free architecture'' where standard self-attention among 2D latent features learns 3D consistency across an arbitrary number of conditional and generation views without explicitly using camera pose information; and 2) A ``view dropout strategy'' that discards a substantial number of output views during training, which reduces the training-time memory footprint and enables dense and high-resolution view synthesis at test time.

3D Object Reconstruction 3D Reconstruction +2

Paper
Add Code

MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion

1 code implementation • NeurIPS 2023 • Shitao Tang, Fuyang Zhang, Jiacheng Chen, Peng Wang, Yasutaka Furukawa

This paper introduces MVDiffusion, a simple yet effective method for generating consistent multi-view images from text prompts given pixel-to-pixel correspondences (e. g., perspective crops from a panorama or multi-view images given depth maps and poses).

Image Generation

434

Paper
Code

Structured Outdoor Architecture Reconstruction by Exploration and Classification

1 code implementation • ICCV 2021 • Fuyang Zhang, Xiang Xu, Nelson Nauata, Yasutaka Furukawa

This paper presents an explore-and-classify framework for structured architectural reconstruction from an aerial image.

Classification

Paper
Code

Conv-MPN: Convolutional Message Passing Neural Network for Structured Outdoor Architecture Reconstruction

1 code implementation • CVPR 2020 • Fuyang Zhang, Nelson Nauata, Yasutaka Furukawa

In our problem, nodes correspond to building edges in an image.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.