Search Results for author: Wenzhao Zheng

Found 29 papers, 24 papers with code

Structural Deep Metric Learning for Room Layout Estimation

no code implementations ECCV 2020 Wenzhao Zheng, Jiwen Lu, Jie zhou

We employ a metric model and a layout encoder to map the RGB images and the ground-truth layouts to the embedding space, respectively, and a layout decoder to map the embeddings to the corresponding layouts, where the whole framework is trained in an end-to-end manner.

Metric Learning Room Layout Estimation

GenAD: Generative End-to-End Autonomous Driving

1 code implementation18 Feb 2024 Wenzhao Zheng, Ruiqi Song, Xianda Guo, Chenming Zhang, Long Chen

We then employ a variational autoencoder to learn the future trajectory distribution in a structural latent space for trajectory prior modeling.

Autonomous Driving motion prediction

Path Choice Matters for Clear Attribution in Path Methods

1 code implementation19 Jan 2024 Borui Zhang, Wenzhao Zheng, Jie zhou, Jiwen Lu

Rigorousness and clarity are both essential for interpretations of DNNs to engender human trust.

OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving

1 code implementation27 Nov 2023 Wenzhao Zheng, Weiliang Chen, Yuanhui Huang, Borui Zhang, Yueqi Duan, Jiwen Lu

In this paper, we explore a new framework of learning a world model, OccWorld, in the 3D Occupancy space to simultaneously predict the movement of the ego car and the evolution of the surrounding scenes.

Autonomous Driving

SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction

1 code implementation21 Nov 2023 Yuanhui Huang, Wenzhao Zheng, Borui Zhang, Jie zhou, Jiwen Lu

Our SelfOcc outperforms the previous best method SceneRF by 58. 7% using a single frame as input on SemanticKITTI and is the first self-supervised work that produces reasonable 3D occupancy for surround cameras on nuScenes.

Autonomous Driving Monocular Depth Estimation

Exploring Unified Perspective For Fast Shapley Value Estimation

1 code implementation2 Nov 2023 Borui Zhang, Baotong Tian, Wenzhao Zheng, Jie zhou, Jiwen Lu

Shapley values have emerged as a widely accepted and trustworthy tool, grounded in theoretical axioms, for addressing challenges posed by black-box models like deep neural networks.

Introspective Deep Metric Learning

2 code implementations11 Sep 2023 Chengkun Wang, Wenzhao Zheng, Zheng Zhu, Jie zhou, Jiwen Lu

This paper proposes an introspective deep metric learning (IDML) framework for uncertainty-aware comparisons of images.

Image Retrieval Metric Learning

PointOcc: Cylindrical Tri-Perspective View for Point-based 3D Semantic Occupancy Prediction

1 code implementation31 Aug 2023 Sicheng Zuo, Wenzhao Zheng, Yuanhui Huang, Jie zhou, Jiwen Lu

To address this, we propose a cylindrical tri-perspective view to represent point clouds effectively and comprehensively and a PointOcc model to process them efficiently.

3D Semantic Occupancy Prediction Autonomous Driving +2

Human-M3: A Multi-view Multi-modal Dataset for 3D Human Pose Estimation in Outdoor Scenes

1 code implementation1 Aug 2023 Bohao Fan, Siqi Wang, Wenxuan Guo, Wenzhao Zheng, Jianjiang Feng, Jie zhou

In this article, we propose Human-M3, an outdoor multi-modal multi-view multi-person human pose database which includes not only multi-view RGB videos of outdoor scenes but also corresponding pointclouds.

3D Human Pose Estimation

SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving

2 code implementations ICCV 2023 Yi Wei, Linqing Zhao, Wenzhao Zheng, Zheng Zhu, Jie zhou, Jiwen Lu

Towards a more comprehensive perception of a 3D scene, in this paper, we propose a SurroundOcc method to predict the 3D occupancy with multi-camera images.

3D Object Detection Autonomous Driving +2

Deep Factorized Metric Learning

1 code implementation CVPR 2023 Chengkun Wang, Wenzhao Zheng, Junlong Li, Jie zhou, Jiwen Lu

Learning a generalizable and comprehensive similarity metric to depict the semantic discrepancies between images is the foundation of many computer vision tasks.

Image Classification Metric Learning

Bort: Towards Explainable Neural Networks with Bounded Orthogonal Constraint

1 code implementation18 Dec 2022 Borui Zhang, Wenzhao Zheng, Jie zhou, Jiwen Lu

Deep learning has revolutionized human society, yet the black-box nature of deep neural networks hinders further application to reliability-demanded industries.

Probabilistic Deep Metric Learning for Hyperspectral Image Classification

1 code implementation15 Nov 2022 Chengkun Wang, Wenzhao Zheng, Xian Sun, Jiwen Lu, Jie zhou

We propose to learn a global probabilistic distribution for each pixel in the patch and a probabilistic metric to model the distance between distributions.

Classification Hyperspectral Image Classification +1

Token-Label Alignment for Vision Transformers

1 code implementation ICCV 2023 Han Xiao, Wenzhao Zheng, Zheng Zhu, Jie zhou, Jiwen Lu

Data mixing strategies (e. g., CutMix) have shown the ability to greatly improve the performance of convolutional neural networks (CNNs).

Image Classification Semantic Segmentation +1

OPERA: Omni-Supervised Representation Learning with Hierarchical Supervisions

1 code implementation ICCV 2023 Chengkun Wang, Wenzhao Zheng, Zheng Zhu, Jie zhou, Jiwen Lu

The pretrain-finetune paradigm in modern computer vision facilitates the success of self-supervised learning, which tends to achieve better transferability than supervised learning.

Image Classification object-detection +3

A Simple Baseline for Multi-Camera 3D Object Detection

1 code implementation22 Aug 2022 Yunpeng Zhang, Wenzhao Zheng, Zheng Zhu, Guan Huang, Jie zhou, Jiwen Lu

First, we extract multi-scale features and generate the perspective object proposals on each monocular image.

Autonomous Driving Monocular 3D Object Detection +2

Introspective Deep Metric Learning for Image Retrieval

2 code implementations9 May 2022 Wenzhao Zheng, Chengkun Wang, Jie zhou, Jiwen Lu

This paper proposes an introspective deep metric learning (IDML) framework for uncertainty-aware comparisons of images.

Image Classification Image Retrieval +2

SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation

1 code implementation7 Apr 2022 Yi Wei, Linqing Zhao, Wenzhao Zheng, Zheng Zhu, Yongming Rao, Guan Huang, Jiwen Lu, Jie zhou

In this paper, we propose a SurroundDepth method to incorporate the information from multiple surrounding views to predict depth maps across cameras.

Autonomous Driving Monocular Depth Estimation

Attributable Visual Similarity Learning

1 code implementation CVPR 2022 Borui Zhang, Wenzhao Zheng, Jie zhou, Jiwen Lu

This paper proposes an attributable visual similarity learning (AVSL) framework for a more accurate and explainable similarity measure between images.

Ranked #3 on Metric Learning on CARS196 (using extra training data)

Metric Learning Semantic Similarity +1

Dimension Embeddings for Monocular 3D Object Detection

no code implementations CVPR 2022 Yunpeng Zhang, Wenzhao Zheng, Zheng Zhu, Guan Huang, Dalong Du, Jie zhou, Jiwen Lu

In this paper, we propose a general method to learn appropriate embeddings for dimension estimation in monocular 3D object detection.

Monocular 3D Object Detection Object +1

Deep Relational Metric Learning

1 code implementation ICCV 2021 Wenzhao Zheng, Borui Zhang, Jiwen Lu, Jie zhou

This paper presents a deep relational metric learning (DRML) framework for image clustering and retrieval.

Image Clustering Metric Learning +1

Deep Compositional Metric Learning

1 code implementation CVPR 2021 Wenzhao Zheng, Chengkun Wang, Jiwen Lu, Jie zhou

In this paper, we propose a deep compositional metric learning (DCML) framework for effective and generalizable similarity measurement between images.

Metric Learning

Deep Metric Learning via Adaptive Learnable Assessment

no code implementations CVPR 2020 Wenzhao Zheng, Jiwen Lu, Jie Zhou

In this paper, we propose a deep metric learning via adaptive learnable assessment (DML-ALA) method for image retrieval and clustering, which aims to learn a sample assessment strategy to maximize the generalization of the trained metric.

Clustering Image Retrieval +3

Hardness-Aware Deep Metric Learning

2 code implementations CVPR 2019 Wenzhao Zheng, Zhaodong Chen, Jiwen Lu, Jie zhou

This paper presents a hardness-aware deep metric learning (HDML) framework.

Ranked #30 on Metric Learning on CUB-200-2011 (using extra training data)

Image Retrieval Metric Learning

Deep Adversarial Metric Learning

no code implementations CVPR 2018 Yueqi Duan, Wenzhao Zheng, Xudong Lin, Jiwen Lu, Jie zhou

Learning an effective distance metric between image pairs plays an important role in visual analysis, where the training procedure largely relies on hard negative samples.

Metric Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.