Search Results for author: Peixiang Huang

Found 7 papers, 2 papers with code

Improving Vision-and-Language Reasoning via Spatial Relations Modeling

no code implementations • 9 Nov 2023 • Cheng Yang, Rui Xu, Ye Guo, Peixiang Huang, Yiru Chen, Wenkui Ding, Zhongyuan Wang, Hong Zhou

Further, we design two pre-training tasks named object position regression (OPR) and spatial relation classification (SRC) to learn to reconstruct the spatial relation graph respectively.

Position regression Relation +3

SCAAT: Improving Neural Network Interpretability via Saliency Constrained Adaptive Adversarial Training

no code implementations • 9 Nov 2023 • Rui Xu, Wenkang Qin, Peixiang Huang, Hao Wang, Lin Luo

Deep Neural Networks (DNNs) are expected to provide explanations that help users understand their black-box predictions.

What a Whole Slide Image Can Tell? Subtype-guided Masked Transformer for Pathological Image Captioning

no code implementations • 31 Oct 2023 • Wenkang Qin, Rui Xu, Peixiang Huang, Xiaomin Wu, Heyu Zhang, Lin Luo

Pathological captioning of Whole Slide Images (WSIs), though essential in computer-aided pathological diagnosis, has rarely been studied due to limitations in datasets and model training efficacy.

Image Captioning Sentence +1

Assessing and Enhancing Robustness of Deep Learning Models with Corruption Emulation in Digital Pathology

no code implementations • 31 Oct 2023 • Peixiang Huang, Songtao Zhang, Yulu Gan, Rui Xu, Rongqi Zhu, Wenkang Qin, Limei Guo, Shan Jiang, Lin Luo

Deep learning in digital pathology brings intelligence and automation as substantial enhancements to pathological analysis, the gold standard of clinical diagnosis.

RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision

1 code implementation • 18 Sep 2023 • Mingjie Pan, Jiaming Liu, Renrui Zhang, Peixiang Huang, Xiaoqi Li, Bing Wang, Hongwei Xie, Li Liu, Shanghang Zhang

3D occupancy prediction, which quantifies 3D scenes into grid cells with semantic labels, holds significant promise in the fields of robot perception and autonomous driving.

Autonomous Driving

UniOcc: Unifying Vision-Centric 3D Occupancy Prediction with Geometric and Semantic Rendering

no code implementations • 15 Jun 2023 • Mingjie Pan, Li Liu, Jiaming Liu, Peixiang Huang, Longlong Wang, Shanghang Zhang, Shaoqing Xu, Zhiyi Lai, Kuiyuan Yang

In this technical report, we present our solution, named UniOCC, for the Vision-Centric 3D occupancy prediction track in the nuScenes Open Dataset Challenge at CVPR 2023.

Ranked #3 on Prediction Of Occupancy Grid Maps on Occ3D-nuScenes

Prediction Of Occupancy Grid Maps

TiG-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning

1 code implementation • 28 Dec 2022 • Peixiang Huang, Li Liu, Renrui Zhang, Song Zhang, Xinli Xu, Baichao Wang, Guoyi Liu

In this paper, we propose a learning scheme, termed TiG-BEV, that transfers Target Inner-Geometry from the LiDAR modality into camera-based BEV detectors for both dense depth and BEV features.

3D Object Detection object-detection
