Search Results for author: Haotian Tang

Found 13 papers, 10 papers with code

TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUs

1 code implementation • 25 Oct 2023 • Haotian Tang, Shang Yang, Zhijian Liu, Ke Hong, Zhongming Yu, Xiuyu Li, Guohao Dai, Yu Wang, Song Han

On top of this, we design the Sparse Autotuner, which extends the design space of existing sparse convolution libraries and searches for the best dataflow configurations for training and inference workloads.

Autonomous Driving Recommendation Systems

1,113

Paper
Code

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

2 code implementations • 21 Sep 2023 • Yukang Chen, Shengju Qian, Haotian Tang, Xin Lai, Zhijian Liu, Song Han, Jiaya Jia

For example, training on the context length of 8192 needs 16x computational costs in self-attention layers as that of 2048.

4k Instruction Following +2

5,809

Paper
Code

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

5 code implementations • 1 Jun 2023 • Ji Lin, Jiaming Tang, Haotian Tang, Shang Yang, Wei-Ming Chen, Wei-Chen Wang, Guangxuan Xiao, Xingyu Dang, Chuang Gan, Song Han

We then propose to search for the optimal per-channel scaling that protects the salient weights by observing the activation, not weights.

Autonomous Driving Common Sense Reasoning +3

18,614

Paper
Code

SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer

1 code implementation • CVPR 2023 • Xuanyao Chen, Zhijian Liu, Haotian Tang, Li Yi, Hang Zhao, Song Han

High-resolution images enable neural networks to learn richer visual representations.

2D Semantic Segmentation Instance Segmentation +4

Paper
Code

FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer

no code implementations • CVPR 2023 • Zhijian Liu, Xinyu Yang, Haotian Tang, Shang Yang, Song Han

Transformer, as an alternative to CNN, has been proven effective in many modalities (e. g., texts and images).

Autonomous Driving

Paper
Add Code

End-to-End Entity Detection with Proposer and Regressor

1 code implementation • 19 Oct 2022 • Xueru Wen, Changjiang Zhou, Haotian Tang, Luguang Liang, Yu Jiang, Hong Qi

Named entity recognition is a traditional task in natural language processing.

named-entity-recognition Named Entity Recognition +2

Paper
Code

Type-supervised sequence labeling based on the heterogeneous star graph for named entity recognition

1 code implementation • 19 Oct 2022 • Xueru Wen, Changjiang Zhou, Haotian Tang, Luguang Liang, Yu Jiang, Hong Qi

Named entity recognition is a fundamental task in natural language processing, identifying the span and category of entities in unstructured texts.

Graph Attention named-entity-recognition +4

Paper
Code

BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

1 code implementation • 26 May 2022 • Zhijian Liu, Haotian Tang, Alexander Amini, Xinyu Yang, Huizi Mao, Daniela Rus, Song Han

Multi-sensor fusion is essential for an accurate and reliable autonomous driving system.

Ranked #4 on 3D Object Detection on nuScenes

3D Multi-Object Tracking 3D Object Detection +3

2,011

Paper
Code

PVNAS: 3D Neural Architecture Search with Point-Voxel Convolution

no code implementations • 25 Apr 2022 • Zhijian Liu, Haotian Tang, Shengyu Zhao, Kevin Shao, Song Han

3D neural networks are widely used in real-world applications (e. g., AR/VR headsets, self-driving cars).

Neural Architecture Search Self-Driving Cars

Paper
Add Code

Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications

no code implementations • 25 Apr 2022 • Han Cai, Ji Lin, Yujun Lin, Zhijian Liu, Haotian Tang, Hanrui Wang, Ligeng Zhu, Song Han

Deep neural networks (DNNs) have achieved unprecedented success in the field of artificial intelligence (AI), including computer vision, natural language processing and speech recognition.

Model Compression Neural Architecture Search +3

Paper
Add Code

TorchSparse: Efficient Point Cloud Inference Engine

1 code implementation • 21 Apr 2022 • Haotian Tang, Zhijian Liu, Xiuyu Li, Yujun Lin, Song Han

TorchSparse directly optimizes the two bottlenecks of sparse convolution: irregular computation and data movement.

Autonomous Driving

1,113

Paper
Code

Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution

6 code implementations • ECCV 2020 • Haotian Tang, Zhijian Liu, Shengyu Zhao, Yujun Lin, Ji Lin, Hanrui Wang, Song Han

Self-driving cars need to understand 3D scenes efficiently and accurately in order to drive safely.

Ranked #1 on Robust 3D Semantic Segmentation on SemanticKITTI-C

3D Object Detection LIDAR Semantic Segmentation +4

1,134

Paper
Code

Point-Voxel CNN for Efficient 3D Deep Learning

4 code implementations • NeurIPS 2019 • Zhijian Liu, Haotian Tang, Yujun Lin, Song Han

The computation cost and memory footprints of the voxel-based models grow cubically with the input resolution, making it memory-prohibitive to scale up the resolution.

Ranked #1 on 3D Object Detection on KITTI Pedestrian Hard val

3D Object Detection 3D Semantic Segmentation +2

1,673

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.