Search Results for author: Chenbin Pan

Found 5 papers, 1 papers with code

CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow

no code implementations • 13 Mar 2024 • Chenbin Pan, Burhaneddin Yaman, Senem Velipasalar, Liu Ren

Autonomous driving stands as a pivotal domain in computer vision, shaping the future of transportation.

3D Object Detection Autonomous Driving +2

Paper
Add Code

VLP: Vision Language Planning for Autonomous Driving

no code implementations • 10 Jan 2024 • Chenbin Pan, Burhaneddin Yaman, Tommaso Nesti, Abhirup Mallik, Alessandro G Allievi, Senem Velipasalar, Liu Ren

Autonomous driving is a complex and challenging task that aims at safe motion planning through scene understanding and reasoning.

Autonomous Driving Motion Planning +1

Paper
Add Code

SVT: Supertoken Video Transformer for Efficient Video Understanding

no code implementations • 1 Apr 2023 • Chenbin Pan, Rui Hou, Hanchao Yu, Qifan Wang, Senem Velipasalar, Madian Khabsa

Whether by processing videos with fixed resolution from start to end or incorporating pooling and down-scaling strategies, existing video transformers process the whole video content throughout the network without specially handling the large portions of redundant information.

Video Understanding

Paper
Add Code

EgoViT: Pyramid Video Transformer for Egocentric Action Recognition

no code implementations • 15 Mar 2023 • Chenbin Pan, Zhiqi Zhang, Senem Velipasalar, Yi Xu

Different from previous video transformers, which use the same static embedding as the class token for diverse inputs, we propose a dynamic class token generator that produces a class token for each input video by analyzing the hand-object interaction and the related motion information.

Action Recognition

Paper
Add Code

PT-CapsNet: A Novel Prediction-Tuning Capsule Network Suitable for Deeper Architectures

1 code implementation • ICCV 2021 • Chenbin Pan, Senem Velipasalar

Existing variations of CapsNets mainly focus on performance comparison with the original CapsNet, and have not outperformed CNN-based models on complex tasks.

object-detection Object Detection +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.