no code implementations • 26 Jun 2023 • Prashant Kumar, Dhruv Makwana, Onkar Susladkar, Anurag Mittal, Prem Kumar Kalra
In the real world however, LiDAR scans consist of non-stationary dynamic structures - moving and movable objects.
no code implementations • IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023 • Onkar Susladkar, Gayatri Deshmukh, Dhruv Makwana, Sparsh Mittal, R Sai Chandra Teja, Rekha Singhal
We introduce a novel network, GAFNet (Global Attention Fourier Net), which learns through large-scale pre-training over three image-text datasets (COCO, SBU, and CC-3M), for achieving high performance on downstream vision and language tasks.
1 code implementation • 26 Oct 2022 • Onkar Susladkar, Dhruv Makwana, Gayatri Deshmukh, Sparsh Mittal, Sai Chandra Teja R, Rekha Singhal
Further, we use a novel multi-headed decoder that generates a high-pass filtered image and a segmentation map, in addition to a text-free image.
2 code implementations • 13 Jul 2022 • Dhruv Makwana, Subhrajit Nag, Onkar Susladkar, Gayatri Deshmukh, Sai Chandra Teja R, Sparsh Mittal, C Krishna Mohan
We propose a novel deep learning model named ACLNet, for cloud segmentation from ground images.
Ranked #1 on Semantic Segmentation on SWINySEG
1 code implementation • 3 Jul 2022 • Subhrajit Nag, Dhruv Makwana, Sai Chandra Teja R, Sparsh Mittal, C Krishna Mohan
WSCN has a model size of only 0. 51MB and performs only 0. 2M FLOPS.
Ranked #1 on Semantic Segmentation on MixedWM38
no code implementations • 17 Aug 2021 • Feng Sun, Ajith Kumar V, Guanci Yang, Qikui Zhu, Yiyun Zhang, Ansi Zhang, Dhruv Makwana
Graph Convolutional Networks (GCNs) are widely used in many applications yet still need large amounts of labelled data for training.