1 code implementation • 5 Jan 2024 • Saurabh Atreya, Maheswar Bora, Aritra Mukherjee, Abhijit Das
We proposed SliT-CNN, a novel 2D spatial-temporal convolutional neural network (CNN) for better featuring of the air signature.
no code implementations • 5 Jan 2024 • Aritra Mukherjee, Abhijit Das
Recent literature has witnessed significant interest towards 3D biometrics employing monocular vision for robust authentication methods.
1 code implementation • 24 Nov 2023 • Kartik Kuckreja, Muhammad Sohail Danish, Muzammal Naseer, Abhijit Das, Salman Khan, Fahad Shahbaz Khan
Furthermore, the lack of domain-specific multimodal instruction following data as well as strong backbone models for RS make it hard for the models to align their behavior with user queries.
1 code implementation • 31 Oct 2023 • Srijan Das, Tanmay Jain, Dominick Reilly, Pranav Balaji, Soumyajit Karmakar, Shyam Marjit, Xiang Li, Abhijit Das, Michael S. Ryoo
We explore the appropriate SSL tasks that can be optimized alongside the primary task, the training schemes for these tasks, and the data scale at which they can be most effective.
no code implementations • 25 Aug 2023 • Pranav Balaji, Abhijit Das, Srijan Das, Antitza Dantcheva
This work explores various ways of exploring multi-task learning (MTL) techniques aimed at classifying videos as original or manipulated in cross-manipulation scenario to attend generalizability in deep fake scenario.
no code implementations • 30 Aug 2021 • Tanmoy Mondal, Abhijit Das, Zuheng Ming
In this work, we adhere to explore a Multi-Tasking learning (MTL) based network to perform document attribute classification such as the font type, font size, font emphasis and scanning resolution classification of a document image.
1 code implementation • 6 Apr 2020 • S. V. Aruna Kumar, Ehsan Yaghoubi, Abhijit Das, B. S. Harish, Hugo Proença
Over the last decades, the world has been witnessing growing threats to the security in urban spaces, which has augmented the relevance given to visual surveillance solutions able to detect, track and identify persons of interest in crowds.