Search Results for author: Sahal Shaji Mullappilly

Found 7 papers, 7 papers with code

Semi-supervised Open-World Object Detection

1 code implementation • 25 Feb 2024 • Sahal Shaji Mullappilly, Abhishek Singh Gehlot, Rao Muhammad Anwer, Fahad Shahbaz Khan, Hisham Cholakkal

We demonstrate the effectiveness of our SS-OWOD problem setting and approach for remote sensing object detection, proposing carefully curated splits and baseline performance evaluations.

Incremental Learning Object +2

Paper
Code

BiMediX: Bilingual Medical Mixture of Experts LLM

1 code implementation • 20 Feb 2024 • Sara Pieri, Sahal Shaji Mullappilly, Fahad Shahbaz Khan, Rao Muhammad Anwer, Salman Khan, Timothy Baldwin, Hisham Cholakkal

In this paper, we introduce BiMediX, the first bilingual medical mixture of experts LLM designed for seamless interaction in both English and Arabic.

Multiple-choice Open-Ended Question Answering

Paper
Code

Arabic Mini-ClimateGPT : A Climate Change and Sustainability Tailored Arabic LLM

1 code implementation • 14 Dec 2023 • Sahal Shaji Mullappilly, Abdelrahman Shaker, Omkar Thawakar, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Fahad Shahbaz Khan

To this end, we propose a light-weight Arabic Mini-ClimateGPT that is built on an open-source LLM and is specifically fine-tuned on a conversational-style instruction tuning curated Arabic dataset Clima500-Instruct with over 500k instructions about climate change and sustainability.

Paper
Code

GLaMM: Pixel Grounding Large Multimodal Model

1 code implementation • 6 Nov 2023 • Hanoona Rasheed, Muhammad Maaz, Sahal Shaji Mullappilly, Abdelrahman Shaker, Salman Khan, Hisham Cholakkal, Rao M. Anwer, Erix Xing, Ming-Hsuan Yang, Fahad S. Khan

In this work, we present Grounding LMM (GLaMM), the first model that can generate natural language responses seamlessly intertwined with corresponding object segmentation masks.

Conversational Question Answering Image Captioning +5

576

Paper
Code

XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models

1 code implementation • 13 Jun 2023 • Omkar Thawkar, Abdelrahman Shaker, Sahal Shaji Mullappilly, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Jorma Laaksonen, Fahad Shahbaz Khan

The latest breakthroughs in large vision-language models, such as Bard and GPT-4, have showcased extraordinary abilities in performing a wide range of tasks.

Language Modelling Large Language Model

422

Paper
Code

An Empirical Study Of Self-supervised Learning Approaches For Object Detection With Transformers

2 code implementations • 11 May 2022 • Gokul Karthik Kumar, Sahal Shaji Mullappilly, Abhishek Singh Gehlot

However, the CNN feature maps still maintain the spatial relationship and we utilize this property to design self-supervised learning approaches to train the encoder of object detection transformers in pretraining and multi-task learning settings.

Image Classification Image Reconstruction +6

Paper
Code

MuCoT: Multilingual Contrastive Training for Question-Answering in Low-resource Languages

1 code implementation • DravidianLangTech (ACL) 2022 • Gokul Karthik Kumar, Abhishek Singh Gehlot, Sahal Shaji Mullappilly, Karthik Nandakumar

These models are pre-trained in a self-supervised fashion with a large English text corpus and further fine-tuned with a massive English QA dataset (e. g., SQuAD).

Ranked #1 on Question Answering on ChAII - Hindi and Tamil Question Answering

Question Answering

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.