Search Results for author: Manoj Kumar

Found 23 papers, 10 papers with code

IISERB Brains at SemEval-2022 Task 6: A Deep-learning Framework to Identify Intended Sarcasm in English

no code implementations • SemEval (NAACL) 2022 • Tanuj Shekhawat, Manoj Kumar, Udaybhan Rathore, Aditya Joshi, Jasabanta Patro

This paper describes the system architectures and the models submitted by our team “IISERB Brains” to SemEval 2022 Task 6 competition.

Paper
Add Code

Controlled Data Generation via Insertion Operations for NLU

no code implementations • NAACL (ACL) 2022 • Manoj Kumar, Yuval Merhav, Haidar Khan, Rahul Gupta, Anna Rumshisky, Wael Hamza

Use of synthetic data is rapidly emerging as a realistic alternative to manually annotating live traffic for industry-scale model building.

intent-classification Intent Classification +4

Paper
Add Code

Frozen Feature Augmentation for Few-Shot Image Classification

no code implementations • 15 Mar 2024 • Andreas Bär, Neil Houlsby, Mostafa Dehghani, Manoj Kumar

Training a linear classifier or lightweight model on top of pretrained vision model outputs, so-called 'frozen features', leads to impressive performance on a number of downstream few-shot tasks.

Classification Data Augmentation +1

Paper
Add Code

Image Captioners Are Scalable Vision Learners Too

1 code implementation • NeurIPS 2023 • Michael Tschannen, Manoj Kumar, Andreas Steiner, Xiaohua Zhai, Neil Houlsby, Lucas Beyer

We further analyze the effect of the model architecture and scale, as well as the pretraining data on the representation quality, and find that captioning exhibits the same or better scaling behavior along these axes.

Decoder Image Captioning

1,578

Paper
Code

Scaling Vision Transformers to 22 Billion Parameters

1 code implementation • 10 Feb 2023 • Mostafa Dehghani, Josip Djolonga, Basil Mustafa, Piotr Padlewski, Jonathan Heek, Justin Gilmer, Andreas Steiner, Mathilde Caron, Robert Geirhos, Ibrahim Alabdulmohsin, Rodolphe Jenatton, Lucas Beyer, Michael Tschannen, Anurag Arnab, Xiao Wang, Carlos Riquelme, Matthias Minderer, Joan Puigcerver, Utku Evci, Manoj Kumar, Sjoerd van Steenkiste, Gamaleldin F. Elsayed, Aravindh Mahendran, Fisher Yu, Avital Oliver, Fantine Huot, Jasmijn Bastings, Mark Patrick Collier, Alexey Gritsenko, Vighnesh Birodkar, Cristina Vasconcelos, Yi Tay, Thomas Mensink, Alexander Kolesnikov, Filip Pavetić, Dustin Tran, Thomas Kipf, Mario Lučić, Xiaohua Zhai, Daniel Keysers, Jeremiah Harmsen, Neil Houlsby

The scaling of Transformers has driven breakthrough capabilities for language models.

Ranked #1 on Zero-Shot Transfer Image Classification on ObjectNet

Action Classification Fairness +3

192

Paper
Code

Dual PatchNorm

7 code implementations • 2 Feb 2023 • Manoj Kumar, Mostafa Dehghani, Neil Houlsby

We propose Dual PatchNorm: two Layer Normalization layers (LayerNorms), before and after the patch embedding layer in Vision Transformers.

7,087

Paper
Code

Large language models can segment narrative events similarly to humans

no code implementations • 24 Jan 2023 • Sebastian Michelmann, Manoj Kumar, Kenneth A. Norman, Mariya Toneva

In the future, GPT-3 may thereby help to elucidate the principles underlying human event perception.

Language Modelling Large Language Model

Paper
Add Code

A Unified Framework for Optimization-Based Graph Coarsening

no code implementations • 2 Oct 2022 • Manoj Kumar, Anurag Sharma, Sandeep Kumar

In this paper, we introduce a novel optimization-based framework for graph dimensionality reduction.

Dimensionality Reduction Graph Learning

Paper
Add Code

Functional Optimization Reinforcement Learning for Real-Time Bidding

no code implementations • 25 Jun 2022 • Yining Lu, Changjie Lu, Naina Bandyopadhyay, Manoj Kumar, Gaurav Gupta

In order to evaluate the proposed RTB strategy's performance, we demonstrate the results on ten sequential simulated auction campaigns.

Attribute Multi-agent Reinforcement Learning +2

Paper
Add Code

Do better ImageNet classifiers assess perceptual similarity better?

no code implementations • 9 Mar 2022 • Manoj Kumar, Neil Houlsby, Nal Kalchbrenner, Ekin D. Cubuk

Perceptual distances between images, as measured in the space of pre-trained deep features, have outperformed prior low-level, pixel-based metrics on assessing perceptual similarity.

Paper
Add Code

IISERB Brains at SemEval 2022 Task 6: A Deep-learning Framework to Identify Intended Sarcasm in English

1 code implementation • 4 Mar 2022 • Tanuj Singh Shekhawat, Manoj Kumar, Udaybhan Rathore, Aditya Joshi, Jasabanta Patro

This paper describes the system architectures and the models submitted by our team "IISERBBrains" to SemEval 2022 Task 6 competition.

Paper
Code

Skillful Twelve Hour Precipitation Forecasts using Large Context Neural Networks

2 code implementations • 14 Nov 2021 • Lasse Espeholt, Shreya Agrawal, Casper Sønderby, Manoj Kumar, Jonathan Heek, Carla Bromberg, Cenk Gazen, Jason Hickey, Aaron Bell, Nal Kalchbrenner

An emerging class of weather models based on neural networks represents a paradigm shift in weather forecasting: the models learn the required transformations from data instead of relying on hand-coded physics and are computationally efficient.

energy management Management +2

220

Paper
Code

Coexistence of coarsening and mean field relaxation in the long-range Ising chain

no code implementations • 16 Feb 2021 • Federico Corberi, Alessandro Iannone, Manoj Kumar, Eugenio Lippiello, Paolo Politi

We study the kinetics after a low temperature quench of the one-dimensional Ising model with long range interactions between spins at distance $r$ decaying as $r^{-\alpha}$.

Statistical Mechanics

Paper
Add Code

Colorization Transformer

2 code implementations • ICLR 2021 • Manoj Kumar, Dirk Weissenborn, Nal Kalchbrenner

We present the Colorization Transformer, a novel approach for diverse high fidelity image colorization based on self-attention.

Ranked #2 on Colorization on ImageNet val

Colorization Image Colorization

32,932

Paper
Code

ProtoDA: Efficient Transfer Learning for Few-Shot Intent Classification

no code implementations • 28 Jan 2021 • Manoj Kumar, Varun Kumar, Hadrien Glaude, Cyprien delichy, Aman Alok, Rahul Gupta

We make use of a conditional generator for data augmentation that is trained directly using the meta-learning objective and simultaneously with prototypical networks, hence ensuring that data augmentation is customized to the task.

Classification Data Augmentation +9

Paper
Add Code

Necessary and Sufficient Condition for Satisfiability of a Boolean Formula in CNF and its Implications on P versus NP problem

no code implementations • 13 Jan 2021 • Manoj Kumar

Which leads to the necessary and sufficient condition for satisfiability of a boolean formula, in CNF.

Computational Complexity

Paper
Add Code

Designing Neural Speaker Embeddings with Meta Learning

1 code implementation • 31 Jul 2020 • Manoj Kumar, Tae Jin-Park, Somer Bishop, Shrikanth Narayanan

Our experiments illustrate the applicability of meta-learning as a generalized learning paradigm for training deep neural speaker embeddings.

Audio and Speech Processing Sound

302

Paper
Code

Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap

1 code implementation • 5 Mar 2020 • Tae Jin Park, Kyu J. Han, Manoj Kumar, Shrikanth Narayanan

In this study, we propose a new spectral clustering framework that can auto-tune the parameters of the clustering algorithm in the context of speaker diarization.

Ranked #1 on Speaker Diarization on CALLHOME (DER(ig olp) metric)

Clustering speaker-diarization +1

Paper
Code

Learning Domain Invariant Representations for Child-Adult Classification from Speech

no code implementations • 25 Oct 2019 • Rimita Lahiri, Manoj Kumar, Somer Bishop, Shrikanth Narayanan

Diagnostic procedures for ASD (autism spectrum disorder) involve semi-naturalistic interactions between the child and a clinician.

Binary Classification General Classification

Paper
Add Code

VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation

1 code implementation • ICLR 2020 • Manoj Kumar, Mohammad Babaeizadeh, Dumitru Erhan, Chelsea Finn, Sergey Levine, Laurent Dinh, Durk Kingma

Generative models that can model and predict sequences of future events can, in principle, learn to capture complex real-world phenomena, such as physical interactions.

Ranked #15 on Video Generation on BAIR Robot Pushing