Search Results for author: Gautam Shroff

Found 53 papers, 4 papers with code

Prompt Augmented Generative Replay via Supervised Contrastive Learning for Lifelong Intent Detection

no code implementations • Findings (NAACL) 2022 • Vaibhav Varshney, Mayur Patidar, Rajat Kumar, Lovekesh Vig, Gautam Shroff

This typically entails repeated retraining of the intent detector on both the existing and novel intents which can be expensive and would require storage of all past data corresponding to prior intents.

Continual Learning Contrastive Learning +2

Paper
Add Code

Intent Detection and Discovery from User Logs via Deep Semi-Supervised Contrastive Clustering

no code implementations • NAACL 2022 • Rajat Kumar, Mayur Patidar, Vaibhav Varshney, Lovekesh Vig, Gautam Shroff

However, even skilled domain experts are often unable to foresee all possible user intents at design time and for practical applications, novel intents may have to be inferred incrementally on-the-fly from user utterances.

Ranked #1 on Open Intent Discovery on BANKING77

Clustering Intent Detection +4

Paper
Add Code

Acceleron: A Tool to Accelerate Research Ideation

no code implementations • 7 Mar 2024 • Harshit Nigam, Manasi Patwardhan, Lovekesh Vig, Gautam Shroff

To aid with research ideation, we propose `Acceleron', a research accelerator for different phases of the research life cycle, and which is specially designed to aid the ideation process.

Paper
Add Code

Conservative Predictions on Noisy Financial Data

no code implementations • 18 Oct 2023 • Omkar Nabar, Gautam Shroff

We apply a similar approach, where a model abstains from making a prediction on data points that it is uncertain on.

Paper
Add Code

Adapt and Decompose: Efficient Generalization of Text-to-SQL via Domain Adapted Least-To-Most Prompting

no code implementations • 1 Aug 2023 • Aseem Arora, Shabbirhussain Bhaisaheb, Harshit Nigam, Manasi Patwardhan, Lovekesh Vig, Gautam Shroff

Cross-domain and cross-compositional generalization of Text-to-SQL semantic parsing is a challenging task.

Domain Generalization Language Modelling +4

Paper
Add Code

Neuro-symbolic Meta Reinforcement Learning for Trading

no code implementations • 15 Jan 2023 • S I Harini, Gautam Shroff, Ashwin Srinivasan, Prayushi Faldu, Lovekesh Vig

We model short-duration (e. g. day) trading in financial markets as a sequential decision-making problem under uncertainty, with the added complication of continual concept-drift.

Decision Making Meta Reinforcement Learning +3

Paper
Add Code

Calibrating Deep Neural Networks using Explicit Regularisation and Dynamic Data Pruning

no code implementations • 20 Dec 2022 • Ramya Hebbalaguppe, Rishabh Patra, Tirtharaj Dash, Gautam Shroff, Lovekesh Vig

Contemporary model calibration techniques mitigate the problem of overconfident predictions by pushing down the confidence of the winning class while increasing the confidence of the remaining classes across all test samples.

Image Classification

Paper
Add Code

Neural Feature-Adaptation for Symbolic Predictions Using Pre-Training and Semantic Loss

no code implementations • 29 Nov 2022 • Vedant Shah, Aditya Agrawal, Lovekesh Vig, Ashwin Srinivasan, Gautam Shroff, Tanmay Verlekar

Real data is often imperfect meaning that even if the symbolic theory remains unchanged, we may still need to address the problem of mapping raw data to high-level symbols, each time there is a change in the data acquisition environment or equipment.

Paper
Add Code

Knowledge-based Analogical Reasoning in Neuro-symbolic Latent Spaces

no code implementations • 19 Sep 2022 • Vishwa Shah, Aditya Sharma, Gautam Shroff, Lovekesh Vig, Tirtharaj Dash, Ashwin Srinivasan

However, connectionist models struggle to include explicit domain knowledge for deductive reasoning.

Paper
Add Code

Continual Learning for Multivariate Time Series Tasks with Variable Input Dimensions

no code implementations • 14 Mar 2022 • Vibhor Gupta, Jyoti Narwariya, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

We note that existing continual learning methods do not take into account variability in input dimensions arising due to different subsets of sensors being available across tasks, and struggle to adapt to such variable input dimensions (VID) tasks.

Activity Recognition Continual Learning +2

Paper
Add Code

Learning to Liquidate Forex: Optimal Stopping via Adaptive Top-K Regression

no code implementations • 25 Feb 2022 • Diksha Garg, Pankaj Malhotra, Anil Bhatia, Sanjay Bhat, Lovekesh Vig, Gautam Shroff

We consider learning a trading agent acting on behalf of the treasury of a firm earning revenue in a foreign currency (FC) and incurring expenses in the home currency (HC).

regression

Paper
Add Code

DRTCI: Learning Disentangled Representations for Temporal Causal Inference

no code implementations • 20 Jan 2022 • Garima Gupta, Lovekesh Vig, Gautam Shroff

Medical professionals evaluating alternative treatment plans for a patient often encounter time varying confounders, or covariates that affect both the future treatment assignment and the patient outcome.

Causal Inference counterfactual +1

Paper
Add Code

Solving Visual Analogies Using Neural Algorithmic Reasoning

no code implementations • 19 Nov 2021 • Atharv Sonwane, Gautam Shroff, Lovekesh Vig, Ashwin Srinivasan, Tirtharaj Dash

We consider a class of visual analogical reasoning problems that involve discovering the sequence of transformations by which pairs of input/output images are related, so as to analogously transform future inputs.

Program Synthesis Visual Analogies

Paper
Add Code

Forecasting Market Prices using DL with Data Augmentation and Meta-learning: ARIMA still wins!

no code implementations • NeurIPS Workshop ICBINB 2021 • Vedant Shah, Gautam Shroff

We benchmark the baseline techniques on this synthetic data as well as use it for data augmentation.

Data Augmentation Meta-Learning +2

Paper
Add Code

Using Program Synthesis and Inductive Logic Programming to solve Bongard Problems

no code implementations • 19 Oct 2021 • Atharv Sonwane, Sharad Chitlangia, Tirtharaj Dash, Lovekesh Vig, Gautam Shroff, Ashwin Srinivasan

The ability to solve Bongard problems is an example of such a test.

Inductive logic programming Program Synthesis

Paper
Add Code

Complex Question Answering on knowledge graphs using machine translation and multi-task learning

no code implementations • EACL 2021 • Saurabh Srivastava, Mayur Patidar, Sudip Chowdhury, Puneet Agarwal, Indrajit Bhattacharya, Gautam Shroff

Question answering (QA) over a knowledge graph (KG) is a task of answering a natural language (NL) query using the information stored in KG.

Entity Linking Knowledge Graphs +5

Paper
Add Code

CAMTA: Causal Attention Model for Multi-touch Attribution

no code implementations • 21 Dec 2020 • Sachin Kumar, Garima Gupta, Ranjitha Prasad, Arnab Chatterjee, Lovekesh Vig, Gautam Shroff

Advertising channels have evolved from conventional print media, billboards and radio advertising to online digital advertising (ad), where the users are exposed to a sequence of ad campaigns via social networks, display ads, search etc.

Selection bias

Paper
Add Code

Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation

no code implementations • 16 Dec 2020 • Diksha Garg, Priyanka Gupta, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

Most of the existing deep reinforcement learning (RL) approaches for session-based recommendations either rely on costly online interactions with real users, or rely on potentially biased rule-based or data-driven user-behavior models for learning.

Distributional Reinforcement Learning Offline RL +3

Paper
Add Code

Hi-CI: Deep Causal Inference in High Dimensions

no code implementations • 22 Aug 2020 • Ankit Sharma, Garima Gupta, Ranjitha Prasad, Arnab Chatterjee, Lovekesh Vig, Gautam Shroff

The proposed architecture comprises of a decorrelation network and an outcome prediction network.

Causal Inference counterfactual +1

Paper
Add Code

Handling Variable-Dimensional Time Series with Graph Neural Networks

no code implementations • 1 Jul 2020 • Vibhor Gupta, Jyoti Narwariya, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

Such a combinatorial generalization is achieved by conditioning the layers of a core neural network-based time series model with a "conditioning vector" that carries information of the available combination of sensors for each time series.

Activity Recognition Time Series +2

Paper
Add Code

Graph Neural Networks for Leveraging Industrial Equipment Structure: An application to Remaining Useful Life Estimation

no code implementations • 30 Jun 2020 • Jyoti Narwariya, Pankaj Malhotra, Vishnu Tv, Lovekesh Vig, Gautam Shroff

Deep learning models such as those based on recurrent neural networks (RNNs) or convolutional neural networks (CNNs) fail to explicitly leverage this potentially rich source of domain-knowledge into the learning procedure.

Time Series Time Series Analysis

Paper
Add Code

Privacy Guidelines for Contact Tracing Applications

no code implementations • 28 Apr 2020 • Manish Shukla, Rajan M A, Sachin Lodha, Gautam Shroff, Ramesh Raskar

Due to this there is an emergence of mobile based applications for contact tracing.

Paper
Add Code

MultiMBNN: Matched and Balanced Causal Inference with Neural Networks

no code implementations • 28 Apr 2020 • Ankit Sharma, Garima Gupta, Ranjitha Prasad, Arnab Chatterjee, Lovekesh Vig, Gautam Shroff

Causal inference (CI) in observational studies has received a lot of attention in healthcare, education, ad attribution, policy evaluation, etc.

Causal Inference

Paper
Add Code

MetaCI: Meta-Learning for Causal Inference in a Heterogeneous Population

no code implementations • 9 Dec 2019 • Ankit Sharma, Garima Gupta, Ranjitha Prasad, Arnab Chatterjee, Lovekesh Vig, Gautam Shroff

Performing inference on data obtained through observational studies is becoming extremely relevant due to the widespread availability of data in fields such as healthcare, education, retail, etc.

Causal Inference counterfactual +1

Paper
Add Code

From Monolingual to Multilingual FAQ Assistant using Multilingual Co-training

no code implementations • WS 2019 • Mayur Patidar, Surabhi Kumari, Manasi Patwardhan, Kar, Shirish e, Puneet Agarwal, Lovekesh Vig, Gautam Shroff

We observe that the proposed approach provides consistent gains in the performance of BERT for multiple benchmark datasets (e. g. 1. 0{\%} gain on MLDocs, and 1. 2{\%} gain on XNLI over translate-train with BERT), while requiring a single model for multiple languages.

Cross-Lingual Transfer

Paper
Add Code

Meta-Learning for Few-Shot Time Series Classification

no code implementations • 13 Sep 2019 • Jyoti Narwariya, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff, Vishnu Tv

We overcome this limitation in order to train a common agent across domains with each domain having different number of target classes, we utilize a triplet-loss based learning procedure that does not require any constraints to be enforced on the number of classes for the few-shot TSC tasks.

Activity Recognition Classification +5

Paper
Add Code

NISER: Normalized Item and Session Representations to Handle Popularity Bias

2 code implementations • 10 Sep 2019 • Priyanka Gupta, Diksha Garg, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

The models using normalized item and session-graph representations perform significantly better: i. for the less popular long-tail items in the offline setting, and ii.

Ranked #2 on Session-Based Recommendations on Last.FM

Session-Based Recommendations

Paper
Code

Meta-Learning for Black-box Optimization

no code implementations • 16 Jul 2019 • Vishnu TV, Pankaj Malhotra, Jyoti Narwariya, Lovekesh Vig, Gautam Shroff

Recently, neural networks trained as optimizers under the "learning to learn" or meta-learning framework have been shown to be effective for a broad range of optimization tasks including derivative-free black-box function optimization.

Meta-Learning

Paper
Add Code

One-shot Information Extraction from Document Images using Neuro-Deductive Program Synthesis

no code implementations • 6 Jun 2019 • Vishal Sunder, Ashwin Srinivasan, Lovekesh Vig, Gautam Shroff, Rohit Rahul

Our interest in this paper is in meeting a rapidly growing industrial demand for information extraction from images of documents such as invoices, bills, receipts etc.

Program Synthesis

Paper
Add Code

Fast Online "Next Best Offers" using Deep Learning

no code implementations • 31 May 2019 • Rekha Singhal, Gautam Shroff, Mukund Kumar, Sharod Roy, Sanket Kadarkar, Rupinder virk, Siddharth Verma, Vartika Tiwari

In this paper, we present iPrescribe, a scalable low-latency architecture for recommending 'next-best-offers' in an online setting.

BIG-bench Machine Learning

Paper
Add Code

ConvTimeNet: A Pre-trained Deep Convolutional Neural Network for Time Series Classification

no code implementations • 29 Apr 2019 • Kathan Kashiparekh, Jyoti Narwariya, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

We also provide qualitative insights into the working of CTN by: i) analyzing the activations and filters of first convolution layer suggesting the filters in CTN are generically useful, ii) analyzing the impact of the design decision to incorporate multiple length decisions, and iii) finding regions of time series that affect the final classification decision via occlusion sensitivity analysis.

Computational Efficiency General Classification +3

Paper
Add Code

Transfer Learning for Clinical Time Series Analysis using Deep Neural Networks

no code implementations • 1 Apr 2019 • Priyanka Gupta, Pankaj Malhotra, Jyoti Narwariya, Lovekesh Vig, Gautam Shroff

We, therefore, conclude that pre-trained deep models like TimeNet and HealthNet allow leveraging the advantages of deep learning for clinical time series analysis tasks, while also minimize dependence on hand-crafted features, deal robustly with scarce labeled training data scenarios without overfitting, as well as reduce dependence on expertise and resources required to train deep networks from scratch.

Domain Adaptation Time Series +2

Paper
Add Code

Data-driven Prognostics with Predictive Uncertainty Estimation using Ensemble of Deep Ordinal Regression Models

no code implementations • 23 Mar 2019 • Vishnu TV, Diksha, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

We propose a novel deep learning based approach for Prognostics with Uncertainty Quantification that is useful in scenarios where: (i) access to labeled failure data is scarce due to rarity of failures (ii) future operational conditions are unobserved and (iii) inherent noise is present in the sensor readings.

regression Time Series Analysis +1

Paper
Add Code

MEETING BOT: Reinforcement Learning for Dialogue Based Meeting Scheduling

no code implementations • 28 Dec 2018 • Vishwanath D, Lovekesh Vig, Gautam Shroff, Puneet Agarwal

In this paper we present Meeting Bot, a reinforcement learning based conversational system that interacts with multiple users to schedule meetings.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Deep Reader: Information extraction from Document images via relation extraction and Natural Language

no code implementations • 11 Dec 2018 • Vishwanath D, Rohit Rahul, Gunjan Sehgal, Swati, Arindam Chowdhury, Monika Sharma, Lovekesh Vig, Gautam Shroff, Ashwin Srinivasan

In this paper, we propose a novel enterprise based end-to-end framework called DeepReader which facilitates information extraction from document images via identification of visual entities and populating a meta relational model across different entities in the document image.

Optical Character Recognition Optical Character Recognition (OCR) +2

Paper
Add Code

Prosocial or Selfish? Agents with different behaviors for Contract Negotiation using Reinforcement Learning

no code implementations • 19 Sep 2018 • Vishal Sunder, Lovekesh Vig, Arnab Chatterjee, Gautam Shroff

We further train a meta agent with a mixture of behaviors by learning an ensemble of different models using reinforcement learning.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Transfer Learning for Clinical Time Series Analysis using Recurrent Neural Networks

no code implementations • 4 Jul 2018 • Priyanka Gupta, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

We consider transferring the knowledge captured in an RNN trained on several source tasks simultaneously using a large labeled dataset to build the model for a target task with limited labeled data.

Mortality Prediction Time Series +2

Paper
Add Code

Information Bottleneck Inspired Method For Chat Text Segmentation

no code implementations • IJCNLP 2017 • S Vishal, Mohit Yadav, Lovekesh Vig, Gautam Shroff

We present a novel technique for segmenting chat conversations using the information bottleneck method (Tishby et al., 2000), augmented with sequential continuity constraints.

Representation Learning Text Generation +2

Paper
Add Code

Crop Planning using Stochastic Visual Optimization

no code implementations • 25 Oct 2017 • Gunjan Sehgal, Bindu Gupta, Kaushal Paneri, Karamjit Singh, Geetika Sharma, Gautam Shroff

Given the weather and soil properties, farmers need to take critical decisions such as which seed variety to plant and in what proportion, in order to maximize productivity.

Decision Making

Paper
Add Code

Predicting Remaining Useful Life using Time Series Embeddings based on Recurrent Neural Networks

no code implementations • 4 Sep 2017 • Narendhar Gugulothu, Vishnu Tv, Pankaj Malhotra, Lovekesh Vig, Puneet Agarwal, Gautam Shroff

We propose Embed-RUL: a novel approach for RUL estimation from sensor data that does not rely on any degradation-trend assumptions, is robust to noise, and handles missing values.

Time Series Time Series Analysis

Paper
Add Code

Comparative Benchmarking of Causal Discovery Techniques

no code implementations • 18 Aug 2017 • Karamjit Singh, Garima Gupta, Vartika Tewari, Gautam Shroff

In this paper we present a comprehensive view of prominent causal discovery algorithms, categorized into two main categories (1) assuming acyclic and no latent variables, and (2) allowing both cycles and latent variables, along with experimental results comparing them from three perspectives: (a) structural accuracy, (b) standard predictive accuracy, and (c) accuracy of counterfactual inference.

Benchmarking Causal Discovery +2

Paper
Add Code

TimeNet: Pre-trained deep recurrent neural network for time series classification

2 code implementations • 23 Jun 2017 • Pankaj Malhotra, Vishnu Tv, Lovekesh Vig, Puneet Agarwal, Gautam Shroff

Inspired by the tremendous success of deep Convolutional Neural Networks as generic feature extractors for images, we propose TimeNet: a deep recurrent neural network (RNN) trained on diverse time series in an unsupervised manner using sequence to sequence (seq2seq) models to extract features from time series.

Dynamic Time Warping General Classification +3

Paper
Code

Learning and Knowledge Transfer with Memory Networks for Machine Comprehension

no code implementations • EACL 2017 • Mohit Yadav, Lovekesh Vig, Gautam Shroff

Motivated by these practical issues, we propose a novel curriculum inspired training procedure for Memory Networks to improve the performance for machine comprehension with relatively small volumes of training data.

Question Answering Reading Comprehension +1

Paper
Add Code

Deep Convolutional Neural Networks for Pairwise Causality

no code implementations • 3 Jan 2017 • Karamjit Singh, Garima Gupta, Lovekesh Vig, Gautam Shroff, Puneet Agarwal

Discovering causal models from observational and interventional data is an important first step preceding what-if analysis or counterfactual reasoning.

Attribute Causal Discovery +2

Paper
Add Code

Neuro-symbolic EDA-based Optimisation using ILP-enhanced DBNs

no code implementations • 20 Dec 2016 • Sarmimala Saikia, Lovekesh Vig, Ashwin Srinivasan, Gautam Shroff, Puneet Agarwal, Richa Rawat

We investigate solving discrete optimisation problems using the estimation of distribution (EDA) approach via a novel combination of deep belief networks(DBN) and inductive logic programming (ILP). While DBNs are used to learn the structure of successively better feasible solutions, ILP enables the incorporation of domain-based background knowledge related to the goodness of solutions. Recent work showed that ILP could be an effective way to use domain knowledge in an EDA scenario. However, in a purely ILP-based EDA, sampling successive populations is either inefficient or not straightforward. In our Neuro-symbolic EDA, an ILP engine is used to construct a model for good solutions using domain-based background knowledge. These rules are introduced as Boolean features in the last hidden layer of DBNs used for EDA-based optimization. This incorporation of logical ILP features requires some changes while training and sampling from DBNs: (a)our DBNs need to be trained with data for units at the input layer as well as some units in an otherwise hidden layer, and (b)we would like the samples generated to be drawn from instances entailed by the logical model. We demonstrate the viability of our approach on instances of two optimisation problems: predicting optimal depth-of-win for the KRK endgame, and jobshop scheduling. Our results are promising: (i)On each iteration of distribution estimation, samples obtained with an ILP-assisted DBN have a substantially greater proportion of good solutions than samples generated using a DBN without ILP features, and (ii)On termination of distribution estimation, samples obtained using an ILP-assisted DBN contain more near-optimal samples than samples from a DBN without ILP features. These results suggest that the use of ILP-constructed theories could be useful for incorporating complex domain-knowledge into deep models for estimation of distribution based procedures.

Inductive logic programming

Paper
Add Code

Multi-Sensor Prognostics using an Unsupervised Health Index based on LSTM Encoder-Decoder

no code implementations • 22 Aug 2016 • Pankaj Malhotra, Vishnu Tv, Anusha Ramakrishnan, Gaurangi Anand, Lovekesh Vig, Puneet Agarwal, Gautam Shroff

Many approaches for estimation of Remaining Useful Life (RUL) of a machine, using its operational sensor data, make assumptions about how a system degrades or a fault evolves, e. g., exponential degradation.

Exponential degradation Time Series +1

Paper
Add Code

Generation of Near-Optimal Solutions Using ILP-Guided Sampling

no code implementations • 3 Aug 2016 • Ashwin Srinivasan, Gautam Shroff, Lovekesh Vig, Sarmimala Saikia, Puneet Agarwal

To answer this in the affirmative, we need: (a)a general-purpose technique for the incorporation of domain knowledge when constructing models for optimal values; and (b)a way of using these models to generate new data samples.

Inductive logic programming Job Shop Scheduling +1

Paper
Add Code

LSTM-based Encoder-Decoder for Multi-sensor Anomaly Detection

9 code implementations • 1 Jul 2016 • Pankaj Malhotra, Anusha Ramakrishnan, Gaurangi Anand, Lovekesh Vig, Puneet Agarwal, Gautam Shroff

Mechanical devices such as engines, vehicles, aircrafts, etc., are typically instrumented with numerous sensors to capture the behavior and health of the machine.

Anomaly Detection Outlier Detection +3

1,154

Paper
Code

ODE - Augmented Training Improves Anomaly Detection in Sensor Data from Machines

no code implementations • 5 May 2016 • Mohit Yadav, Pankaj Malhotra, Lovekesh Vig, K Sriram, Gautam Shroff

The available data is then augmented with data generated from the ODE, and the anomaly detector is retrained on this augmented dataset.

Anomaly Detection Time Series +1

Paper
Add Code

Long Short Term Memory Networks for Anomaly Detection in Time Series

1 code implementation • ESANN 2015 • Pankaj Malhotra, Lovekesh Vig, Gautam Shroff, Puneet Agarwal

Long Short Term Memory (LSTM) networks have been demonstrated to be particularly useful for learning sequences containing longer term patterns of unknown length, due to their ability to maintain long term memory.

Anomaly Detection Fault Detection +2

545

Paper
Code

Efficiently Discovering Frequent Motifs in Large-scale Sensor Data

no code implementations • 2 Jan 2015 • Puneet Agarwal, Gautam Shroff, Sarmimala Saikia, Zaigham Khan

While analyzing vehicular sensor data, we found that frequently occurring waveforms could serve as features for further analysis, such as rule mining, classification, and anomaly detection.

Anomaly Detection Clustering +2

Paper
Add Code

Warranty Cost Estimation Using Bayesian Network

no code implementations • 11 Nov 2014 • Karamjit Singh, Puneet Agarwal, Gautam Shroff

All multi-component product manufacturing companies face the problem of warranty cost estimation.

Paper
Add Code

Multi-Sensor Event Detection using Shape Histograms

no code implementations • 16 Aug 2014 • Ehtesham Hassan, Gautam Shroff, Puneet Agarwal

We present results on real-life vehicular sensor data and show that our technique performs better than available pattern detection implementations on our data, and that it can also be used to combine features from multiple sensors resulting in better accuracy than using any single sensor.

Event Detection Time Series +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.