Search Results for author: Balaraman Ravindran

Found 89 papers, 29 papers with code

InSaAF: Incorporating Safety through Accuracy and Fairness | Are LLMs ready for the Indian Legal Domain?

no code implementations • 16 Feb 2024 • Yogesh Tripathi, Raghav Donakanti, Sahil Girhepuje, Ishan Kavathekar, Bhaskara Hanuma Vedula, Gokul S Krishnan, Shreya Goyal, Anmol Goel, Balaraman Ravindran, Ponnurangam Kumaraguru

Task performance and fairness scores of LLaMA and LLaMA--2 models indicate that the proposed $LSS_{\beta}$ metric can effectively determine the readiness of a model for safe usage in the legal sector.

Fairness

Paper
Add Code

LineConGraphs: Line Conversation Graphs for Effective Emotion Recognition using Graph Neural Networks

no code implementations • 4 Dec 2023 • Gokul S Krishnan, Sarala Padi, Craig S. Greenberg, Balaraman Ravindran, Dinesh Manoch, Ram D. Sriram

To overcome these limitations, we propose novel line conversation graph convolutional network (LineConGCN) and graph attention (LineConGAT) models for ERC analysis.

Emotion Recognition Graph Attention +1

Paper
Add Code

Multi-Agent Learning of Efficient Fulfilment and Routing Strategies in E-Commerce

no code implementations • 20 Nov 2023 • Omkar Shelke, Pranavi Pathakota, Anandsingh Chauhan, Harshad Khadilkar, Hardik Meisheri, Balaraman Ravindran

This paper presents an integrated algorithmic framework for minimising product delivery costs in e-commerce (known as the cost-to-serve or C2S).

Decision Making

Paper
Add Code

PolicyClusterGCN: Identifying Efficient Clusters for Training Graph Convolutional Networks

no code implementations • 25 Jun 2023 • Saket Gurukar, Shaileshh Bojja Venkatakrishnan, Balaraman Ravindran, Srinivasan Parthasarathy

Specifically, the subgraph-based sampling approaches such as ClusterGCN and GraphSAINT have achieved state-of-the-art performance on the node classification tasks.

graph partitioning Node Classification +1

Paper
Add Code

GAN-MPC: Training Model Predictive Controllers with Parameterized Cost Functions using Demonstrations from Non-identical Experts

1 code implementation • 30 May 2023 • Returaj Burnwal, Anirban Santara, Nirav P. Bhatt, Balaraman Ravindran, Gaurav Aggarwal

We propose a novel approach that uses a generative adversarial network (GAN) to minimize the Jensen-Shannon divergence between the state-trajectory distributions of the demonstrator and the imitator.

Generative Adversarial Network Imitation Learning +1

Paper
Code

Clustering Indices based Automatic Classification Model Selection

1 code implementation • 23 May 2023 • Sudarsun Santhiappan, Nitin Shravan, Balaraman Ravindran

We also propose an end-to-end Automated ML system for data classification based on our model selection method.

Classification Clustering +2

Paper
Code

MABL: Bi-Level Latent-Variable World Model for Sample-Efficient Multi-Agent Reinforcement Learning

no code implementations • 12 Apr 2023 • Aravind Venugopal, Stephanie Milani, Fei Fang, Balaraman Ravindran

Unlike existing models, MABL is capable of encoding essential global information into the latent states during training while guaranteeing the decentralized execution of learned policies.

reinforcement-learning SMAC+

Paper
Add Code

Are Models Trained on Indian Legal Data Fair?

no code implementations • 13 Mar 2023 • Sahil Girhepuje, Anmol Goel, Gokul S Krishnan, Shreya Goyal, Satyendra Pandey, Ponnurangam Kumaraguru, Balaraman Ravindran

We highlight the propagation of learnt algorithmic biases in the bail prediction task for models trained on Hindi legal documents.

Fairness

Paper
Add Code

Physics-Informed Model-Based Reinforcement Learning

1 code implementation • 5 Dec 2022 • Adithya Ramesh, Balaraman Ravindran

In these environments, the physics-informed version of our algorithm achieves significantly better average-return and sample efficiency.

Model-based Reinforcement Learning reinforcement-learning +1

Paper
Code

ReGrAt: Regularization in Graphs using Attention to handle class imbalance

no code implementations • 27 Nov 2022 • Neeraja Kirtane, Jeshuren Chelladurai, Balaraman Ravindran, Ashish Tendulkar

Changing data composition is a popular way to address the imbalance in node classification.

Classification Node Classification

Paper
Add Code

GrabQC: Graph based Query Contextualization for automated ICD coding

no code implementations • 14 Jul 2022 • Jeshuren Chelladurai, Sudarsun Santhiappan, Balaraman Ravindran

We propose to automate this manual process by automatically constructing a query for the IR system using the entities auto-extracted from the clinical notes.

Information Retrieval Retrieval

Paper
Add Code

Multi-Variate Time Series Forecasting on Variable Subsets

1 code implementation • 25 Jun 2022 • Jatin Chauhan, Aravindan Raghuveer, Rishi Saket, Jay Nandy, Balaraman Ravindran

Through systematic experiments across 4 datasets and 5 forecast models, we show that our technique is able to recover close to 95\% performance of the models even when only 15\% of the original variables are present.

Multivariate Time Series Forecasting Time Series

Paper
Code

Matching options to tasks using Option-Indexed Hierarchical Reinforcement Learning

no code implementations • 12 Jun 2022 • Kushal Chauhan, Soumya Chatterjee, Akash Reddy, Balaraman Ravindran, Pradeep Shenoy

The options framework in Hierarchical Reinforcement Learning breaks down overall goals into a combination of options or simpler tasks and associated policies, allowing for abstraction in the action space.

Continual Learning Hierarchical Reinforcement Learning +3

Paper
Add Code

Evolutionary Approach to Security Games with Signaling

no code implementations • 29 Apr 2022 • Adam Żychowski, Jacek Mańdziuk, Elizabeth Bondi, Aravind Venugopal, Milind Tambe, Balaraman Ravindran

Green Security Games have become a popular way to model scenarios involving the protection of natural resources, such as wildlife.

Paper
Add Code

A Survey of Adversarial Defences and Robustness in NLP

no code implementations • 12 Mar 2022 • Shreya Goyal, Sumanth Doddapaneni, Mitesh M. Khapra, Balaraman Ravindran

In the past few years, it has become increasingly evident that deep neural networks are not resilient enough to withstand adversarial perturbations in input data, leaving them vulnerable to attack.

Adversarial Defense named-entity-recognition +5

Paper
Add Code

A Causal Approach for Unfair Edge Prioritization and Discrimination Removal

no code implementations • 29 Nov 2021 • Pavan Ravishankar, Pranshu Malviya, Balaraman Ravindran

We prove this result for the non-trivial non-parametric model setting when the cumulative unfairness cannot be expressed in terms of edge unfairness.

Paper
Add Code

Smooth Imitation Learning via Smooth Costs and Smooth Policies

no code implementations • 3 Nov 2021 • Sapana Chaudhary, Balaraman Ravindran

We call our new smooth IL algorithm \textit{Smooth Policy and Cost Imitation Learning} (SPaCIL, pronounced 'Special').

Continuous Control Imitation Learning +1

Paper
Add Code

Multi-Tailed, Multi-Headed, Spatial Dynamic Memory refined Text-to-Image Synthesis

no code implementations • 15 Oct 2021 • Amrit Diggavi Seshadri, Balaraman Ravindran

Synthesizing high-quality, realistic images from text-descriptions is a challenging task, and current methods synthesize images from text in a multi-stage manner, typically by first generating a rough initial image and then refining image details at subsequent stages.

Image Generation

Paper
Add Code

Dynamic probabilistic logic models for effective abstractions in RL

no code implementations • 15 Oct 2021 • Harsha Kokel, Arjun Manoharan, Sriraam Natarajan, Balaraman Ravindran, Prasad Tadepalli

State abstraction enables sample-efficient learning and better task transfer in complex reinforcement learning environments.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Semi-Supervised Deep Learning for Multiplex Networks

1 code implementation • 5 Oct 2021 • Anasua Mitra, Priyesh Vijayan, Ranbir Sanasam, Diganta Goswami, Srinivasan Parthasarathy, Balaraman Ravindran

Multiplex networks are complex graph structures in which a set of entities are connected to each other via multiple types of relations, each relation representing a distinct layer.

Representation Learning

Paper
Code

Causal Contextual Bandits with Targeted Interventions

no code implementations • ICLR 2022 • Chandrasekar Subramanian, Balaraman Ravindran

We study a contextual bandit setting where the learning agent has the ability to perform interventions on targeted subsets of the population, apart from possessing qualitative causal side-information.

Multi-Armed Bandits

Paper
Add Code

A Joint Training Framework for Open-World Knowledge Graph Embeddings

no code implementations • AKBC 2021 • Karthik V, Beethika Tripathi, Mitesh M Khapra, Balaraman Ravindran

However, we find that existing approaches suffer from one or more of four drawbacks – 1) They are not modular with respect to the choice of the KG embedding model 2) They ignore best practices for aligning two embedding spaces 3) They do not account for differences in training strategy needed when presented with datasets with different description sizes and 4) They do not produce entity embeddings for use by downstream tasks.

Dialogue Generation Entity Embeddings +4

Paper
Add Code

TAG: Task-based Accumulated Gradients for Lifelong learning

1 code implementation • 11 May 2021 • Pranshu Malviya, Balaraman Ravindran, Sarath Chandar

We also show that our method performs better than several state-of-the-art methods in lifelong learning on complex datasets with a large number of tasks.

Ranked #1 on Continual Learning on mini-Imagenet (20 tasks) - 1 epoch

Continual Learning

Paper
Code

Selective Intervention Planning using Restless Multi-Armed Bandits to Improve Maternal and Child Health Outcomes

no code implementations • 7 Mar 2021 • Siddharth Nishtala, Lovish Madaan, Aditya Mate, Harshavardhan Kamarthi, Anirudh Grama, Divy Thakkar, Dhyanesh Narayanan, Suresh Chaudhary, Neha Madhiwalla, Ramesh Padmanabhan, Aparna Hegde, Pradeep Varakantham, Balaraman Ravindran, Milind Tambe

India has a maternal mortality ratio of 113 and child mortality ratio of 2830 per 100, 000 live births.

Multi-Armed Bandits

Paper
Add Code

Hyperedge Prediction using Tensor Eigenvalue Decomposition

1 code implementation • 6 Feb 2021 • Deepak Maurya, Balaraman Ravindran

This is further used to propose a hyperedge prediction algorithm.

Hyperedge Prediction

Paper
Code

qRRT: Quality-Biased Incremental RRT for Optimal Motion Planning in Non-Holonomic Systems

no code implementations • 7 Jan 2021 • Nahas Pareekutty, Francis James, Balaraman Ravindran, Suril V. Shah

This paper presents a sampling-based method for optimal motion planning in non-holonomic systems in the absence of known cost functions.

Motion Planning Optimal Motion Planning +2

Paper
Add Code

Neural Fitted Q Iteration based Optimal Bidding Strategy in Real Time Reactive Power Market_1

no code implementations • 7 Jan 2021 • Jahnvi Patel, Devika Jay, Balaraman Ravindran, K. Shanti Swarup

In this paper, a pioneer work on learning optimal bidding strategies from observation and experience in a three-stage reactive power market is reported.

Stochastic Optimization

Paper
Add Code

Reinforcement Learning for Unified Allocation and Patrolling in Signaling Games with Uncertainty

no code implementations • 18 Dec 2020 • Aravind Venugopal, Elizabeth Bondi, Harshavardhan Kamarthi, Keval Dholakia, Balaraman Ravindran, Milind Tambe

We therefore first propose a novel GSG model that combines defender allocation, patrolling, real-time drone notification to human patrollers, and drones sending warning signals to attackers.

Decision Making Multiagent Systems

Paper
Add Code

Relational Boosted Bandits

1 code implementation • 16 Dec 2020 • Ashutosh Kakadiya, Sriraam Natarajan, Balaraman Ravindran

Contextual bandits algorithms have become essential in real-world user interaction problems in recent years.

Attribute Descriptive +3

Paper
Code

Hypergraph Partitioning using Tensor Eigenvalue Decomposition

no code implementations • 16 Nov 2020 • Deepak Maurya, Balaraman Ravindran

We also show improvement for the min-cut solution on 2-uniform hypergraphs (graphs) over the standard spectral partitioning algorithm.

graph partitioning hypergraph partitioning

Paper
Add Code

Goal directed molecule generation using Monte Carlo Tree Search

no code implementations • 30 Oct 2020 • Anand A. Rajasekar, Karthik Raman, Balaraman Ravindran

One challenging and essential task in biochemistry is the generation of novel molecules with desired properties.

Navigate

Paper
Add Code

MADRaS : Multi Agent Driving Simulator

no code implementations • 2 Oct 2020 • Anirban Santara, Sohan Rudra, Sree Aditya Buridi, Meha Kaushik, Abhishek Naik, Bharat Kaul, Balaraman Ravindran

In this work, we present MADRaS, an open-source multi-agent driving simulator for use in the design and evaluation of motion planning algorithms for autonomous driving.

Autonomous Driving Car Racing +5

Paper
Add Code

Reinforcement Learning for Improving Object Detection

no code implementations • 18 Aug 2020 • Siddharth Nayak, Balaraman Ravindran

The performance of a trained object detection neural network depends a lot on the image quality.

Object object-detection +3

Paper
Add Code

A Causal Linear Model to Quantify Edge Flow and Edge Unfairness for UnfairEdge Prioritization and Discrimination Removal

no code implementations • 10 Jul 2020 • Pavan Ravishankar, Pranshu Malviya, Balaraman Ravindran

Unlike previous works that only make cautionary claims of discrimination and de-biases data after its generation, this paper attempts to prioritize unfair sources before mitigating their unfairness in the real-world.

Paper
Add Code

Missed calls, Automated Calls and Health Support: Using AI to improve maternal health outcomes by increasing program engagement

no code implementations • 13 Jun 2020 • Siddharth Nishtala, Harshavardhan Kamarthi, Divy Thakkar, Dhyanesh Narayanan, Anirudh Grama, Aparna Hegde, Ramesh Padmanabhan, Neha Madhiwalla, Suresh Chaudhary, Balaraman Ravindran, Milind Tambe

India accounts for 11% of maternal deaths globally where a woman dies in childbirth every fifteen minutes.

Paper
Add Code

Reinforcement Learning for Multi-Product Multi-Node Inventory Management in Supply Chains

1 code implementation • 7 Jun 2020 • Nazneen N Sultana, Hardik Meisheri, Vinita Baniwal, Somjit Nath, Balaraman Ravindran, Harshad Khadilkar

This paper describes the application of reinforcement learning (RL) to multi-product inventory management in supply chains.

Decision Making Hierarchical Reinforcement Learning +3

Paper
Code

On Incorporating Structural Information to improve Dialogue Response Generation

1 code implementation • WS 2020 • Nikita Moghe, Priyesh Vijayan, Balaraman Ravindran, Mitesh M. Khapra

This requires capturing structural, sequential and semantic information from the conversation context and the background resources.

Response Generation

Paper
Code

Understanding Dynamic Scenes using Graph Convolution Networks

1 code implementation • 9 May 2020 • Sravan Mylavarapu, Mahtab Sandhu, Priyesh Vijayan, K. Madhava Krishna, Balaraman Ravindran, Anoop Namboodiri

We present a novel Multi-Relational Graph Convolutional Network (MRGCN) based framework to model on-road vehicle behaviors from a sequence of temporally ordered frames as grabbed by a moving monocular camera.

Ranked #1 on Test results on KITTI

Motion Segmentation Semantic Segmentation +1

Paper
Code

Towards Transparent and Explainable Attention Models

2 code implementations • ACL 2020 • Akash Kumar Mohankumar, Preksha Nema, Sharan Narasimhan, Mitesh M. Khapra, Balaji Vasan Srinivasan, Balaraman Ravindran

To make attention mechanisms more faithful and plausible, we propose a modified LSTM cell with a diversity-driven training objective that ensures that the hidden representations learned at different time steps are diverse.

Attribute

Paper
Code

EMPIR: Ensembles of Mixed Precision Deep Networks for Increased Robustness against Adversarial Attacks

1 code implementation • ICLR 2020 • Sanchari Sen, Balaraman Ravindran, Anand Raghunathan

Our results indicate that EMPIR boosts the average adversarial accuracies by 42. 6%, 15. 2% and 10. 5% for the DNN models trained on the MNIST, CIFAR-10 and ImageNet datasets respectively, when compared to single full-precision models, without sacrificing accuracy on the unperturbed inputs.

Self-Driving Cars

Paper
Code

A Unified Non-Negative Matrix Factorization Framework for Semi-Supervised Learning on Graphs

1 code implementation • Proceedings of the 2020 SIAM International Conference on Data Mining 2020 • Anasua Mitra, Priyesh Vijayan, Srinivasan Parthasarathy, Balaraman Ravindran

We propose a Semi-Supervised Learning (SSL) methodology that explicitly encodes different necessary priors to learn efficient representations for nodes in a network.

Node Classification

Paper
Code

Towards Accurate Vehicle Behaviour Classification With Multi-Relational Graph Convolutional Networks

1 code implementation • 3 Feb 2020 • Sravan Mylavarapu, Mahtab Sandhu, Priyesh Vijayan, K. Madhava Krishna, Balaraman Ravindran, Anoop Namboodiri

Understanding on-road vehicle behaviour from a temporal sequence of sensor data is gaining in popularity.

General Classification Optical Flow Estimation

Paper
Code

SEERL: Sample Efficient Ensemble Reinforcement Learning

no code implementations • 15 Jan 2020 • Rohan Saphal, Balaraman Ravindran, Dheevatsa Mudigere, Sasikanth Avancha, Bharat Kaul

However, ensemble methods are relatively less popular in reinforcement learning owing to the high sample complexity and computational expense involved in obtaining a diverse ensemble.

Continuous Control Ensemble Learning +3

Paper
Add Code

Reinforcement Learning for Multi-Objective Optimization of Online Decisions in High-Dimensional Systems

no code implementations • 1 Oct 2019 • Hardik Meisheri, Vinita Baniwal, Nazneen N Sultana, Balaraman Ravindran, Harshad Khadilkar

This paper describes a purely data-driven solution to a class of sequential decision-making problems with a large number of concurrent online decisions, with applications to computing systems and operations research.

Decision Making Management +2

Paper
Add Code

Option Encoder: A Framework for Discovering a Policy Basis in Reinforcement Learning

no code implementations • 9 Sep 2019 • Arjun Manoharan, Rahul Ramesh, Balaraman Ravindran

Option discovery and skill acquisition frameworks are integral to the functioning of a Hierarchically organized Reinforcement learning agent.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Let's Ask Again: Refine Network for Automatic Question Generation

1 code implementation • IJCNLP 2019 • Preksha Nema, Akash Kumar Mohankumar, Mitesh M. Khapra, Balaji Vasan Srinivasan, Balaraman Ravindran

It is desired that the generated question should be (i) grammatically correct (ii) answerable from the passage and (iii) specific to the given answer.

Question Generation Question-Generation

Paper
Code

Influence maximization in unknown social networks: Learning Policies for Effective Graph Sampling

1 code implementation • 8 Jul 2019 • Harshavardhan Kamarthi, Priyesh Vijayan, Bryan Wilder, Balaraman Ravindran, Milind Tambe

A serious challenge when finding influential actors in real-world social networks is the lack of knowledge about the structure of the underlying network.

Graph Sampling

Paper
Code

ExTra: Transfer-guided Exploration

no code implementations • 27 Jun 2019 • Anirban Santara, Rishabh Madan, Balaraman Ravindran, Pabitra Mitra

Given an optimal policy in a related task-environment, we show that its bisimulation distance from the current task-environment gives a lower bound on the optimal advantage of state-action pairs in the current task-environment.

Paper
Add Code

Learning Interpretable Models Using an Oracle

no code implementations • 17 Jun 2019 • Abhishek Ghose, Balaraman Ravindran

Our work addresses this by: (a) showing that learning a training distribution (often different from the test distribution) can often increase accuracy of small models, and therefore may be used as a strategy to compensate for small sizes, and (b) providing a model-agnostic algorithm to learn such training distributions.

Sentence Embedding text-classification +1

Paper
Add Code

MaMiC: Macro and Micro Curriculum for Robotic Reinforcement Learning

no code implementations • 17 May 2019 • Manan Tomar, Akhil Sathuluri, Balaraman Ravindran

Shaping in humans and animals has been shown to be a powerful tool for learning complex tasks as compared to learning in a randomized fashion.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Successor Options: An Option Discovery Framework for Reinforcement Learning

1 code implementation • 14 May 2019 • Rahul Ramesh, Manan Tomar, Balaraman Ravindran

This work adopts a complementary approach, where we attempt to discover options that navigate to landmark states.

Navigate reinforcement-learning +1

Paper
Code

Interpretability with Accurate Small Models

no code implementations • 4 May 2019 • Abhishek Ghose, Balaraman Ravindran

Our technique identifies the training data distribution to learn from that leads to the highest accuracy for a model of a given size.

Bayesian Optimization

Paper
Add Code

Network Representation Learning: Consolidation and Renewed Bearing

1 code implementation • 2 May 2019 • Saket Gurukar, Priyesh Vijayan, Aakash Srinivasan, Goonmeet Bajaj, Chen Cai, Moniba Keymanesh, Saravana Kumar, Pranav Maneriker, Anasua Mitra, Vedang Patel, Balaraman Ravindran, Srinivasan Parthasarathy

An important area of research that has emerged over the last decade is the use of graphs as a vehicle for non-linear dimensionality reduction in a manner akin to previous efforts based on manifold learning with uses for downstream database processing, machine learning and visualization.

Dimensionality Reduction General Classification +3

Paper
Code

Polyphonic Music Composition with LSTM Neural Networks and Reinforcement Learning

no code implementations • 5 Feb 2019 • Harish Kumar, Balaraman Ravindran

On top of our LSTM neural network that learnt musical sequences in this representation, we built an RL agent that learnt to find combinations of songs whose joint dominance produced pleasant compositions.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

An Active Learning Framework for Efficient Robust Policy Search

no code implementations • 1 Jan 2019 • Sai Kiran Narayanaswami, Nandan Sudarsanam, Balaraman Ravindran

Robust Policy Search is the problem of learning policies that do not degrade in performance when subject to unseen environment model parameters.

Active Learning Continuous Control +1

Paper
Add Code

Hypergraph Clustering: A Modularity Maximization Approach

no code implementations • 28 Dec 2018 • Tarun Kumar, Sankaran Vaidyanathan, Harini Ananthapadmanabhan, Srinivasan Parthasarathy, Balaraman Ravindran

Clustering on hypergraphs has been garnering increased attention with potential applications in network analysis, VLSI design and computer vision, among others.

Clustering

Paper
Add Code

Studying the Plasticity in Deep Convolutional Neural Networks using Random Pruning

1 code implementation • 26 Dec 2018 • Deepak Mittal, Shweta Bhardwaj, Mitesh M. Khapra, Balaraman Ravindran

In this work, we report experiments which suggest that the comparable performance of the pruned network is not due to the specific criterion chosen but due to the inherent plasticity of deep neural networks which allows them to recover from the loss of pruned filters once the rest of the filters are fine-tuned.

Image Classification Image Segmentation +3

152

Paper
Code

Learning to Prevent Monocular SLAM Failure using Reinforcement Learning

no code implementations • 23 Dec 2018 • Vignesh Prasad, Karmesh Yadav, Rohitashva Singh Saurabh, Swapnil Daga, Nahas Pareekutty, K. Madhava Krishna, Balaraman Ravindran, Brojeshwar Bhowmick

Monocular SLAM refers to using a single camera to estimate robot ego motion while building a map of the environment.

Robotics

Paper
Add Code

Discovering hierarchies using Imitation Learning from hierarchy aware policies

no code implementations • 1 Dec 2018 • Ameet Deshpande, Harshavardhan Kamarthi, Balaraman Ravindran

Learning options that allow agents to exhibit temporally higher order behavior has proven to be useful in increasing exploration, reducing sample complexity and for various transfer scenarios.

Imitation Learning

Paper
Add Code

Successor Options : An Option Discovery Algorithm for Reinforcement Learning

no code implementations • 27 Sep 2018 • Manan Tomar*, Rahul Ramesh*, Balaraman Ravindran

Additionally, we describe an Incremental Successor options model that iteratively builds options and explores in environments where exploration through primitive actions is inadequate to form the Successor Representations.

Hierarchical Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Improvements on Hindsight Learning

no code implementations • 16 Sep 2018 • Ameet Deshpande, Srikanth Sarma, Ashutosh Jha, Balaraman Ravindran

One such approach is Hindsight Experience replay which uses an off-policy Reinforcement Learning algorithm to learn a goal conditioned policy.

Policy Gradient Methods reinforcement-learning +1

Paper
Add Code

Pack and Detect: Fast Object Detection in Videos Using Region-of-Interest Packing

no code implementations • 5 Sep 2018 • Athindran Ramesh Kumar, Balaraman Ravindran, Anand Raghunathan

Based on these observations, we propose Pack and Detect (PaD), an approach to reduce the computational requirements of object detection in videos.

Object object-detection +4

Paper
Add Code

Fusion Graph Convolutional Networks

1 code implementation • 31 May 2018 • Priyesh Vijayan, Yash Chandak, Mitesh M. Khapra, Srinivasan Parthasarathy, Balaraman Ravindran

State-of-the-art models for node classification on such attributed graphs use differentiable recursive functions that enable aggregation and filtering of neighborhood information from multiple hops.

General Classification Node Classification

Paper
Code

HOPF: Higher Order Propagation Framework for Deep Collective Classification

1 code implementation • 31 May 2018 • Priyesh Vijayan, Yash Chandak, Mitesh M. Khapra, Srinivasan Parthasarathy, Balaraman Ravindran

Given a graph where every node has certain attributes associated with it and some nodes have labels associated with them, Collective Classification (CC) is the task of assigning labels to every unlabeled node using information from the node as well as its neighbors.

Attribute Classification +1

Paper
Code

Language Expansion In Text-Based Games

no code implementations • 17 May 2018 • Ghulam Ahmed Ansari, Sagar J P, Sarath Chandar, Balaraman Ravindran

Text-based games are suitable test-beds for designing agents that can learn by interaction with the environment in the form of natural language text.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

DiGrad: Multi-Task Reinforcement Learning with Shared Actions

no code implementations • 27 Feb 2018 • Parijat Dewangan, S Phaniteja, K. Madhava Krishna, Abhishek Sarkar, Balaraman Ravindran

In this paper, we propose a new approach for simultaneous training of multiple tasks sharing a set of common actions in continuous action spaces, which we call as DiGrad (Differential Policy Gradient).

Multi-Task Learning reinforcement-learning +1

Paper
Add Code

Recovering from Random Pruning: On the Plasticity of Deep Convolutional Neural Networks

1 code implementation • 31 Jan 2018 • Deepak Mittal, Shweta Bhardwaj, Mitesh M. Khapra, Balaraman Ravindran

Image Classification object-detection +1

Paper
Code

Rate of Change Analysis for Interestingness Measures

no code implementations • 14 Dec 2017 • Nandan Sudarsanam, Nishanth Kumar, Abhishek Sharma, Balaraman Ravindran

We present a comprehensive analysis of 50 interestingness measures and classify them in accordance with the two properties.

General Classification

Paper
Add Code

Efficient-UCBV: An Almost Optimal Algorithm using Variance Estimates

no code implementations • 9 Nov 2017 • Subhojyoti Mukherjee, K. P. Naveen, Nandan Sudarsanam, Balaraman Ravindran

We propose a novel variant of the UCB algorithm (referred to as Efficient-UCB-Variance (EUCBV)) for minimizing cumulative regret in the stochastic multi-armed bandit (MAB) setting.

Thompson Sampling

Paper
Add Code

Shared Learning : Enhancing Reinforcement in $Q$-Ensembles

no code implementations • 14 Sep 2017 • Rakesh R. Menon, Balaraman Ravindran

Deep Reinforcement Learning has been able to achieve amazing successes in a variety of domains from video games to continuous control by trying to maximize the cumulative reward.

Atari Games Continuous Control +3

Paper
Add Code

RAIL: Risk-Averse Imitation Learning

1 code implementation • 20 Jul 2017 • Anirban Santara, Abhishek Naik, Balaraman Ravindran, Dipankar Das, Dheevatsa Mudigere, Sasikanth Avancha, Bharat Kaul

Generative Adversarial Imitation Learning (GAIL) is a state-of-the-art algorithm for learning policies when the expert's behavior is available as a fixed set of trajectories.

Autonomous Driving Continuous Control +1

Paper
Code

Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep Reinforcement Learning

no code implementations • ICLR 2018 • Sahil Sharma, Girish Raguvir J, Srivatsan Ramesh, Balaraman Ravindran

Our second major contribution is that we propose a generalization of lambda-returns called Confidence-based Autodidactic Returns (CAR), wherein the RL agent learns the weighting of the n-step returns in an end-to-end manner.

Benchmarking Decision Making +2

Paper
Add Code

Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning

no code implementations • 20 May 2017 • Sahil Sharma, Aravind Suresh, Rahul Ramesh, Balaraman Ravindran

Deep Reinforcement Learning (DRL) methods have performed well in an increasing numbering of high-dimensional visual decision making domains.

Decision Making Q-Learning +2

Paper
Add Code

Diversity driven Attention Model for Query-based Abstractive Summarization

2 code implementations • ACL 2017 • Preksha Nema, Mitesh Khapra, Anirban Laha, Balaraman Ravindran

Abstractive summarization aims to generate a shorter version of the document covering all the salient points in a compact and coherent fashion.

Ranked #2 on Query-Based Extractive Summarization on Debatepedia

Abstractive Text Summarization Extractive Summarization +3

Paper
Code

Thresholding Bandits with Augmented UCB

no code implementations • 7 Apr 2017 • Subhojyoti Mukherjee, K. P. Naveen, Nandan Sudarsanam, Balaraman Ravindran

In this paper we propose the Augmented-UCB (AugUCB) algorithm for a fixed-budget version of the thresholding bandit problem (TBP), where the objective is to identify a set of arms whose quality is above a threshold.

Paper
Add Code

DyVEDeep: Dynamic Variable Effort Deep Neural Networks

no code implementations • 4 Apr 2017 • Sanjay Ganapathy, Swagath Venkataramani, Balaraman Ravindran, Anand Raghunathan

Complementary to these approaches, DyVEDeep is a dynamic approach that exploits the heterogeneity in the inputs to DNNs to improve their compute efficiency with comparable classification accuracy.

Paper
Add Code

Learning to Repeat: Fine Grained Action Repetition for Deep Reinforcement Learning

no code implementations • 20 Feb 2017 • Sahil Sharma, Aravind Srinivas, Balaraman Ravindran

Reinforcement Learning algorithms can learn complex behavioral patterns for sequential decision making tasks wherein an agent interacts with an environment and acquires feedback in the form of rewards sampled from it.

Car Racing Decision Making +2

Paper
Add Code

Learning to Multi-Task by Active Sampling

1 code implementation • ICLR 2018 • Sahil Sharma, Ashutosh Jha, Parikshit Hegde, Balaraman Ravindran

In this work, we propose an efficient multi-task learning framework which solves multiple goal-directed tasks in an on-line setup without the need for expert supervision.

Active Learning Meta-Learning +1

Paper
Code

Exploration for Multi-task Reinforcement Learning with Deep Generative Models

no code implementations • 29 Nov 2016 • Sai Praveen Bangaru, JS Suhas, Balaraman Ravindran

Exploration in multi-task reinforcement learning is critical in training agents to deduce the underlying MDP.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

EPOpt: Learning Robust Neural Network Policies Using Model Ensembles

no code implementations • 5 Oct 2016 • Aravind Rajeswaran, Sarvjeet Ghotra, Balaraman Ravindran, Sergey Levine

Sample complexity and safety are major challenges when learning policies with reinforcement learning for real-world tasks, especially when the policies are represented using rich function approximators like deep neural networks.

Domain Adaptation

Paper
Add Code

Dynamic Frame skip Deep Q Network

no code implementations • 17 May 2016 • Aravind Srinivas, Sahil Sharma, Balaraman Ravindran

Deep Reinforcement Learning methods have achieved state of the art performance in learning control policies for the games in the Atari 2600 domain.

Atari Games

Paper
Add Code

Option Discovery in Hierarchical Reinforcement Learning using Spatio-Temporal Clustering

no code implementations • 17 May 2016 • Aravind Srinivas, Ramnandan Krishnamurthy, Peeyush Kumar, Balaraman Ravindran

This paper introduces an automated skill acquisition framework in reinforcement learning which involves identifying a hierarchical description of the given task in terms of abstract states and extended actions between abstract states.

Clustering Hierarchical Reinforcement Learning +3

Paper
Add Code

Linear Bandit algorithms using the Bootstrap

no code implementations • 4 May 2016 • Nandan Sudarsanam, Balaraman Ravindran

One of the proposed methods, X-Random bootstrap, performs better than the baselines in-terms of cumulative regret across various degrees of noise and different number of trials.

Thompson Sampling

Paper
Add Code

Bridge Correlational Neural Networks for Multilingual Multimodal Representation Learning

1 code implementation • NAACL 2016 • Janarthanan Rajendran, Mitesh M. Khapra, Sarath Chandar, Balaraman Ravindran

In this work, we address a real-world scenario where no direct parallel data is available between two views of interest (say, $V_1$ and $V_2$) but parallel data is available between each of these views and a pivot view ($V_3$).

Document Classification Representation Learning +2

Paper
Code

Attend, Adapt and Transfer: Attentive Deep Architecture for Adaptive Transfer from multiple sources in the same domain

2 code implementations • 10 Oct 2015 • Janarthanan Rajendran, Aravind Srinivas, Mitesh M. Khapra, P. Prasanna, Balaraman Ravindran

Second, the agent should be able to selectively transfer, which is the ability to select and transfer from different and multiple source tasks for different parts of the state space of the target task.

Paper
Code

TSEB: More Efficient Thompson Sampling for Policy Learning

no code implementations • 10 Oct 2015 • P. Prasanna, Sarath Chandar, Balaraman Ravindran

In this paper, we propose TSEB, a Thompson Sampling based algorithm with adaptive exploration bonus that aims to solve the problem with tighter PAC guarantees, while being cautious on the regret as well.

Thompson Sampling

Paper
Add Code

A Reinforcement Learning Approach to Online Learning of Decision Trees

no code implementations • 24 Jul 2015 • Abhinav Garlapati, aditi raghunathan, Vaishnavh Nagarajan, Balaraman Ravindran

Online decision tree learning algorithms typically examine all features of a new data point to update model parameters.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Correlational Neural Networks

2 code implementations • 27 Apr 2015 • Sarath Chandar, Mitesh M. Khapra, Hugo Larochelle, Balaraman Ravindran

CCA based approaches learn a joint representation by maximizing correlation of the views when projected to the common subspace.

Representation Learning Transfer Learning

Paper
Code

An Autoencoder Approach to Learning Bilingual Word Representations

no code implementations • NeurIPS 2014 • Sarath Chandar A P, Stanislas Lauly, Hugo Larochelle, Mitesh M. Khapra, Balaraman Ravindran, Vikas Raykar, Amrita Saha

Cross-language learning allows us to use training data from one language to build models for a different language.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.