We obtain both state-of-the-art results and anecdotal evidence demonstrating the importance of the value distribution in approximate reinforcement learning.
State-of-the-art result for Atari games on the Atari 2600 game Enduro.
In this paper, we introduce Random Erasing, a new data augmentation method for training convolutional neural networks (CNNs).
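The idea behind Random Erasing can be sketched as follows: with some probability, pick a rectangle of random area and aspect ratio inside the image and overwrite it with random pixel values. This is a minimal NumPy sketch, not the authors' implementation; the parameter names and default ranges (`p`, `area_range`, `aspect_range`) are illustrative assumptions.

```python
import numpy as np

def random_erasing(img, p=0.5, area_range=(0.02, 0.4),
                   aspect_range=(0.3, 3.3), rng=None):
    """Randomly erase one rectangular region of an H x W (x C) uint8 image.

    Illustrative sketch: parameter defaults are assumptions, not the
    paper's exact settings.
    """
    rng = np.random.default_rng() if rng is None else rng
    if rng.random() > p:          # skip augmentation with probability 1 - p
        return img
    h, w = img.shape[:2]
    out = img.copy()
    for _ in range(100):          # retry until a sampled rectangle fits
        target_area = rng.uniform(*area_range) * h * w
        aspect = rng.uniform(*aspect_range)
        eh = int(round(np.sqrt(target_area * aspect)))
        ew = int(round(np.sqrt(target_area / aspect)))
        if 0 < eh < h and 0 < ew < w:
            top = rng.integers(0, h - eh)
            left = rng.integers(0, w - ew)
            # fill the region with random noise
            out[top:top + eh, left:left + ew] = rng.integers(
                0, 256, size=(eh, ew) + img.shape[2:], dtype=img.dtype)
            return out
    return out                    # no valid rectangle found; return unchanged
```

In practice the transform is applied per training example, so each epoch sees a different occlusion pattern for the same image.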
Our schemes require access only to standard risk minimization algorithms (such as classification or least-squares regression) while providing theoretical guarantees on the optimality and fairness of the obtained solutions.
During the 2017 NBA playoffs, Celtics coach Brad Stevens was faced with a difficult decision when defending against the Cavaliers: "Do you double and risk giving up easy shots, or stay at home and do the best you can?"
We show that one cause of this success is that the multi-branch architecture is less non-convex, as measured by its duality gap.
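For readers unfamiliar with the term, one standard definition of the duality gap (an assumption here; the paper may use a more specialized variant) is the difference between the primal optimum and the Lagrangian dual optimum:

\[
\text{gap} \;=\; f^\star - d^\star \;=\; \min_{x} f(x) \;-\; \max_{\lambda \ge 0} \,\min_{x} L(x, \lambda) \;\ge\; 0,
\]

where $L(x,\lambda)$ is the Lagrangian of the constrained problem. For convex problems satisfying a constraint qualification (e.g. Slater's condition) the gap is zero, so a smaller gap can be read as the problem being "closer to convex".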