Search Results

Learning Robust Options by Conditional Value at Risk Optimization

1 code implementation NeurIPS 2019

While there are several methods to learn options that are robust against the uncertainty of model parameters, these methods only consider either the worst case or the average (ordinary) case for learning options.

Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk

1 code implementation9 Jun 2022

Though deep reinforcement learning (DRL) has obtained substantial success, it may encounter catastrophic failures due to the intrinsic uncertainty of both transition and observation.

Continuous Control reinforcement-learning +2

RiskQ: Risk-sensitive Multi-Agent Reinforcement Learning Value Factorization

1 code implementation NeurIPS 2023

Multi-agent systems are characterized by environmental uncertainty, varying policies of agents, and partial observability, which result in significant risks.

Multi-agent Reinforcement Learning reinforcement-learning

Optimizing Conditional Value-At-Risk of Black-Box Functions

1 code implementation NeurIPS 2021

This paper presents two Bayesian optimization (BO) algorithms with theoretical performance guarantee to maximize the conditional value-at-risk (CVaR) of a black-box function: CV-UCB and CV-TS which are based on the well-established principle of optimism in the face of uncertainty and Thompson sampling, respectively.

Bayesian Optimization Thompson Sampling

Value of Information Analysis for External Validation of Risk Prediction Models

1 code implementation5 Aug 2022

Methods: We define the Expected Value of Perfect Information (EVPI) for model validation as the expected loss in NB due to not confidently knowing which of the alternative decisions confers the highest NB at a given risk threshold.

Applications

CARNA: Characterizing Advanced heart failure Risk and hemodyNAmic phenotypes using learned multi-valued decision diagrams

1 code implementation11 Jun 2023

To address these limitations, this paper presents CARNA, a hemodynamic risk stratification and phenotyping framework for advanced HF that takes advantage of the explainability and expressivity of machine learned Multi-Valued Decision Diagrams (MVDDs).

Decision Making Descriptive

Portfolio Construction with Gaussian Mixture Returns and Exponential Utility via Convex Optimization

1 code implementation9 May 2022

We consider the problem of choosing an optimal portfolio, assuming the asset returns have a Gaussian mixture (GM) distribution, with the objective of maximizing expected exponential utility.

Optimization and Control Portfolio Management

Random Erasing Data Augmentation

18 code implementations16 Aug 2017

In this paper, we introduce Random Erasing, a new data augmentation method for training the convolutional neural network (CNN).

General Classification Image Augmentation +4

A Brief Overview of AI Governance for Responsible Machine Learning Systems

1 code implementation21 Nov 2022

Organizations of all sizes, across all industries and domains are leveraging artificial intelligence (AI) technologies to solve some of their biggest challenges around operations, customer experience, and much more.

Bayes Risk Transducer: Transducer with Controllable Alignment Prediction

1 code implementation19 Aug 2023

While the vanilla transducer does not have a prior preference for any of the valid paths, this work intends to enforce the preferred paths and achieve controllable alignment prediction.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2