Search Results for author: Jörg Rahnenführer

Found 8 papers, 3 papers with code

Employing an Adjusted Stability Measure for Multi-Criteria Model Fitting on Data Sets with Similar Features

no code implementations15 Jun 2021 Andrea Bommert, Jörg Rahnenführer, Michel Lang

We propose the approach of tuning the hyperparameters of a predictive model in a multi-criteria fashion with respect to predictive accuracy and feature selection stability.

feature selection

Adjusted Measures for Feature Selection Stability for Data Sets with Similar Features

1 code implementation25 Sep 2020 Andrea Bommert, Jörg Rahnenführer

For data sets with similar features, for example highly correlated features, most existing stability measures behave in an undesired way: They consider features that are almost identical but have different identifiers as different features.

feature selection

Feature Selection Methods for Cost-Constrained Classification in Random Forests

no code implementations14 Aug 2020 Rudolf Jagdhuber, Michel Lang, Jörg Rahnenführer

Cost-sensitive feature selection describes a feature selection problem, where features raise individual costs for inclusion in a model.

Classification feature selection +2

Implications on Feature Detection when using the Benefit-Cost Ratio

no code implementations12 Aug 2020 Rudolf Jagdhuber, Jörg Rahnenführer

In situations with large cost differences and small effect sizes, the BCR missed relevant features and preferred cheap noise features.

feature selection

Improving Reliability of Latent Dirichlet Allocation by Assessing Its Stability Using Clustering Techniques on Replicated Runs

no code implementations14 Feb 2020 Jonas Rieger, Lars Koppers, Carsten Jentsch, Jörg Rahnenführer

Based on the newly proposed measure for LDA stability, we propose a method to increase the reliability and hence to improve the reproducibility of empirical findings based on topic modeling.

Clustering

High Dimensional Restrictive Federated Model Selection with multi-objective Bayesian Optimization over shifted distributions

1 code implementation24 Feb 2019 Xudong Sun, Andrea Bommert, Florian Pfisterer, Jörg Rahnenführer, Michel Lang, Bernd Bischl

To carry out a clinical research under this scenario, an analyst could train a machine learning model only on local data site, but it is still possible to execute a statistical query at a certain cost in the form of sending a machine learning model to some of the remote data sites and get the performance measures as feedback, maybe due to prediction being usually much cheaper.

Bayesian Optimization BIG-bench Machine Learning +2

Cannot find the paper you are looking for? You can Submit a new open access paper.