Variance-Based Feature Importance in Neural Networks

16 Oct 2019  ·  Cláudio Rebelo de Sá ·

This paper proposes a new method to measure the relative importance of features in Artificial Neural Networks (ANN) models. Its underlying principle assumes that the more important a feature is, the more the weights, connected to the respective input neuron, will change during the training of the model. To capture this behavior, a running variance of every weight connected to the input layer is measured during training. For that, an adaptation of Welford’s online algorithm for computing the online variance is proposed. When the training is finished, for each input, the variances of the weights are combined with the final weights to obtain the measure of relative importance for each feature. This method was tested with shallow and deep neural network architectures on several well-known classification and regression problems. The results obtained confirm that this approach is making meaningful measurements. Moreover, results showed that the importance scores are highly correlated with the variable importance method from Random Forests (RF).

PDF

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Feature Importance boston Garson Variable Importance Pearson Correlation 0.76 # 1
Feature Importance boston VarImpVIANN Pearson Correlation 0.76 # 1
Feature Importance Breastcancer Garson Variable Importance Pearson Correlation 0.22 # 2
Feature Importance Breastcancer VarImpVIANN Pearson Correlation 0.60 # 1
Feature Importance Diabetes Garson Variable Importance Pearson Correlation 0.64 # 2
Feature Importance Diabetes VarImpVIANN Pearson Correlation 0.86 # 1
Feature Importance Digits VarImpVIANN Pearson Correlation 0.83 # 1
Feature Importance Digits Garson Variable Importance Pearson Correlation 0.60 # 2
Feature Importance iris Garson Variable Importance Pearson Correlation 0.73 # 2
Feature Importance iris VarImpVIANN Pearson Correlation 0.90 # 1
Feature Importance Wine Garson Variable Importance Pearson Correlation 0.74 # 1
Feature Importance Wine VarImpVIANN Pearson Correlation 0.41 # 2

Methods