Similar Search Results

Bias in random forest variable importance measures: illustrations, sources and a solution

Strobl, Carolin; Boulesteix, Anne-Laure; Zeileis, Achim; … - 2006

Variable importance measures for random forests have been receiving increased attention as a means of variable selection in many classification tasks in bioinformatics and related scientific fields, for instance to select a subset of genetic markers relevant for the prediction of a certain...

Persistent link: https://www.econbiz.de/10010280795

Maximally selected chi-square statistics and umbrella orderings

Boulesteix, Anne-Laure (contributor); … - 2006

Binary outcomes that depend on an ordinal predictor in a nonmonotonic way are common in medical data analysis. Such patterns can be addressed in terms of cutpoints: for example, one looks for two cutpoints that define an interval in the range of the ordinal predictor for which the probability of...

Persistent link: https://www.econbiz.de/10003377879

Maximally selected chi-square statistics and binary splits of nominal variables

Boulesteix, Anne-Laure (contributor) - 2005

We address the problem of maximally selected chi-square statistics in the case of a binary Y variable and a nominal X variable with several categories. The distribution of the maximally selected chi-square statistic has already been derived when the best cutpoint is chosen from a continuous or...

Persistent link: https://www.econbiz.de/10003135759

Partial least squares: a versatile tool for the analysis of high-dimensional genomic data

Boulesteix, Anne-Laure (contributor); … - 2005

Partial Least Squares (PLS) is a highly efficient statistical regression technique that is well suited for the analysis of high-dimensional genomic data. In this paper we review the theory and applications of PLS from both methodological and biological points of view. Focusing on microarray...

Persistent link: https://www.econbiz.de/10003309967

Penalized partial least squares based on B-Splines transformations

Krämer, Nicole (contributor); … - 2006

We propose a novel method to model nonlinear regression problems by adapting the principle of penalization to Partial Least Squares (PLS). Starting with a generalized additive model, we expand the additive component of each variable in terms of a generous amount of B-Splines basis functions. In...

Persistent link: https://www.econbiz.de/10003365547

Maximally selected chi-square statistics and umbrella orderings

Boulesteix, Anne-Laure; Strobl, Carolin - 2006

Persistent link: https://www.econbiz.de/10010266135

Unbiased split selection for classification trees based on the Gini Index

Strobl, Carolin; Boulesteix, Anne-Laure; Augustin, Thomas - 2005

The Gini gain is one of the most common variable selection criteria in machine learning. We derive the exact distribution of the maximally selected Gini gain in the context of binary classification using continuous predictors by means of a combinatorial approach. This distribution provides a...

Persistent link: https://www.econbiz.de/10010266219

Unbiased split selection for classification trees based on the Gini Index

Strobl, Carolin (contributor); … - 2005

Persistent link: https://www.econbiz.de/10003310038

Maximally selected Chi-squared statistics and non-monotonic associations: An exact approach based on two cutpoints

Boulesteix, Anne-Laure; Strobl, Carolin - In: Computational Statistics & Data Analysis 51 (2007) 12, pp. 6295-6306

Persistent link: https://www.econbiz.de/10005165617

Unbiased split selection for classification trees based on the Gini Index

Strobl, Carolin; Boulesteix, Anne-Laure; Augustin, Thomas - In: Computational Statistics & Data Analysis 52 (2007) 1, pp. 483-501

Persistent link: https://www.econbiz.de/10005172603
