Showing 1 - 10 of 310
Variable importance measures for random forests have been receiving increased attention as a means of variable selection in many classification tasks in bioinformatics and related scientific fields, for instance to select a subset of genetic markers relevant for the prediction of a certain...
Persistent link: https://www.econbiz.de/10010280795
Binary outcomes that depend on an ordinal predictor in a nonmonotonic way are common in medical data analysis. Such patterns can be addressed in terms of cutpoints: for example, one looks for two cutpoints that define an interval in the range of the ordinal predictor for which the probability of...
Persistent link: https://www.econbiz.de/10003377879
We address the problem of maximally selected chi-square statistics in the case of a binary Y variable and a nominal X variable with several categories. The distribution of the maximally selected chi-square statistic has already been derived when the best cutpoint is chosen from a continuous or...
Persistent link: https://www.econbiz.de/10003135759
Partial Least Squares (PLS) is a highly efficient statistical regression technique that is well suited for the analysis of high-dimensional genomic data. In this paper we review the theory and applications of PLS both under methodological and biological points of view. Focusing on microarray...
Persistent link: https://www.econbiz.de/10003309967
We propose a novel method to model nonlinear regression problems by adapting the principle of penalization to Partial Least Squares (PLS). Starting with a generalized additive model, we expand the additive component of each variable in terms of a generous amount of B-Splines basis functions. In...
Persistent link: https://www.econbiz.de/10003365547
Binary outcomes that depend on an ordinal predictor in a nonmonotonic way are common in medical data analysis. Such patterns can be addressed in terms of cutpoints: for example, one looks for two cutpoints that define an interval in the range of the ordinal predictor for which the probability of...
Persistent link: https://www.econbiz.de/10010266135
The Gini gain is one of the most common variable selection criteria in machine learning. We derive the exact distribution of the maximally selected Gini gain in the context of binary classification using continuous predictors by means of a combinatorial approach. This distribution provides a...
Persistent link: https://www.econbiz.de/10010266219
The Gini gain is one of the most common variable selection criteria in machine learning. We derive the exact distribution of the maximally selected Gini gain in the context of binary classification using continuous predictors by means of a combinatorial approach. This distribution provides a...
Persistent link: https://www.econbiz.de/10003310038
Persistent link: https://www.econbiz.de/10005165617
Persistent link: https://www.econbiz.de/10005172603