Similar Search Results

Variable selection for discrimination of more than two classes where data are sparse

Szepannek, Gero; Weihs, Claus - Institut für Wirtschafts- und Sozialstatistik, … - 2005

In classification, with an increasing number of variables, the required number of observations grows drastically. In this paper we present an approach to put into effect the maximal possible variable selection, by splitting a K class classification problem into pairwise problems. The principle...

Persistent link: https://www.econbiz.de/10009219827

Non-parametric vertical box control chart for monitoring the mean

Rafajlowicz, Ewaryst; Pawlak, Mirosław; Steland, Ansgar - Institut für Wirtschafts- und Sozialstatistik, … - 2004

A new class of non-parametric control charts for detecting the change in the process mean is examined. The method, called a Vertical Box Control Chart (V-Box Chart), offers a simple and quick detection of the mean change in an observed process. No parametric assumption on the distribution...

Persistent link: https://www.econbiz.de/10009219828

Testing large-dimensional correlation

Arnold, Matthias; Weißbach, Rafael - Institut für Wirtschafts- und Sozialstatistik, … - 2007

This paper introduces a test for zero correlation in situations where the correlation matrix is large compared to the sample size. The test statistic is the sum of the squared correlation coefficients in the sample. We derive its limiting null distribution as the number of variables as well as...

Persistent link: https://www.econbiz.de/10009219829

Application of a Genetic Algorithm to Variable Selection in Fuzzy Clustering

Röver, Christian; Szepannek, Gero - Institut für Wirtschafts- und Sozialstatistik, … - 2004

In order to group the observations of a data set into a given number of clusters, an "optimal" subset out of a greater number of explanatory variables is to be selected. The problem is approached by maximizing a quality measure under certain restrictions that are supposed to keep the subset most...

Persistent link: https://www.econbiz.de/10009219830

Robust Learning from Bites for Data Mining

Christmann, Andreas; Steinwart, Ingo; Hubert, Mia - Institut für Wirtschafts- und Sozialstatistik, … - 2006

Some methods from statistical machine learning and from robust statistics have two drawbacks. Firstly, they are computer-intensive such that they can hardly be used for massive data sets, say with millions of data points. Secondly, robust and non-parametric confidence intervals for the...

Persistent link: https://www.econbiz.de/10009219831

Estimating a bivariate density when there are extra data on one or both components

Hall, Peter; Neumeyer, Natalie - Institut für Wirtschafts- und Sozialstatistik, … - 2005

Assume we have a dataset, Z say, from the joint distribution of random variables X and Y, and two further, independent datasets, X and Y, from the marginal distributions of X and Y, respectively. We wish to combine X, Y and Z, so as to construct an estimator of the joint density. This problem...

Persistent link: https://www.econbiz.de/10009219832

The Two-Sample Problem with Regression Errors : An Empirical Process Approach

Mora, Juan; Neumeyer, Natalie - Institut für Wirtschafts- und Sozialstatistik, … - 2005

We describe how to test the null hypothesis that errors from two parametrically specified regression models have the same distribution versus a general alternative. First we obtain the asymptotic properties of test statistics derived from the difference between the two residual-based empirical...

Persistent link: https://www.econbiz.de/10009219833

On rank tests for shift detection in time series

Fried, Roland; Gather, Ursula - Institut für Wirtschafts- und Sozialstatistik, … - 2006

Robustified rank tests, applying a robust scale estimator, are investigated for reliable and fast shift detection in time series. The tests show good power for sufficiently large shifts, low false detection rates for Gaussian noise and high robustness against outliers. Wilcoxon scores in...

Persistent link: https://www.econbiz.de/10009219834

Bootstrapping frequency domain tests in multivariate time series with an application to comparing spectral densities

Dette, Holger; Paparoditis, Efstathios - Institut für Wirtschafts- und Sozialstatistik, … - 2008

We propose a general bootstrap procedure to approximate the null distribution of nonparametric frequency domain tests about the spectral density matrix of a multivariate time series. Under a set of easy to verify conditions, we establish asymptotic validity of the proposed bootstrap procedure....

Persistent link: https://www.econbiz.de/10009219835

Selinski, Silvia - Institut für Wirtschafts- und Sozialstatistik, … - 2006

The issue of suitable similarity measures for a joint consideration of so-called SNP data and epidemiological variables arises from the GENICA (Interdisciplinary Study Group on Gene Environment Interaction and Breast Cancer in Germany) case-control study of sporadic breast cancer. The GENICA...

Persistent link: https://www.econbiz.de/10009219836