Data preprocessing and data parsimony in corporate failure forecast models: evidence from Australian materials industry
The present study, based on data for delisted and active corporations in the Australian materials industry, is an attempt to develop a systematic way of selecting corporate failure-related features. We empirically tested the proposed procedure using three datasets. The first dataset contains 82 financial economic factors from the corporation's financial statement. The second dataset comprises 73 relevant financial ratios, which either directly or indirectly measure a corporation's propensity to fail, and are conciliated from the first dataset. The third dataset is a parsimonious dataset obtained from the application of combining a filter and a wrapper to preprocess the first dataset. The robustness of this preprocessed dataset is tested by comparing its performance with the first and second datasets in two statistical (logistic regression and naïve-Bayes) and two machine learning (decision tree, neural network) classes of prediction models. Tests for prediction accuracies and reliabilities, using the computational (ROC curve, AUC) and the statistical (Cochran's "Q" statistic) criteria show that the third dataset outperforms the other two datasets in all four predicting models, achieving various accuracies ranges from 81 per cent to 84 per cent. Copyright The Authors Journal compilation (c) 2006 AFAANZ.
Year of publication: |
2006
|
---|---|
Authors: | Wu, Weiping ; Lee, Vincent Cheng Siong ; Tan, Ting Yean |
Published in: |
Accounting and Finance. - Accounting and Finance Association of Australia and New Zealand - AFAANZ, ISSN 0810-5391. - Vol. 46.2006, 2, p. 327-345
|
Publisher: |
Accounting and Finance Association of Australia and New Zealand - AFAANZ |
Saved in:
freely available
Saved in favorites
Similar items by person
-
Wu, Weiping, (2006)
-
China and the Asian financial crisis : policy options and political role
Lim, Kian-Guan, (1999)
-
Multivariate Data Driven Models for Firm Value Prediction from Small Data Samples
Barnes, Mark B., (2008)
- More ...