Feature selection in the Laplacian support vector machine
Traditional classifiers including support vector machines use only labeled data in training. However, labeled instances are often difficult, costly, or time consuming to obtain while unlabeled instances are relatively easy to collect. The goal of semi-supervised learning is to improve the classification accuracy by using unlabeled data together with a few labeled data in training classifiers. Recently, the Laplacian support vector machine has been proposed as an extension of the support vector machine to semi-supervised learning. The Laplacian support vector machine has drawbacks in its interpretability as the support vector machine has. Also it performs poorly when there are many non-informative features in the training data because the final classifier is expressed as a linear combination of informative as well as non-informative features. We introduce a variant of the Laplacian support vector machine that is capable of feature selection based on functional analysis of variance decomposition. Through synthetic and benchmark data analysis, we illustrate that our method can be a useful tool in semi-supervised learning.
Year of publication: |
2011
|
---|---|
Authors: | Lee, Sangjun ; Park, Changyi ; Koo, Ja-Yong |
Published in: |
Computational Statistics & Data Analysis. - Elsevier, ISSN 0167-9473. - Vol. 55.2011, 1, p. 567-577
|
Publisher: |
Elsevier |
Keywords: | Classification Component selection and smoothing operator Functional ANOVA decomposition Manifold regularization Semi-supervised learning |
Saved in:
Online Resource
Saved in favorites
Similar items by person
-
Stepwise feature selection using generalized logistic loss
Park, Changyi, (2008)
-
Classification of gene functions using support vector machine for time-course gene expression data
Park, Changyi, (2008)
-
Structured multicategory support vector machines with analysis of variance decomposition
Lee, Yoonkyung, (2006)
- More ...