Home  |  Browse  |  Authors  |  Advanced Search  |  Help
Login | Create Account
Gertheiss, Jan and Tutz, Gerhard (19. June 2008): Feature Selection and Weighting by Nearest Neighbor Ensembles. Department of Statistics: Technical Reports, No.33

Metadaten exportieren

Autor(en) recherchieren

Lesezeichen anlegen

[img]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Reader
2488Kb

Official URL: http://dx.doi.org/10.1016/j.chemolab.2009.07.004

Abstract

In the field of statistical discrimination nearest neighbor methods are a well known, quite simple but successful nonparametric classification tool. In higher dimensions, however, predictive power normally deteriorates. In general, if some covariates are assumed to be noise variables, variable selection is a promising approach. The paper’s main focus is on the development and evaluation of a nearest neighbor ensemble with implicit variable selection. In contrast to other nearest neighbor approaches we are not primarily interested in classification, but in estimating the (posterior) class probabilities. In simulation studies and for real world data the proposed nearest neighbor ensemble is compared to an extended forward/backward variable selection procedure for nearest neighbor classifiers, and some alternative well established classification tools (that offer probability estimates as well). Despite its simple structure, the proposed method’s performance is quite good - especially if relevant covariates can be separated from noise variables. Another advantage of the presented ensemble is the easy identification of interactions that are usually hard to detect. So not simply variable selection but rather some kind of feature selection is performed. The paper is a preprint of an article published in Chemometrics and Intelligent Laboratory Systems. Please use the journal version for citation.

Item Type:Paper (Technical Report)
Published in:Chemometrics and Intelligent Laboratory Systems, Vol. 99, 2009: pp. 30-38.
Keywords:Nearest Neighbor Methods, Variable Selection, Ensemble Methods, Classification
Subjects:Mathematics, Computer Science and Statistics > Statistics > Technical Reports
URN:urn:nbn:de:bvb:19-epub-4479-4
Language:English
ID Code:4479
Deposited On:19. Jun 2008 15:36
Last Modified:29. Sep 2010 10:23
Open Access LMU is powered by EPrints 3 which is developed by the School of Electronics and Computer Science at the University of Southampton. More information and software creditsAbout