
| Krause, Rüdiger and Tutz, Gerhard (2004): Variable selection and discrimination in gene expression data by genetic algorithms. Collaborative Research Center 386, Discussion Paper 390 |
| PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Reader 392Kb |
Gene expression datasets usually have thousends of explanatory variables which are observed on only few samples. Generally most variables of a dataset have no effect and one is interested in eliminating these irrelevant variables. In order to obtain a subset of relevant variables an appropriate selection procedure is necessary. In this paper we propose the selection of variables by use of genetic algorithms with the logistic regression as underlying modelling procedure. The selection procedure aims at minimizing information criteria like AIC or BIC. It is demonstrated that selection of variables by genetic algorithms yields models which compete well with the best available classification procedures in terms of test misclassification error.
| Item Type: | Paper (Research Paper) |
|---|---|
| Subjects: | Mathematics, Computer Science and Statistics Mathematics, Computer Science and Statistics > Statistics Mathematics, Computer Science and Statistics > Statistics > Collaborative Research Center 386 |
| Dewey Classification: | 600 Natural sciences and mathematics 600 Natural sciences and mathematics > 510 Mathematics |
| URN: | urn:nbn:de:bvb:19-epub-1760-6 |
| ID Code: | 1760 |
| Deposited On: | 10. Apr 2007 |
| Last Modified: | 28. Jun 2010 14:35 |