Mohajer, Mojgan; Englmeier, Karl-Hans and Schmid, Volker J. (ORCID: https://orcid.org/0000-0003-2195-8130) (1 December 2010): A comparison of Gap statistic definitions with and without logarithm function. Department of Statistics: Technical Reports, No. 96 [PDF, 766kB]

Abstract

The Gap statistic is a standard method for determining the number of clusters in a data set. It standardizes the graph of $\log(W_{k})$, where $W_{k}$ is the within-cluster dispersion, by comparing it to its expectation under an appropriate null reference distribution of the data. We suggest using $W_{k}$ instead of $\log(W_{k})$ and comparing it to the expectation of $W_{k}$ under a null reference distribution. In fact, whenever a number of clusters fulfills the original Gap statistic inequality, it also fulfills the inequality of the Gap statistic based on $W_{k}$, but not vice versa. The two definitions of the Gap function are evaluated on several simulated data sets and on real DCE-MR image data.
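
As an illustration of the two definitions, the following is a minimal sketch and not the authors' implementation. It assumes a plain k-means clustering, a uniform reference distribution over the bounding box of the data, and the usual selection rule of taking the smallest $k$ with $\mathrm{Gap}(k) \ge \mathrm{Gap}(k+1) - s_{k+1}$. For the variant without logarithm, $s_{k}$ is assumed here to be the standard deviation of the reference $W_{k}$ values scaled by $\sqrt{1 + 1/B}$, by analogy with the logarithmic case; the report may define it differently. All function names and parameters (`within_dispersion`, `kmeans_labels`, `gap_curves`, `first_k`, `k_max`, `n_ref`) are hypothetical.

```python
import numpy as np

def within_dispersion(X, labels):
    """W_k: sum of squared distances of each point to its cluster mean."""
    total = 0.0
    for c in np.unique(labels):
        pts = X[labels == c]
        total += ((pts - pts.mean(axis=0)) ** 2).sum()
    return total

def kmeans_labels(X, k, rng, n_iter=50):
    """Basic Lloyd iteration (assumed clustering method, random initial centers)."""
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        d = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(axis=0)
    return labels

def gap_curves(X, k_max=5, n_ref=10, seed=0):
    """Return (gap, s) for the log definition and for the definition without log."""
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)
    lo, hi = X.min(axis=0), X.max(axis=0)
    ks = range(1, k_max + 1)
    W = np.array([within_dispersion(X, kmeans_labels(X, k, rng)) for k in ks])
    W_ref = np.empty((n_ref, k_max))
    for b in range(n_ref):
        # Reference data: uniform over the bounding box of X (an assumption).
        R = rng.uniform(lo, hi, size=X.shape)
        for k in ks:
            W_ref[b, k - 1] = within_dispersion(R, kmeans_labels(R, k, rng))
    scale = np.sqrt(1.0 + 1.0 / n_ref)
    gap_log = np.log(W_ref).mean(axis=0) - np.log(W)
    s_log = np.log(W_ref).std(axis=0) * scale
    gap_raw = W_ref.mean(axis=0) - W      # variant without logarithm
    s_raw = W_ref.std(axis=0) * scale     # assumed analogue of s_k
    return (gap_log, s_log), (gap_raw, s_raw)

def first_k(gap, s):
    """Smallest k (1-based) with gap[k] >= gap[k+1] - s[k+1]; k_max if none."""
    for i in range(len(gap) - 1):
        if gap[i] >= gap[i + 1] - s[i + 1]:
            return i + 1
    return len(gap)
```

Calling `first_k(*log_result)` and `first_k(*raw_result)` on the two tuples returned by `gap_curves` gives the number of clusters selected under each definition, so the two can be compared on the same data.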
