gap {SAGx}R Documentation

GAP statistic clustering figure of merit

Description

Calculates a goodness of clustering measure based on the average dispersion compared to a reference distribution.

Usage

gap(data = swiss,class = g, B = 500, cluster.func = myclus)

Arguments

data

The data matrix, with samples (observations) in rows and genes (variables)in columns

class

a vector descibing the cluster memberships of the rows of data

B

the number of bootstrap samples

cluster.func

a function taking the arguments data and k (number of clusters) and outputs cluster assignments as list elements cluster ( accessed by object$cluster ).

Value

The GAP statistic and the standard deviation

Author(s)

Per Broberg

References

Tishirani, R., Walther, G. and Hastie, T. (2000) Estimating the number of clusters in a dataset via the Gap statistic. Technical Report Stanford

Examples

library("MASS")
data(swiss)
cl <- myclus(data = swiss, k = 3)
gap(swiss,cl$cluster)

[Package SAGx version 1.62.0 Index]