Thomas M. Cheng
IT Professional
We study feature selection for k-means clustering. Although the literature contains many methods with good empirical performance, algorithms with provable theoretical behavior have only recently been developed. Unfortunately, these algorithms are randomized and fail with, say, a constant probability. We present the first deterministic feature selection algorithm for k-means clustering with relative error guarantees. At the heart of our algorithm lies a deterministic method for decompositions of the identity and a structural result which quantifies some of the tradeoffs in dimensionality reduction. © 1963-2012 IEEE.
Thomas M. Cheng
IT Professional
Sabine Deligne, Ellen Eide, et al.
INTERSPEECH - Eurospeech 2001
Elena Cabrio, Philipp Cimiano, et al.
CLEF 2013
Elliot Linzer, M. Vetterli
Computing