Rajasekar Krishnamurthy, Yunyao Li, et al.
SIGMOD Record
We describe a Bayesian approach to model selection in unsupervised learning that determines both the feature set and the number of clusters. We then evaluate this scheme (based on marginal likelihood) and one based on cross-validated likelihood. For the Bayesian scheme we derive a closed-form solution of the marginal likelihood by assuming appropriate forms of the likelihood function and prior. Extensive experiments compare these approaches and all results are verified by comparison against ground truth. In these experiments the Bayesian scheme using our objective function gave better results than cross-validation.
Rajasekar Krishnamurthy, Yunyao Li, et al.
SIGMOD Record
Ashutosh Garg, Sreeram Balakrishnan, et al.
ICASSP 2004
Laura Chiticariu, Rajasekar Krishnamurthy, et al.
ACL 2010
Ronald Fagin, Benny Kimelfeld, et al.
SIGMOD/PODS/ 2010