Hazar Yueksel, Ramon Bertran, et al.
MLSys 2020
We study the problem of list-decodable Gaussian covariance estimation. Given a multiset of points in such that an unknown fraction of points in are i.i.d. samples from an unknown Gaussian , the goal is to output a list of hypotheses at least one of which is close to in relative Frobenius norm. Our main result is a sample and time algorithm for this task that guarantees relative Frobenius norm error of . Importantly, our algorithm relies purely on spectral techniques. As a corollary, we obtain an efficient spectral algorithm for robust partial clustering of Gaussian mixture models (GMMs) --- a key ingredient in the recent work of [BakDJKKV22] on robustly learning arbitrary GMMs. Combined with the other components of [BakDJKKV22], our new method yields the first Sum-of-Squares-free algorithm for robustly learning GMMs, resolving an open problem proposed by Vempala and Kothari. At the technical level, we develop a novel multi-filtering method for list-decodable covariance estimation that may be useful in other settings.
Hazar Yueksel, Ramon Bertran, et al.
MLSys 2020
Megh Thakkar, Quentin Fournier, et al.
ACL 2024
Natalia Martinez Gil, Kanthi Sarpatwar, et al.
NeurIPS 2023
Natalia Martinez Gil, Dhaval Patel, et al.
UAI 2024