Symmetry Teleportation for Accelerated Optimization
Bo Zhao, Nima Dehmamy, et al.
NeurIPS 2022
Test log-likelihood is commonly used to compare different models of the same data and different approximate inference algorithms for fitting the same probabilistic model. We present simple examples demonstrating how comparisons based on test log-likelihood can contradict comparisons according to other objectives. Specifically, our examples show that (i) conclusions about forecast accuracy based on test log-likelihood comparisons may not agree with conclusions based on other distributional quantities like means; and (ii) that approximate Bayesian inference algorithms that attain higher test log-likelihoods need not also yield more accurate posterior approximations.
Bo Zhao, Nima Dehmamy, et al.
NeurIPS 2022
Ella Rabinovich, Samuel Ackerman, et al.
EMNLP 2023
Ben Huh, Avinash Baidya
NeurIPS 2022
Debarun Bhattacharjya, Balaji Ganesan, et al.
NeurIPS 2024