Efficient multi-prompt evaluation of LLMsFelipe Maia PoloRonald Xuet al.2024NeurIPS 2024Conference paper
tinyBenchmarks: evaluating LLMs with fewer examplesFelipe Maia PoloLucas Weberet al.2024ICML 2024Conference paper
tinyBenchmarks: evaluating LLMs with fewer examplesFelipe Maia PoloLucas Weberet al.2024ICLR 2024Workshop paper