DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation. Eliya Habba, Ofir Arviv, et al. 2025. ACL 2025.
Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI. Elron Bandel, Yotam Perlitz, et al. 2024. NAACL 2024.
FastFit: Fast and Effective Few-Shot Text Classification with a Multitude of Classes. Asaf Yehudai, Elron Bandel. 2024. NAACL 2024.