EvalAssist: Insights on Task-Specific Evaluations and AI-assisted Judgement Strategy PreferencesZahra AshktorabMichael Desmondet al.2025UIST 2025
Label Sleuth: From Unlabeled Text to a Classifier in a Few HoursEyal ShnarchAlon Halfonet al.2022EMNLP 2022
InteractEva: A Simulation-Based Evaluation Framework for Interactive AI SystemsYannis KatsisMaeda Hanafiet al.2022AAAI 2022