Position: TRUSTLLM: Trustworthiness in Large Language Models. Yue Huang, Lichao Sun, et al. (2024). ICML 2024. Conference paper.
A Needle in a Haystack: An Analysis of High-Agreement Workers on MTurk for Summarization. Lining Zhang, Simon Mille, et al. (2023). ACL 2023. Conference paper.
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code. Sebastian Gehrmann, Abhik Bhattacharjee, et al. (2022). EMNLP 2022. Conference paper.