Conceptual Diagnostics for Knowledge Graphs and Large Language Models. Rosario Uceda-Sosa, Maria Chang, et al. 2025. ACL 2025.
Multi-Sense Embeddings for Language Models and Knowledge Distillation. Qitong Wang, Mohammed Zaki, et al. 2025. ACL 2025.
DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation. Eliya Habba, Ofir Arviv, et al. 2025. ACL 2025.
MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems. Yannis Katsis, Sara Rosenthal, et al. 2025. ACL 2025.
Query-driven Document-level Scientific Evidence Extraction from Biomedical Studies. Massimiliano Pronesti, Joao Bettencourt-Silva, et al. 2025. ACL 2025.
A Perspective on LLM Data Generation with Few-shot Examples: from Intent to Kubernetes Manifest. Antonino Angi, Liubov Nedoshivina, et al. 2025. ACL 2025.
Stereotype Detection as a Catalyst for Enhanced Bias Detection: A Multi-Task Learning Approach. Aditya Tomar, Rudra Murthy Venkataramana, et al. 2025. ACL 2025.
PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play. Wei Fang, Yang Zhang, et al. 2025. ACL 2025.