A Perspective on LLM Data Generation with Few-shot Examples: from Intent to Kubernetes ManifestAntonino AngiLiubov Nedoshivinaet al.2025ACL 2025
Think Again! The Effect of Test-Time Compute on Preferences, Opinions, and Beliefs of Large Language ModelsGeorge KourItay Nakashet al.2025ACL 2025
Conceptual Diagnostics for Knowledge Graphs and Large Language ModelsRosario Uceda-SosaMaria Changet al.2025ACL 2025
Multi-Sense Embeddings for Language Models and Knowledge DistillationQitong WangMohammed Zakiet al.2025ACL 2025
DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM EvaluationEliya HabbaOfir Arvivet al.2025ACL 2025
Defensive Prompt Patch: A Robust and Generalizable Defense of Large Language Models against Jailbreak AttacksChen XiongXiangyu Qiet al.2025ACL 2025
Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational AgentsIvoline NgongSwanand Ravindra Kadheet al.2025ACL 2025
Knowledge Base Construction for Knowledge-Augmented Text-to-SQLJinheon BaekHorst Samulowitzet al.2025ACL 2025
“You are Beautiful, Body Image Stereotypes are Ugly!” BIStereo: A Benchmark to Measure Body Image Stereotypes in Language ModelsNarjis AsadNihar Ranjan Sahooet al.2025ACL 2025
PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool PlayWei FangYang Zhanget al.2025ACL 2025