Towards Enforcing Company Policy Adherence in Agentic WorkflowsNaama ZwerdlingDavid Boazet al.2025EMNLP 2025
SIMBA UQ: Similarity-Based Aggregation for Uncertainty Quantification in Large Language ModelsDebarun BhattacharjyaBalaji Ganesanet al.2025EMNLP 2025
AutoPDL: Automatic Prompt Optimization for LLM AgentsClaudio SpiessMandana Vaziriet al.2025AutoML 2025
Bootstrapping Learned Cost Models with Synthetic SQL QueriesMichael NiddChristoph Miksovic Czaschet al.2025VLDB 2025
Learnable Channel Converter for Multi-Spectral Image to RGB Visualization using a Vision-Text ModelHaoxiang QiuTomoya Sakaiet al.2025IGARSS 2025
A Perspective on LLM Data Generation with Few-shot Examples: from Intent to Kubernetes ManifestAntonino AngiLiubov Nedoshivinaet al.2025ACL 2025
ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific LanguagesMehant KammakomatiSameer Pimparkhedeet al.2025ACL 2025
Improving Large Language Models for Programmatic Text Understanding via Iterative Instruction RefinementLecheng YanChenyang Lyuet al.2025ICIC 2025
Otter: Generating Tests from Issues to Validate SWE PatchesToufique AhmedJatin Ganhotraet al.2025ICML 2025
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive SearchMaohao ShenGuangtao Zenget al.2025ICML 2025