QueryGym: Step-by-Step Interaction with Relational Databases. Haritha Ananthakrishnan, Harsha Kokel, et al. 2026. AAAI 2026.
Auto-BenchmarkCard: Automated Synthesis of Benchmark Documentation. Aris Hofmann, Inge Vejsbjerg, et al. 2026. AAAI 2026.
AssetOpsBench-Live: Privacy-Aware Online Evaluation of Multi-Agent Performance in Industrial Operations. Dhaval Patel, Nianjun Zhou, et al. 2026. AAAI 2026.
ToolSmith: A Multi-Agent Framework for Enterprise Tool Creation. Purna Chandra Sekhar Vakudavathu, Kushal Mukherjee, et al. 2026. AAAI 2026.
DFAgent: From Natural Language Data Interactions to Reusable Agent-Ready Tools. Neelamadhav Gantayat, Renuka Sindhgatta, et al. 2026. AAAI 2026.
AutoTuneX: Interactive Automated Fine-Tuning for Large Language Models. Daniel Karl I. Weidele, Priyanshu Rai, et al. 2026. AAAI 2026.
FinOps Agent - A Use-case for IT Infrastructure and Cost Optimization. An Vo, Manish Modani, et al. 2026. AAAI 2026.
An Analysis of Hyper-Parameter Optimization Methods for Retrieval Augmented Generation. Matan Orbach, Ohad Eytan, et al. 2026. AAAI 2026.