Neil Thompson, Martin Fleming, et al.
IAAI 2024
Although LLMs can generate tools for generic domains and tasks, they struggle with enterprise-related domains that in- volve proprietary APIs and data schemas. We present Tool- Smith, a framework for autonomously generating and validat- ing agent-compatible tools. Given an API specification and a Tool Specification Requirement (TSR), ToolSmith produces a tool function and verifies it through a closed-loop process: it creates natural language (NL) tests and executes the tool in a secure agent sandbox for validation. For state-changing tools, ToolSmith confirms outcomes by querying the API with pa- rameters derived from the NL tests. If the tool fails to produce the desired output, ToolSmith generates diagnostic feedback to iteratively regenerate it. By ensuring both functional cor- rectness and agent compatibility, ToolSmith enables reliable automation of enterprise workflows.
Neil Thompson, Martin Fleming, et al.
IAAI 2024
Owen Cornec, Rahul Nair, et al.
NeurIPS 2021
Gaetano Rossiello, Shankar Subramaniam
ACM CAIS 2026
Phanwadee Sinthong, Dhaval Patel, et al.
VLDB 2022