NESTFUL: A Benchmark for Evaluating LLMs on Nested Sequences of API CallsKinjal BasuIbrahim Abdelazizet al.2025EMNLP 2025
Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular TasksIbrahim AbdelazizKinjal Basuet al.2024EMNLP 2024
API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMsKinjal BasuIbrahim Abdelazizet al.2024ACL 2024