NESTFUL: A Benchmark for Evaluating LLMs on Nested Sequences of API Calls
- Kinjal Basu
- Ibrahim Abdelaziz
- et al.
- 2025
- EMNLP 2025
I am a Senior Technical Staff Member (STSM) at IBM AI Research. My broad research interests include AI Agents, Large Language Models, AutoML, Natural Language Processing, and applied Machine Learning.
I am currently working on LLM-based AI agents for Business Automation.
In the past, I have worked on AutoML, deep generative models, probabilistic programming, text classification and ranking, anomaly detection, and statistical machine translation. I have applied machine learning to various business problems in industries such as food safety, manufacturing, and construction.
Building a unified Digital Labor platform with conversation, orchestration, and training capabilities.
Scaling AI technologies for NLP and text data to a large variety of users.