NESTFUL: A Benchmark for Evaluating LLMs on Nested Sequences of API Calls

Kinjal Basu | Ibrahim Abdelaziz | Kiran Kate | Mayank Agarwal | Maxwell Crouse | Yara Rizk | Kelsey Bradford | Asim Munawar | Sadhana Kumaravel | Saurabh Goyal | Xin Wang | Luis A. Lastras | Pavan Kapanipathi |

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP |