NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Benchmarking Failures in Tool-Augmented Language Models
Eduardo TreviƱo
|
Hugo Contant
|
James Ngai
|
Graham Neubig
|
Zora Zhiruo Wang
|
Paper Details:
Month: April
Year: 2025
Location: Albuquerque, New Mexico
Venue:
NAACL |
Citations
URL
No Citations Yet
https://github.com/EduardoTrevino/fail-talms
https://mixedanalytics.com/blog/
Field Of Study