The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
Yifan Song | Guoyin Wang | Sujian Li | Bill Yuchen Lin
Paper Details:
Month: April
Year: 2025
Location: Albuquerque, New Mexico
Venue: NAACL
Citations: No Citations Yet
URLs:
https://github.com/Yifan-
https://github.com/tatsu-lab/alpaca_eval
Field Of Study