Which of These Best Describes Multiple Choice Evaluation with LLMs? A) Forced B) Flawed C) Fixable D) All of the Above
Nishant Balepur | Rachel Rudinger | Jordan Lee Boyd-Graber
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL
Citations: No Citations Yet
URLs:
https://blog.prepscholar.com/sat-analogies-and-
https://www.education.gouv.fr/reussir-au-lycee/le-
https://www.reddit.com/r/AskAnAmerican/comments/rjxns4/
https://huggingface.co/
Field Of Study