Elo Uncovered: Robustness and Best Practices in Language Model Evaluation
Meriem Boubdir | Edward Kim | Beyza Ermis | Sara Hooker | Marzieh Fadaee
Paper Details:
Month: December
Year: 2023
Location: Singapore
Venue: GEM | WS
Citations: No Citations Yet
URL: https://github.com/chatarena/chatarena
Field Of Study