NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Sample-Efficient Human Evaluation of Large Language Models via Maximum Discrepancy Competition
Kehua Feng
|
Keyan Ding
|
Tan Hongzhi
|
Kede Ma
|
Zhihua Wang
|
Shuangquan Guo
|
Cheng Yuzhou
|
Ge Sun
|
Guozhou Zheng
|
Qiang Zhang
|
Huajun Chen
|
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria
Venue:
ACL |
Citations
URL
No Citations Yet
https://huggingface.co/spaces/lmsys/
https://tatsu-lab.github.io/alpaca_eval/
https://rank.opencompass.org.cn/
https://huggingface.co/datasets/lmsys/
https://github.com/
https://lmsys.org/blog/2023-03-30-vicuna/
https://github.com/
https://github.com/
Field Of Study