NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
ChatBench: From Static Benchmarks to Human-AI Evaluation
Serina Chang
|
Ashton Anderson
|
Jake M. Hofman
|
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria
Venue:
ACL |
Citations
URL
No Citations Yet
https://crfm.stanford.edu/helm/mmlu/latest/#/
https://aspredicted.org/n84n-sn3f.pdf
https://lmsys
https://www.bls.gov/web/empsit/
https://huggingface.co/datasets/microsoft/
https://gdpr.eu/eu-gdpr-personal-data/
https://aspredicted.org/n84n-sn3f.pdf
https://github
https://huggingface.co/datasets/cais/mmlu
https://huggingface.co/datasets/
Field Of Study