UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs
Chaoqun He | Renjie Luo | Shengding Hu | Ranchi Zhao | Jie Zhou | Hanghao Wu | Jiajie Zhang | Xu Han | Zhiyuan Liu | Maosong Sun
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand
Venue: ACL
Citations: No Citations Yet
URLs:
https://github.com/
https://youtu
https://chat.lmsys.org/
https://github.com/EleutherAI/
https://github.com/vllm-project/vllm
https://github.com/tatsu-lab/alpaca_eval
https://github.com/lm-sys/FastChat
https://github.com/stanford-crfm/helm
https://huggingface.co/docs/transformers/main
https://flageval.baai.ac.cn
https://github.com/open-compass/opencompass
https://github.com/declare-lab/instruct-eval
https://github.com/openai/evals
https://github.com/GPT-Fathom/GPT-Fathom
https://huggingface.co/datasets
https://github.com/open-compass/
https://github.com/OpenBMB/UltraEval/blob/