NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs
Yidan Zhang
|
Yu Wan
|
Boyi Deng
|
Baosong Yang
|
Hao-Ran Wei
|
Fei Huang
|
Bowen Yu
|
Dayiheng Liu
|
Junyang Lin
|
Fei Huang
|
Jingren Zhou
|
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://huggingface.co/datasets/Qwen/P-
https://huggingface.co/datasets/openai/MMMLU
https://github.com/openai/simple-evals
https://github.com/open-compass/
https://github.com/tatsu-lab/alpaca_eval
Field Of Study