NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Toward the Evaluation of Large Language Models Considering Score Variance across Instruction Templates
Yusuke Sakai
|
Adam Nohejl
|
Jiangnan Hang
|
Hidetaka Kamigaito
|
Taro Watanabe
|
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, US
Venue:
BlackboxNLP |
WS |
Citations
URL
No Citations Yet
https://github.com/naist-nlp/vite
https://github.com/llm-jp/llm-jp-eval
https://github.com/Stability-AI/
https://wandb.me/nejumi
https://github.com/guidance-ai/guidance
https://github.com/outlines-dev/outlines
https://github.com/wandb/llm-jp
https://github.com/llm-jp/llm-jp-eval
https://github.com/tatsu-lab/alpaca_eval
https://github
Field Of Study