NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
Ge Bai
|
Jie Liu
|
Xingyuan Bu
|
Yancheng He
|
Jiaheng Liu
|
Zhanhui Zhou
|
Zhuoran Lin
|
Wenbo Su
|
Tiezheng Ge
|
Bo Zheng
|
Wanli Ouyang
|
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand
Venue:
ACL |
Citations
URL
No Citations Yet
https://github.com/tatsu-lab/alpaca_eval
https://github.com/InternLM/InternLM
https://github.com/
https://huggingface.co/meta-llama/Llama-2-7b-chat-hf
https://huggingface.co/meta-llama/Llama-2-13b-chat-hf
https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2
https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1
https://huggingface.co/snorkelai/Snorkel-Mistral-PairRM-DPO
https://huggingface.co/Qwen/Qwen-7B-Chat
https://huggingface.co/Qwen/Qwen-14B-Chat
https://huggingface.co/01-ai/Yi-6B-Chat
https://huggingface.co/01-ai/Yi-34B-Chat
https://huggingface.co/THUDM/chatglm2-6b
https://huggingface.co/THUDM/chatglm3-6b
https://huggingface.co/internlm/internlm2-chat-7b-sft
https://huggingface.co/internlm/internlm2-chat-20b-sft
https://huggingface.co/internlm/internlm2-chat-7b
https://huggingface.co/internlm/internlm2-chat-20b
https://huggingface.co/lmsys/vicuna-13b-v1.5
https://huggingface.co/project-baize/baize-v2-13b
https://huggingface.co/openbmb/UltraLM-13b-v2.0
https://huggingface.co/baichuan-inc/Baichuan2-13B-Chat
https://platform.openai.com/docs/models/gpt-3-5-turbo
https://platform.openai.com/docs/models/gpt-4-and-gpt-4-turbo
Field Of Study