MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models

Wai-Chung Kwan | Xingshan Zeng | Yuxin Jiang | Yufei Wang | Liangyou Li | Lifeng Shang | Xin Jiang | Qun Liu | Kam-Fai Wong

Paper Details:

Month: November
Year: 2024
Location: Miami, Florida, USA
Venue: EMNLP