Chatbot Arena Estimate: towards a generalized performance benchmark for LLM capabilities

Lucas Spangher | Tianle Li | William F. Arnold | Nick Masiewicki | Xerxes Dotiwalla | Rama Kumar Pasumarthi | Peter Grabowski | Eugene Ie | Daniel Gruhl

Paper Details:

Month: April
Year: 2025
Location: Albuquerque, New Mexico
Venue: NAACL