Ask Me Like I’m Human: LLM-based Evaluation with For-Human Instructions Correlates Better with Human Evaluations than Human Judges

Rudali Huidrom | Anya Belz |

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: TRL | WS |