SCORE: Systematic COnsistency and Robustness Evaluation for Large Language Models

Grigor Nalbandyan | Rima Shahbazyan | Evelina Bakhturina |

Paper Details:

Month: April
Year: 2025
Location: Albuquerque, New Mexico
Venue: NAACL |