SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks

Adamenko Pavel | Ivanov Mikhail | Aidar Valeev | Rodion Levichev | Pavel Zadorozhny | Ivan Lopatin | Dmitrii Babaev | Alena Fenogenova | Valentin Malykh |

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP |