NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks
Adamenko Pavel
|
Ivanov Mikhail
|
Aidar Valeev
|
Rodion Levichev
|
Pavel Zadorozhny
|
Ivan Lopatin
|
Dmitrii Babaev
|
Alena Fenogenova
|
Valentin Malykh
|
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://aider.chat
https://openai.com/index/
https://pypi.org/project/repositorytest;
https://hub.docker.com/layers/library/python/3.11
https://mistral.ai/news/codestral
https://www.llama.com/docs/
https://mistral.ai/news/devstral
https://github.com/DataDog/guarddog
https://github.com/fkie-cad/socbed
https://huggingface.co/datasets/
https://chatgpt.com
https://app.grammarly.com/
https://github.com/reframe-hpc/reframe/issues/2857):
Field Of Study