NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
U-MATH: A University-Level Benchmark for Evaluating Mathematical Skills in Large Language Models
Konstantin Chernyshev
|
Vitaliy Polshkov
|
Vlad Stepanov
|
Alex Myasnikov
|
Ekaterina Artemova
|
Alexei Miasnikov
|
Sergei Tilga
|
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria and virtual meeting
Venue:
GEM |
WS |
Citations
URL
No Citations Yet
https://github.com/toloka/u-math
https://huggin
https://de
https://mistral
https://mist
https://mi
https://nexusflow.ai/blogs/a
https://openai.c
https://openai.com/i
https://openai.c
https://openai.com
https://qwen
https://qw
https://platform.openai.com/docs/guides/vision
Field Of Study