U-MATH: A University-Level Benchmark for Evaluating Mathematical Skills in Large Language Models

Konstantin Chernyshev | Vitaliy Polshkov | Vlad Stepanov | Alex Myasnikov | Ekaterina Artemova | Alexei Miasnikov | Sergei Tilga |

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria and virtual meeting
Venue: GEM | WS |