NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
From Calculation to Adjudication: Examining LLM Judges on Mathematical Reasoning Tasks
Andreas Stephan
|
Dawei Zhu
|
Matthias Aßenmacher
|
Xiaoyu Shen
|
Benjamin Roth
|
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria and virtual meeting
Venue:
GEM |
WS |
Citations
URL
No Citations Yet
https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct
https://huggingface.co/Qwen/Qwen2.5-72B-Instruct
https://huggingface.co/Qwen/Qwen2.5-14B-Instruct
https://huggingface.co/google/gemma-2-27b-it
https://huggingface.co/Qwen/Qwen2.5-7B-Instruct
https://huggingface.co/google/gemma-1.1-9b-it
https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct
https://huggingface.co/google/gemma-1.1-2b-it
Field Of Study