Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback
Yen-Ting Lin | Di Jin | Tengyu Xu | Tianhao Wu | Sainbayar Sukhbaatar | Chen Zhu | Yun He | Yun-Nung Chen | Jason E Weston | Yuandong Tian | Arash Rahnama | Sinong Wang | Hao Ma | Han Fang
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue: MathNLP | WS
Citations
No Citations Yet
URL
https://github.com/QwenLM/Qwen2.5-Math/blob/
https://github.com/sympy/sympy
https://qwenlm
https://github.com/openai/simple-evals
https://huggingface.co/datasets/meta-llama/Llama-3.1-70B-Instruct-evals
Field Of Study