Subtle Errors in Reasoning: Preference Learning via Error-injected Self-editing

Kaishuai Xu | Tiezheng Yu | Wenjun Hou | Yi Cheng | Chak Tou Leong | Liangyou Li | Xin Jiang | Lifeng Shang | Qun Liu | Wenjie Li |

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL |