NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
S2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning
Ruotian Ma
|
Peisong Wang
|
Cheng Liu
|
Xingyan Liu
|
Jiaqi Chen
|
Bang Zhang
|
Xin Zhou
|
Nan Du
|
Jia Li
|
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria
Venue:
ACL |
Citations
URL
No Citations Yet
https://github
https://hkust-nlp.notion.site/
https://openai.com/api/
https://huggingface.co/meta-llama/
https://github.com/Yale-LILY/FOLIO
https://github.com/facebookresearch/cruxeval
https://github.com/eladsegal/strategyqa
https://github.com/TIGER-AI-Lab/MMLU-Pro
https://github.com/huggingface/trl
https://github.com/vllm-project/vllm
https://github.com/QwenLM/Qwen2.5-Math
Field Of Study