s3: You Don’t Need That Much Data to Train a Search Agent via RL
Pengcheng Jiang | Xueqiang Xu | Jiacheng Lin | Jinfeng Xiao | Zifeng Wang | Jimeng Sun | Jiawei Han
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP
Citations: No Citations Yet
URLs:
https://github.com/pat-jj/s3
https://github.com/huggingface/trl
https://github.com/StonyBrookNLP/ircot
https://huggingface.co/DeepRetrieval/
https://huggingface.co/PeterJinGo/
https://huggingface.co/PeterJinGo/SearchR1-nq_
https://huggingface.co/PeterJinGo/R1-nq_
https://github.com/volcengine/verl
https://github.com/RAGEN-AI/RAGEN
https://huggingface.co/Qwen/Qwen2
https://huggingface.co/datasets/RUC-NLPIR/
https://github.com/Teddy-XiongGZ/MIRAGE/blob/
https://github.com/castorini/pyserini/blob/
https://docs.google.com/spreadsheets/d/e/
Field Of Study