NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Reflect-RL: Two-Player Online RL Fine-Tuning for LMs
Runlong Zhou
|
Simon Du
|
Beibin Li
|
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand
Venue:
ACL |
Citations
URL
No Citations Yet
https://github.com/zhourunlong/Reflect-RL
Field Of Study