NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback
Alexander Havrilla
|
Maksym Zhuravinskyi
|
Duy Phung
|
Aman Tiwari
|
Jonathan Tow
|
Stella Biderman
|
Quentin Anthony
|
Louis Castricato
|
Paper Details:
Month: December
Year: 2023
Location: Singapore
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://github.com/huggingface/
http://www.cs
https://github.com/lvwerra/trl
https://arxiv.org/abs/2203.17189
https://github.com/kingoflolz/
https://arxiv
Field Of Study