NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Group-Aware Reinforcement Learning for Output Diversity in Large Language Models
Oron Anschel
|
Alon Shoshan
|
Adam Botach
|
Shunit Haviv Hakimi
|
Asaf Gendler
|
Emanuel Ben Baruch
|
Nadav Bhonker
|
Igor Kviatkovsky
|
Manoj Aggarwal
|
Gerard Medioni
|
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://github.com/huggingface/trl
Field Of Study