NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Enhancing Reinforcement Learning with Dense Rewards from Language Model Critic
Meng Cao
|
Lei Shu
|
Lei Yu
|
Yun Zhu
|
Nevan Wichers
|
Yinxiao Liu
|
Lei Meng
|
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, USA
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://github.com/google-research/
https://huggingface.co/lvwerra/
https://github.com/conversationai/
http://Skylion007.github.io/
https://arxiv
https://arxiv
https://github.com/
Field Of Study