Retrospex: Language Agent Meets Offline Reinforcement Learning Critic
Yufei Xiang | Yiqun Shen | Yeqin Zhang | Nguyen Cam-Tu
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, USA
Venue: EMNLP
URLs:
https://github.com/Yufei-Xiang/Retrospex
https://huggingface.co/google/flan-t5-large
https://huggingface.co/datasets/THUDM/AgentInstruct
https://huggingface.co/datasets/anon8231489123/Share
https://huggingface.co/docs/trl/main/en/sft_trainer