NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Improving Retrospective Language Agents via Joint Policy Gradient Optimization
Xueyang Feng
|
Bo Lan
|
Quanyu Dai
|
Lei Wang
|
Jiakai Tang
|
Xu Chen
|
Zhenhua Dong
|
Ji-Rong Wen
|
Paper Details:
Month: April
Year: 2025
Location: Albuquerque, New Mexico
Venue:
NAACL |
Citations
URL
No Citations Yet
https://www.mindspore.cn
https://github
https://anonymous
Field Of Study