NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective
Teng Xiao
|
Mingxiao Li
|
Yige Yuan
|
Huaisheng Zhu
|
Chao Cui
|
Vasant G Honavar
|
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, USA
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://github.com/tengxiao1/GSIL
https://huggingface.co/datasets/
https://huggingface.co/datasets/Anthropic/
https://github.com/EleutherAI/
https://github.com/openai/human-eval
https://github.com/lm-sys/FastChat/tree/main/
Field Of Study