NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
Tianduo Wang
|
Shichen Li
|
Wei Lu
|
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand
Venue:
ACL |
Citations
URL
No Citations Yet
https://github.com/
https://huggingface.co/docs/transformers/
https://www.anthropic
https://llama.meta.com/
https://github.com/kingoflolz/
Field Of Study