Course-Correction: Safety Alignment Using Synthetic Preferences

Rongwu Xu | Yishuo Cai | Zhenhong Zhou | Renjie Gu | Haiqin Weng | Liu Yan | Tianwei Zhang | Wei Xu | Han Qiu

Paper Details:

Month: November
Year: 2024
Location: Miami, Florida, US
Venue: EMNLP