Path Drift in Large Reasoning Models: How First-Person Commitments Override Safety

Yuyi Huang | Runzhe Zhan | Lidia S. Chao | Ailin Tao | Derek F. Wong |

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP |

Citations

URL

No Citations Yet