SAE-SSV: Supervised Steering in Sparse Representation Spaces for Reliable Control of Language Models

Zirui He | Mingyu Jin | Bo Shen | Ali Payani | Yongfeng Zhang | Mengnan Du |

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP |

Citations

URL

No Citations Yet

Field Of Study