CE-Bench: Towards a Reliable Contrastive Evaluation Benchmark of Interpretability of Sparse Autoencoders

Alex Gulko | Yusen Peng | Sachin Kumar

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: BlackboxNLP | WS
