NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Cognitive Dissonance: Why Do Language Model Outputs Disagree with Internal Representations of Truthfulness?
Kevin Liu
|
Stephen Casper
|
Dylan Hadfield-Menell
|
Jacob Andreas
|
Paper Details:
Month: December
Year: 2023
Location: Singapore
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://github.com/kingoflolz/
Field Of Study