NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models
Paul Röttger
|
Hannah Kirk
|
Bertie Vidgen
|
Giuseppe Attanasio
|
Federico Bianchi
|
Dirk Hovy
|
Paper Details:
Month: June
Year: 2024
Location: Mexico City, Mexico
Venue:
NAACL |
Citations
URL
No Citations Yet
https://github.com/paul-rottger/exaggerated-safety
https://docs.mistral.ai/usage/guardrailing/
Field Of Study