NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Safety Alignment in NLP Tasks: Weakly Aligned Summarization as an In-Context Attack
Yu Fu
|
Yufei Li
|
Wen Xiao
|
Cong Liu
|
Yue Dong
|
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand
Venue:
ACL |
Citations
URL
No Citations Yet
https://github.com/FYYFU/SafetyAlignNLP
https://github.com/Mimino666/langdetect
https://huggingface.co/sentence-transformers/all-
https://huggingface.co/docs/transformers/model_doc/llama2
https://github.com/unitaryai/detoxify
https://github.com/PKU-
https://www.consumer.ftc.gov/articles/how-recognize-and-avoid-phishing-scams\n*
Field Of Study