NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis
Yuping Lin
|
Pengfei He
|
Han Xu
|
Yue Xing
|
Makoto Yamada
|
Hui Liu
|
Jiliang Tang
|
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, USA
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://github.com/
https://github.com/0xk1h0/ChatGPT_DAN
https://lmsys.org/blog/2023-03-30-vicuna
https://ai.meta.com/blog/meta-llama-3/
https://github.com/llm-attacks/llm-attacks
https://github
https://huggingface.co/
https://huggingface.co/
https://huggingface.co/google/
Field Of Study