NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Zhichen Dong
Number of Papers:- 2
Number of Citations:- 0
First ACL Paper:- 2024
Latest ACL Paper:- 2024
Venues:-
ACL
NAACL
Co-Authors:-
Chao Yang
Jiaheng Liu
Jie Liu
Jing Shao
Wanli Ouyang
Similar Authors:-
2024
Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey
NAACL
Zhichen Dong |
Zhanhui Zhou |
Chao Yang |
Jing Shao |
Yu Qiao |
Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!
ACL
Zhanhui Zhou |
Jie Liu |
Zhichen Dong |
Jiaheng Liu |
Chao Yang |
Wanli Ouyang |
Yu Qiao |
.