NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Yi Zeng
Number of Papers:- 1
Number of Citations:- 0
First ACL Paper:- 2024
Latest ACL Paper:- 2024
Venues:-
EMNLP
ACL
Co-Authors:-
Diyi Yang
Hongpeng Lin
Jingwen Zhang
Ruoxi Jia
Weiyan Shi
Similar Authors:-
2024
How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs
ACL
Yi Zeng |
Hongpeng Lin |
Jingwen Zhang |
Diyi Yang |
Ruoxi Jia |
Weiyan Shi |
BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models
EMNLP
Yi Zeng |
Weiyu Sun |
Tran Huynh |
Dawn Song |
Bo Li |
Ruoxi Jia |
.