NLPExplorer

Graph of Attacks with Pruning: Optimizing Stealthy Jailbreak Prompt Generation for Enhanced LLM Content Moderation

Daniel Schwartz | Dmitriy Bespalov | Zhe Wang | Ninad Kulkarni | Yanjun Qi |

Paper Details:

Month: November
Year: 2025
Location: Suzhou (China)
Venue: EMNLP |

Citations

URL

No Citations Yet

No URLs Found

Field Of Study