Graph of Attacks with Pruning: Optimizing Stealthy Jailbreak Prompt. Generation for Enhanced LLM Content Moderation

Daniel Schwarz | Dmitriy Bespalov | Zhe Wang | Ninad Kulkarni | Yanjun Qi |

Paper Details:

Month: August
Year: 2025
Location: Vienna, Austria
Venue: WOAH | WS |

Citations

URL

No Citations Yet

No URLs Found

Field Of Study