RedHit: Adaptive Red-Teaming of Large Language Models via Search, Reasoning, and Preference Optimization

Mohsen Sorkhpour | Abbas Yazdinejad | Ali Dehghantanha

Paper Details:

Month: August
Year: 2025
Location: Vienna, Austria
Venue: LLMSEC | WS
SIG: SIGSEC
