Can LLMs Deceive CLIP? Benchmarking Adversarial Compositionality of Pre-trained Multimodal Representation via Text Updates

Jaewoo Ahn | Heeseung Yun | Dayoon Ko | Gunhee Kim |

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL |