A Comparative Study of Vision Transformers and Multimodal Language Models for Violence Detection in Videos

Tomas Ditchfield-Ogle | Ruslan Mitkov

Paper Details:

Month: September
Year: 2025
Location: Varna, Bulgaria
Venue: R2LM | WS
