VQAGuider: Guiding Multimodal Large Language Models to Answer Complex Video Questions

Yuyan Chen | Jiyuan Jia | Jiaxin Lu | Siyue Li | Yu Guan | Ming Yang | Qingpei Guo |

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL |