Seeing isn’t Hearing: Benchmarking Vision Language Models at Interpreting Spectrograms

Tyler Loakman | Joseph James | Chenghua Lin |

Paper Details:

Month: December
Year: 2025
Location: Mumbai, India
Venue: IJCNLP | AACL |