Learning to See through Sound: From VggCaps to Multi2Cap for Richer Automated Audio Captioning

Sangyeon Cho | Mingi Kim | Jinkwon Hwang | Jaehoon Go | Minuk Ma | Sunjae Yoon | Junyeong Kim

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP

Field Of Study