NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
The Power of Many: Multi-Agent Multimodal Models for Cultural Image Captioning
Longju Bai
|
Angana Borah
|
Oana Ignat
|
Rada Mihalcea
|
Paper Details:
Month: April
Year: 2025
Location: Albuquerque, New Mexico
Venue:
NAACL |
Citations
URL
No Citations Yet
https://github.com/MichiganNLP/MosAIC
https://huggingface.co/liuhaotian/LLaVA-v1.5-13b
https://huggingface.co/Salesforce/blip2-opt-2.7b
https://github.com/MichiganNLP/MosAIC
https://chat.openai.com/
https://huggingface.co/docs/trl/en/index
Field Of Study