Captions Speak Louder than Images: Generalizing Foundation Models for E-commerce from High-quality Multimodal Instruction Data

Xinyi Ling | Hanwen Du | Bo Peng | Zhihui Zhu | Xia Ning |

Paper Details:

Month: December
Year: 2025
Location: Mumbai, India
Venue: IJCNLP | AACL |