Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

Yue Yang | Ajay Patel | Matt Deitke | Tanmay Gupta | Luca Weihs | Andrew Head | Mark Yatskar | Chris Callison-Burch | Ranjay Krishna | Aniruddha Kembhavi | Christopher Clark |

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL |