SIFT-50M: A Large-Scale Multilingual Dataset for Speech Instruction Fine-Tuning

Prabhat Pandey | Rupak Vignesh Swaminathan | K V Vijay Girish | Arunasish Sen | Jian. Xie | Grant Strimel | Andreas Schwarz |

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL |