IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages

Mohammed Khan | Priyam Mehta | Ananth Sankar | Umashankar Kumaravelan | Sumanth Doddapaneni | Suriyaprasaad B | Varun G | Sparsh Jain | Anoop Kunchukuttan | Pratyush Kumar | Raj Dabre | Mitesh Khapra |

Paper Details:

Month: August
Year: 2024
Location: Bangkok, Thailand
Venue: ACL |