BabyLlama-2: Ensemble-Distilled Models Consistently Outperform Teachers With Limited Data

Jean-Loup Tastet | Inar Timiryasov

Paper Details:

Month: November
Year: 2024
Location: Miami, FL, USA
Venue: CoNLL | BabyLM | WS
