Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models

Siqi Wang | Zhengyu Chen | Bei Li | Keqing He | Min Zhang | Jingang Wang |

Paper Details:

Month: November
Year: 2024
Location: Miami, Florida, USA
Venue: EMNLP |

Citations

URL

No Citations Yet

No URLs Found

Field Of Study