NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Damai Dai
|
Chengqi Deng
|
Chenggang Zhao
|
R.x. Xu
|
Huazuo Gao
|
Deli Chen
|
Jiashi Li
|
Wangding Zeng
|
Xingkai Yu
|
Y. Wu
|
Zhenda Xie
|
Y.k. Li
|
Panpan Huang
|
Fuli Luo
|
Chong Ruan
|
Zhifang Sui
|
Wenfeng Liang
|
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand
Venue:
ACL |
Citations
URL
No Citations Yet
https://github.com/deepseek-ai/DeepSeek-MoE
https://github.com/huggingface/tokenizers
https://github.com/kingoflolz/
https://github.com/XueFuzhao/OpenMoE
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
Field Of Study