MLWQ: Efficient Small Language Model Deployment via Multi-Level Weight Quantization
Chun Hu | Junhui He | Shangyu Wu | Yuxin He | Chun Jason Xue | Qingan Li
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP
Citations: No Citations Yet
URL: https://github.com/hudevictor/mlwq
Field Of Study