NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
ScaleLLM: A Resource-Frugal LLM Serving Framework by Optimizing End-to-End Efficiency
Yuhang Yao
|
Han Jin
|
Alay Dilipbhai Shah
|
Shanshan Han
|
Zijian Hu
|
Dimitris Stripelis
|
Yide Ran
|
Zhaozhuo Xu
|
Salman Avestimehr
|
Chaoyang He
|
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, US
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://tensoropera.ai/prod/model/mistralai/
http://fireworks.ai
https://blog.invgate.com/chatgpt-statistics
https://tokio.rs/blog/
https://https://huggingface
https://github.com/
http://together.ai
http://vllm.ai
Field Of Study