NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Query-OPT: Optimizing Inference of Large Language Models via Multi-Query Instructions in Meeting Summarization
Md Tahmid Rahman Laskar
|
Elena Khasanova
|
Xue-Yong Fu
|
Cheng Chen
|
Shashi Bhushan Tn
|
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, US
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://huggingface.co/spaces/philschmid/
https://github.com/talkiq/
https://openai.com/chatgpt
https://platform.openai.com/docs/models
https://www.anthropic.com/
https://www.anthropic.com/news/
https://huggingface.co/togethercomputer/
https://huggingface.co/Qwen/
https://huggingface.co/docs/transformers/
https://huggingface.co/yzha/AlignScore/
https://github.com/ggerganov/llama.cpp
https://openai.com/pricing
Field Of Study