NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Examining the robustness of LLM evaluation to the distributional assumptions of benchmarks
Charlotte Siska
|
Katerina Marazopoulou
|
Melissa Ailem
|
James Bono
|
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand
Venue:
ACL |
Citations
URL
No Citations Yet
https://chat.openai.com/
https://platform.openai
https://openai.com/blog/new-and-improved-embedding-model
https://huggingface.co/datasets/anli
https://www.adept.ai/blog/persimmon-8b
https://lmsys.org/blog/2023-03-30-vicuna/
https://www.anthropic.com/index/claude-2
https://mistral.ai/news/announcing-mistral-7b/
https://crfm
Field Of Study