NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Using LLM Judgements for Sanity Checking Results and Reproducibility of Human Evaluations in NLP
Rudali Huidrom
|
Anya Belz
|
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria and virtual meeting
Venue:
GEM |
WS |
Citations
URL
No Citations Yet
https://github.com/RHuidrom96/Repro_LLM_as_
https://huggingface.co/CohereForAI/
https://huggingface.co/deepseek-ai/
https://huggingface.co/ibm-granite/
https://huggingface.co/meta-llama/
https://huggingface.co/meta-llama/Llama-3
https://huggingface.co/mistralai/
https://huggingface.co/Qwen/Qwen2
https://huggingface.co/Qwen/
https://cohere.com/blog/command-r-plus-microsoft-
Field Of Study