NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
SCORE: Systematic COnsistency and Robustness Evaluation for Large Language Models
Grigor Nalbandyan
|
Rima Shahbazyan
|
Evelina Bakhturina
|
Paper Details:
Month: April
Year: 2025
Location: Albuquerque, New Mexico
Venue:
NAACL |
Citations
URL
No Citations Yet
https://github.com/EleutherAI/lm-evaluation-
https://huggingface.co/spaces/nvidia/llm-robustness-
https://huggingface.co/meta-llama/
https://huggingface.co/mistralai/
https://huggingface.co/Qwen/
https://huggingface.co/01-ai/Yi-1.5-34B-Chat
https://huggingface
https://github.com/NVIDIA/TensorRT-LLM
Field Of Study