Beyond BLEU: Ethical Risks of Misleading Evaluation in Domain-Specific QA with LLMs
Ayoub Nainia | Régine Vignes-Lebbe | Hajar Mousannif | Jihad Zahir
Paper Details:
Month: September
Year: 2025
Location: Varna, Bulgaria
Venue: R2LM | WS
Citations: No Citations Yet
URL:
https://doi.org/10.26615/978-954-452-102-8-009
https://www.gbif.org
https://huggingface.co/meta-llama/Llama-2-7b
https://huggingface.co/meta-llama/Meta-Llama-3-8B
Field Of Study