NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation
Farima Fatahi Bayat
|
Lechen Zhang
|
Sheza Munir
|
Lu Wang
|
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria
Venue:
ACL |
Citations
URL
No Citations Yet
https://huggingface.co/spaces/launch/factbench
https://serper.dev/
https://serper.dev/
https://ai.meta.com/blog/meta-llama-3/
https://www.anthropic.com/news/claude-3-5-
https://cohere.com/blog/command-r-plus-microsoft-
https://github
https://ai.meta.com/blog/
https://mistral.ai/news/mistral-large-2407
https://openai.com/index/hello-gpt-4o/
https://openai.com/index/
https://openai.com/blog/
Field Of Study