MARS: Benchmarking the Metaphysical Reasoning Abilities of Language Models with a Multi-task Evaluation Dataset

Weiqi Wang | Yangqiu Song

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL