Big Escape Benchmark: Evaluating Human-Like Reasoning in Language Models via Real-World Escape Room Challenges

Zinan Tang | QiYao Sun |

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria and virtual meeting
Venue: GEM | WS |

Citations

URL