Large Language Models Badly Generalize across Option Length, Problem Types, and Irrelevant Noun Replacements

Guangxiang Zhao | Saier Hu | Xiaoqi Jian | Wu Jinzhu | Yuhan Wu | Lin Sun | Xiangzheng Zhang |

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP |