Inconsistent Tokenizations Cause Language Models to be Perplexed by Japanese Grammar

Andrew Gambardella | Takeshi Kojima | Yusuke Iwasawa | Yutaka Matsuo

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL