Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains

Ibne Farabi Shihab | Sanjeda Akter | Anuj Sharma

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP