Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains
Ibne Farabi Shihab |
Sanjeda Akter |
Anuj Sharma |
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue:
EMNLP |