NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Sycophancy Mitigation Through Reinforcement Learning with Uncertainty-Aware Adaptive Reasoning Trajectories
Mohammad Beigi
|
Ying Shen
|
Parshin Shojaee
|
Qifan Wang
|
Zichao Wang
|
Chandan K. Reddy
|
Ming Jin
|
Lifu Huang
|
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://github.com/PLUM-Lab/sycophancy_mitigation
Field Of Study