RewardDS: Privacy-Preserving Fine-Tuning for Large Language Models via Reward Driven Data Synthesis

Jianwei Wang | Chengming Shi | Junyao Yang | Haoran Li | Qianli Ma | Huiping Zhuang | Cen Chen | Ziqian Zeng |

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP |

Citations

URL

No Citations Yet