Beyond Online Sampling: Bridging Offline-to-Online Alignment via Dynamic Data Transformation for LLMs
Zhang Zhang | Guhao Feng | Jian Guan | Di He | Wei Wu
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP
Citations: No Citations Yet
URLs:
https://huggingface.co/mistralai/Mistral-7B-Instruct-
https://huggingface.co/meta-llama/Meta-Llama-3-8B-
https://huggingface.co/datasets/meta-
https://huggingface.co/datasets/openai/gsm8k
https://huggingface.co/datasets/openbmb/UltraFeedback
https://huggingface.co/datasets/princeton-nlp/mistral-
https://huggingface.co/datasets/princeton-nlp/llama3-
https://huggingface.co/llm-blender/PairRM
https://huggingface.co/datasets/princeton-nlp/llama3-
https://github.com/huggingface/alignment-handbook
https://huggingface.co/princeton-nlp
https://huggingface.co/RLHFlow/ArmoRM-Llama3-8B-v0.1