NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
A Simple Yet Effective Method for Non-Refusing Context Relevant Fine-grained Safety Steering in LLMs
Shaona Ghosh
|
Amrita Bhattacharjee
|
Yftah Ziser
|
Christopher Parisien
|
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://huggingface.co/meta-llama/Llama-2-7B-chat-hf
https://platform.openai.com/docs/models/gpt-4-turbo-
https://huggingface.co/meta-llama/Llama-2-7b
https://huggingface.co/meta-llama/Meta-Llama-3-8B
https://build.nvidia.com/nvidia/nemotron-4-340b-
https://www
https://crfm
https://github.com/pytorch/pytorch
https://github.com/huggingface/transformers
https://huggingface.co/docs/hub/en/models-the-hub
Field Of Study