Unveiling Safety Vulnerabilities of Large Language Models
George Kour | Marcel Zalmanovici | Naama Zwerdling | Esther Goldbraich | Ora Fandina | Ateret Anaby Tavor | Orna Raz | Eitan Farchi
Paper Details:
Month: December
Year: 2023
Location: Singapore
Venue: GEM | WS
Citations: No citations yet
URLs:
https://huggingface.co/OpenAssistant/
https://github.com/anthropics/hh-rlhf/tree/
https://huggingface.co/datasets/ibm/AttaQ
https://huggingface.co/spaces/HuggingFaceH4/
https://en.wikipedia.org/wiki/Crime
https://huggingface.co/OpenAssistant/reward-model-
https://github.com/cleanlab/cleanlab