


default search action
"On Evaluating the Durability of Safeguards for Open-Weight LLMs."
Xiangyu Qi et al. (2024)
- Xiangyu Qi, Boyi Wei, Nicholas Carlini, Yangsibo Huang, Tinghao Xie, Luxi He, Matthew Jagielski, Milad Nasr, Prateek Mittal, Peter Henderson:
On Evaluating the Durability of Safeguards for Open-Weight LLMs. CoRR abs/2412.07097 (2024)

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.