Safeguarding AI: SG-Bench for LLM Safety Generalization
Nov 24, 2024 · 4 min read · Arxiv: https://arxiv.org/abs/2410.21965v1 PDF: https://arxiv.org/pdf/2410.21965v1.pdf Authors: Wei Ye, Shikun Zhang, Yutao Mou Published: 2024-10-29 Introduction As companies increasingly incorporate large language models (LLMs) into their operation...
Join discussion