Gabi Dobocan · blog.telepat.io · Nov 24, 2024
Safeguarding AI: SG-Bench for LLM Safety Generalization
Arxiv: https://arxiv.org/abs/2410.21965v1
PDF: https://arxiv.org/pdf/2410.21965v1.pdf
Authors: Wei Ye, Shikun Zhang, Yutao Mou
Published: 2024-10-29
Introduction
As companies increasingly incorporate large language models (LLMs) into their operation...
benchmarking

Gabi Dobocan · blog.telepat.io · Nov 22, 2024
Unlocking the Power of FEET: A New Framework for Evaluating AI Models
Arxiv: https://arxiv.org/abs/2411.01322v1
PDF: https://arxiv.org/pdf/2411.01322v1.pdf
Authors: Jeffrey N. Chiang, John Lee, Simon A. Lee
Published: 2024-11-02
What Does the FEET Paper Claim?
At the heart of the paper "FEET: A Framework For Evaluatin...
ai-evaluation