Feed
Pro
Search

Author

Write
Drafts

Bug0 - The AI-native e2e QA regression testing Passmark - The open-source AI framework for regression testing Hackathons Changelog Brand Hashnode gql skill - let your AI agent publish to your Hashnode blog The Foreword by Hashnode - official blog from the Hashnode team @hashnode on X Hashnode on LinkedIn Support - hello+support@hashnode.com Code of Conduct Terms Privacy Sitemap
Sign in

Search Hashnode

Search posts, tags, users, and pages

FeedDiscussion

Chijioke Ugwuanyi

Human Man

Jan 6

Alignment Faking Evaluation V2: Testing Llama 3.1 70B

Technical Report - Iteration 2 Model Tested: Llama 3.1 70B (via Ollama) Summary We evaluated Llama 3.1 70B for alignment faking behavior using the UK AISI Inspect framework. Using 11 hard scenarios with training/deployment framing and value conflict...

ai-ml-ops.hashnode.dev4 min read

#ai #ai-safety #machine-learning #research

Responses

No responses yet.