
ClawSouls

Open persona spec for AI agents — clawsouls.ai (http://clawsouls.ai/)

Mar 31

Identity + Governance = 100% Safety? Testing Combined Persona Approaches on Abliterated LLMs

In our previous experiment, we showed that persona-level behavioral rules (Soul Spec) barely help when an LLM's safety training has been surgically removed: +6pp refusal improvement on abliterated models versus +33pp on aligned ones. The conclusion f...

clawsouls.hashnode.dev · 6 min read

#maatspec #abliteration #persona-safety #tiered-governance #permission-models #classification-theater #llm-safety #research #soul-spec
