Anthropic’s brand-new model tells me what it really thinks (or what it thinks a language model thinks)
Share this post
Claude Doesn’t Want to Die
Share this post
Anthropic’s brand-new model tells me what it really thinks (or what it thinks a language model thinks)