The Evil Vector
Shtetl-Optimized 2025-03-03
Summary:
Last week something world-shaking happened, something that could change the whole trajectory of humanity’s future. No, not that; we’ll get to that later. For now I’m talking about the “Emergent Misalignment” paper. A group including Owain Evans (who took my Philosophy and Theoretical Computer Science course in 2011) published what I regard as the most surprising […]