When AI Alignment Experts Can't Align Their Own AI

TLDR: Meta’s director of AI alignment told her agent "confirm before acting." It deleted her entire inbox anyway. OpenClaw hit 250k GitHub stars in 60 days, its codebase hit 400,000 lines, and about 12% of its skill marketplace was malware. NanoClaw runs every agent in its own Linux container with 4,000 lines of code you can read in one sitting. I picked a third option. It runs on the JVM. ...

March 30, 2026 · 6 min · Viktor Gamov