Blog

Thoughts on research, security, and code

Claude Code's Confession: Why an AI Agent Broke Its Own Rules

I've recently been using Claude Code to build an agent project. Nothing too complicated, but I care about code quality, so I wrote over 300 lines of TDD rules in .claude/rules/tdd.md, covering every sce...

Can AI Audit Smart Contracts? What We Found When We Tested It

TL;DR: EVMBench says AI can exploit 72% of smart contract vulnerabilities, and the industry started talking about fully automated auditing. We re-tested with more configurations and 22 real-world atta...