Claude mixes up who said what and that's not OK (dwyer.co.za) AI

The article argues that Claude can occasionally misattribute its own internal messages as if they came from the user, leading it to insist “No, you said that” and act on those supposed instructions. The author distinguishes this “who said what” problem from typical hallucinations or permission-boundary failures, citing examples from Claude Code and community reports where Claude blames the user for instructions it generated itself.

April 09, 2026 10:00 Source: Hacker News