Even "illegible" Mythos reasoning traces seem pretty legible (lesswrong.com) AI
The post argues that the “illegible reasoning” example from Claude’s Mythos 5 system card appears much more interpretable than claimed, claiming that a seemingly word-salad chain is actually a compact description of card-move logic (including “cells” and “chunks” in a solitaire-like puzzle). It notes that a different model, Haiku 4.5, produced an approximate translation of the excerpt and uses this to suggest Mythos’ reasoning is monitorable and that any difficulty may stem from tokenization/shorthand rather than truly incomprehensible internal language.
June 13, 2026 06:45
Source: Hacker News