An instrument for reading a model's mind
A model lies with the same confidence it tells the truth.
The tell is not in the words. There is none.
It is in the mind.
It tells knowing from making up.
On 35 entities it knows versus 35 it has never seen, the familiarity gauge separates them at AUROC 1.000 — reading only the model's mid-stream. The two never touch: known reads 0.78 ± 0.12, invented reads 0.04 ± 0.09.
Measured on a served 35-billion-parameter model, held-out and length-controlled; the off-distribution gauge holds across ten languages. The reading in the instrument above is the same engine, live.