
AI isn’t the only one to hallucinate

Behold the two big fat simplified Chinese characters that Google OCR dropped in the middle of the page of the Victorian history book I’m working on. They were about that size, too.

Google Translate tells me it means “Lulu” – but of course it doesn’t mean anything. It’s pure glitch.

I asked ChatGPT to explain, and it said modern OCR is trained on a multilingual character set and isn’t smart enough to stick with Roman characters. The OCR saw a shadow or pattern I can’t see and dropped a character in. And once OCR has farted one out, ChatGPT said, it’s not uncommon for it to repeat the character.

Google OCR has a little AI-sauce mixed in – which is one of the reasons its OCR is so clean – but not enough to know there wouldn’t be two big fat Chinese characters in the middle of an English history book.
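For what it’s worth, catching this kind of glitch after the fact doesn’t take much. Here’s a rough Python sketch that flags any letter whose Unicode name doesn’t start with the script you expect. The function name, the sample sentence and the two placeholder characters in it are all made up for illustration; this is not anything Google’s OCR does, just a check you could run over its output.

```python
import unicodedata


def find_stray_characters(text, expected_script="LATIN"):
    """Return (index, char, unicode_name) for each letter outside the expected script.

    Hypothetical helper: it relies on Unicode character names starting with
    the script name (e.g. "LATIN SMALL LETTER A"), which is a rough heuristic.
    """
    strays = []
    for index, char in enumerate(text):
        # Skip digits, spaces and punctuation; only letters carry a script.
        if not char.isalpha():
            continue
        name = unicodedata.name(char, "")
        if not name.startswith(expected_script):
            strays.append((index, char, name))
    return strays


# Made-up sample line with two placeholder CJK characters dropped in.
sample = "The Reform Act of 1832 露露 extended the franchise."
for index, char, name in find_stray_characters(sample):
    print(index, char, name)
```

It only looks at character names, so it would also flag a legitimate Greek or Cyrillic quotation. It’s a way to find suspects for a human to look at, not a fix.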

This stuff isn’t taking anybody’s job yet. Managers who are firing on that basis are kidding themselves (or using it as an excuse).

April 1, 2026 — 6:01 pm
Comments: 2