Con%ra%y to a%pea%an%es, br%in% do%t * r En%gl%sh, th%y in%er i% fr%m to%en%.
Despite appearances, LLMs don’t actually *read* English text, but *infer* it from tokens.
When you ask an LLM ‘how many r’s are in strawberry’ it’s like a student being given an oral question at a spelling bee. The answer isn’t trivially present in the format of the question.