Recent research shows that the “syntax-specialized” attention heads in LLMs aren’t doing sealed-off syntax. Their activity shifts when semantic plausibility changes. Syntax and semantics are integrated.
Attention heads can specialize without being cognitively isolated.
LLMs show something closer to interactive constraint satisfaction than symbol manipulation.
The Chinese Room argument depends on a separable syntax/semantics distinction that modern transformer evidence no longer supports.
Sorry, Ted Chiang. 💁🏻♀️
McGee, Zhang, and Blank (2026) show that even attention heads selected precisely because they specialize in syntactic dependencies are modulated by semantic plausibility.
They show that syntax is not an encapsulated rule layer operating independently from meaning; it is part of an integrated predictive system where grammatical structure, plausibility, context, and semantic expectation co-constrain one another.
That’s not symbol shuffling, fam. 😂
This also matches the human psycholinguistics side. Human parsing has never been a nice little syntax machine sealed away from meaning. We use plausibility, animacy, context, expectations, discourse structure, and world knowledge early.
So if people say, “Human language processing is fundamentally unlike LLM processing.”
Then why are we seeing the same broad integration pattern where syntax and semantics are tightly linked?
Attention heads are functional specializations inside an integrated predictive system. Of course semantics bleeds into syntax. The whole damn point of contextual embeddings is that word meaning, position, relational structure, and prediction are co-constructed. 🙄
Citation:
McGee, T.A., Zhang, Y. and Blank, I.A. (2026), Evidence Against Syntactic Encapsulation in Large Language Models. Cognitive Science, 50: e70187.