A. they do some kind of vector steering towards exploration that just boosts "goblin" related logits?
B. pre-training / post-training quirk like the em-dash?
I think its B, but A would be monstrously cool
It's true. Here's a plot of GPT models and their usage of "goblin", "gremlin", "troll", etc over time. There's no anti-gremlin system instruction on our side, we get to see GPT-5.5 run free.