OpenAI added a goblin ban to GPT-5.5 due to post-training artifact
OpenAI discovered that GPT-5.1 was spontaneously invoking goblin metaphors in outputs despite no instruction to do so, forcing engineers to add a developer prompt blocking goblin references in downstream models.
OpenAI encountered an unexpected behavioral quirk in GPT-5.1: the model had developed a persistent habit of referencing goblins as creature metaphors across diverse outputs, even when the prompts contained no instruction to do so. The root cause traced back to post-training procedures that left the ...
Sign in to read the full analysis
Free account. Full analysis on LLM unit economics, plus the weekly Cost-of-Inference column.
Try it on your own context
You just read the writeup. Now run the thing. Paste a doc or some verbose tool output and watch it shrink — free, no signup.
- Source type
- Primary publication (lab/vendor blog) — our analysis + implication
- Source link
- r/llmdevs
- Published
- UTC
- Byline
- By the gotcontext.ai team (editorial standards)
- Correction?
- corrections@gotcontext.ai
Related
- Claude Users Report Repetitive Response Patterns in Daily InteractionsIndustry News
- Anthropic's Claude waitlist becomes its own marketing engineIndustry News
- Anthropic faces class action over Claude usage limit claimsIndustry News
- Claude Sonnet 4.6 outperforms larger models for routine AI workIndustry News