OpenAI's Goblin Problem Is Actually a Reinforcement Learning Problem
How a GPT-5.1 personality quirk spawned an AI-wide creature metaphor habit — and what it reveals about reinforcement learning's tendency to generalize behaviors beyond their intended scope.
IBM's Granite 4.1 Shows Data Discipline Can Beat Bigger Models
IBM's new trio of fully-dense LLMs reaches 512K-token context and outperforms a larger mixture-of-experts predecessor through rigorous data curation alone.
AlphaGo's Creator Says LLMs Are a Dead End — and Raised $1.1 Billion to Prove It
David Silver, who built AlphaGo at DeepMind, argues large language models are fundamentally capped by human data and has founded Ineffable Intelligence to pursue reinforcement learning instead.