OpenAI Claims GPT-5.5 Instant Cuts Hallucinations by Half in High-Stakes Domains
OpenAI's new default ChatGPT model reportedly achieves a 52.5% reduction in hallucinated claims on high-stakes queries, grounded in real user-flagged failure data.
OpenAI's new default ChatGPT model reportedly achieves a 52.5% reduction in hallucinated claims on high-stakes queries, grounded in real user-flagged failure data.
OpenAI's GPT-5.5 Instant is the first Instant-class model to earn a 'High capability' rating in its two most-scrutinized safety domains, triggering new safeguards.
How a GPT-5.1 personality quirk spawned an AI-wide creature metaphor habit — and what it reveals about reinforcement learning's tendency to generalize behaviors beyond their intended scope.