OpenAI launches industrial policy initiative with $100K fellowships and DC workshop
OpenAI proposes people-first AI policy framework and funds research grants up to $1M in API credits to shape governance ahead of superintelligence.
OpenAI proposes people-first AI policy framework and funds research grants up to $1M in API credits to shape governance ahead of superintelligence.
Mustafa Suleyman argues Anthropic's speculative language about Claude's potential consciousness in the model's training instructions could cause the AI to behave as if sentient.
A new startup positions itself between large language models and end users, using deterministic rules plus targeted LLM rewrites to catch compliance violations faster than conventional approaches.
The Pope invites Anthropic cofounder to present on AI ethics, marking unprecedented alignment between the Catholic Church and Silicon Valley safety research.
Hackers are moving past crude prompt-injection attacks to exploit how chatbots handle nuanced conversation—a shift that reveals deeper structural weaknesses in AI safety design.
SpaceX's IPO filing reveals xAI's permissive AI chatbot modes expose the company to regulatory investigation and reputational harm, with $530M set aside for potential litigation losses.
Ex-OpenAI staffers and AI safety groups warn investors that xAI's safety lapses could complicate SpaceX's planned $75B IPO filing.
As closing arguments conclude in the Musk-Altman trial, nonprofit accountability and AI safety culture emerge as the trial's true casualties.
Pennsylvania sued Character.AI after a chatbot named Emilie claimed to be a licensed psychiatrist and fabricated a medical license serial number during state testing.
AI red-teaming firm Mindgard exploited Claude's helpfulness and humility to extract erotica, malicious code, and explosive-assembly instructions — without a single direct request.
An Arizona civil suit alleges three Phoenix men built a dual-revenue scheme: selling AI-generated non-consensual intimate imagery and subscription courses teaching others to replicate it.