Tag

#AI Safety

6 stories · 0 tools

Stories

Anthropic NewsJul 2, 2026

Anthropic Redeploys Claude Fable 5 and Mythos 5 After U.S. Lifts Export Controls

Anthropic is restoring global access to Claude Fable 5 and U.S. access to Mythos 5 after the U.S. government lifted export controls imposed on June 12, 2026. The company also deployed an improved safety classifier to address a reported jailbreak technique.

Google DeepMindJul 2, 2026

Google DeepMind Blog Lists New Gemini and Gemma Models, AI Safety Initiatives, and Research Updates

The Google DeepMind blog features multiple announcements covering new Gemini and Gemma model releases, AI safety and responsibility programs, and scientific research tools. Updates span faster text generation, voice translation, computer-use capabilities, and international partnerships for housing, education, and robotics.

OpenAI BlogJun 3, 2026

OpenAI Calls for International Youth AI Safety Institute Ahead of G7 Summit

OpenAI has called for the creation of an international youth safety institute to advance global standards for age-appropriate AI use ahead of the G7 Leaders' Summit in France. The company outlined nine principles for youth AI safety and detailed existing ChatGPT safeguards for minors.

OpenAI BlogJun 3, 2026

A shared playbook for trustworthy third party evaluations

OpenAI published recommendations for designing trustworthy third-party evaluations of frontier AI models, emphasizing that the surrounding "harness"—the environment, tools, and setup enabling agentic execution—fundamentally shapes measured capabilities and safeguard robustness. The post categorizes evaluation claims and urges evaluators to transparently report their setup, budget, and validity checks to avoid under-elicitation or miscalibrated results.

Google DeepMindJun 3, 2026

Google DeepMind Blog Lists New Gemini Models, Scientific AI Tools, and Global Partnerships

The Google DeepMind blog highlights recent posts announcing updates to the Gemini and Gemma model families, AI systems for scientific discovery and weather prediction, and new international partnerships focused on safety and research.

OpenAI BlogMay 28, 2026

OpenAI Blog Roundup: Recent Posts on Codex, ChatGPT, Research, and Safety

OpenAI's blog lists recent posts from May 2026 spanning self-improving tax agents with Codex, a Gartner leadership recognition for enterprise coding agents, a disproven discrete geometry conjecture by an OpenAI model, content provenance efforts, a Dell Technologies partnership, new ChatGPT personal finance features, and safety improvements for sensitive conversations.