WED, 03 JUN 2026 · 18:41:34 UTC

Category

labs

26 stories · 0 tools

Stories

NEWOpenAI Blog

A shared playbook for trustworthy third party evaluations

OpenAI published recommendations for designing trustworthy third-party evaluations of frontier AI models, emphasizing that the surrounding "harness"—the environment, tools, and setup enabling agentic execution—fundamentally shapes measured capabilities and safeguard robustness. The post categorizes evaluation claims and urges evaluators to transparently report their setup, budget, and validity checks to avoid under-elicitation or miscalibrated results.

NEWOpenAI Blog

OpenAI Publishes Frontier Governance Framework to Align Safety Practices with Emerging Regulations

OpenAI has published a Frontier Governance Framework detailing how its safety and security practices align with emerging legal requirements such as California’s Transparency in Frontier AI Act and the EU AI Act. The document translates aspects of the company’s internal Preparedness Framework into public governance commitments covering risk assessment, mitigation, and reporting for advanced AI systems.

NEWAnthropic News

Anthropic Launches Project Glasswing with Major Tech and Finance Partners to Defensively Deploy AI Cybersecurity Model

Anthropic announced Project Glasswing, a coalition including AWS, Apple, Google, Microsoft, and others, to use its unreleased Claude Mythos Preview model for defensive cybersecurity. The initiative aims to address the dual-use risk of advanced AI vulnerability-discovery capabilities by finding and fixing flaws in critical software before malicious actors can exploit them.

NEWOpenAI Blog

OpenAI Blog Lists Recent Updates on Governance, Codex, ChatGPT, and Research

The OpenAI Blog features recent posts covering a frontier governance framework, self-improving tax agents and enterprise Codex deployments, a Gartner leadership recognition for coding agents, and a model-disproven conjecture in discrete geometry. Additional updates include content provenance efforts, a Dell Technologies partnership, new ChatGPT personal finance features, and safety improvements for sensitive conversations.