OpenAI: AI News Week Ending 04/17/2026

Image created with gemini-3.1-flash-image-preview with claude-opus-4.7. Image prompt: Using the provided reference image, preserve everything exactly — the marigold-orange backdrop, the seated woman with closed eyes and faint smile in her purple-and-white windbreaker, the tattooed singer in red beanie and layered red vest, his hand position and leaning serenade pose, the lighting and shallow depth of field — but replace only the black handheld microphone with a sleek matte-black knotted spiral sculpture shaped like a six-loop rosette swirl, held to his mouth at the same angle and scale, photorealistic with realistic studio lighting and subtle specular highlights. After generating the image, overlay the text “OpenAI” in the upper-left corner of the frame in large, bold, all-caps ITC Avant Garde Gothic Pro Medium (or a near-identical geometric sans-serif if unavailable), pure white (#FFFFFF), with no date, subtitle, drop shadow, or outline. The text should be substantial in scale — taking up a meaningful portion of the upper-left area — with comfortable margin from the top and left edges, set against the negative space of the orange backdrop so it does not overlap or obscure the singer, the seated woman, or the replaced object.

2. Give Claude Code your full task context upfront: goal, constraints, acceptance criteria in the first turn. This lets Claude Code do its best work.
https://x.com/_catwu/status/2044808536790847693

Anthropic CPO leaves Figma’s board after reports he will offer a competing product | TechCrunch

Bessent, Powell Summon Bank CEOs to Urgent Meeting Over Anthropic’s New AI Model – Bloomberg
https://www.bloomberg.com/news/articles/2026-04-10/anthropic-model-scare-sparks-urgent-bessent-powell-warning-to-bank-ceos

Multi-agent coordination patterns: Five approaches and when to use them | Claude
https://claude.com/blog/multi-agent-coordination-patterns

Our evaluation of Claude Mythos Preview’s cyber capabilities | AISI Work
https://www.aisi.gov.uk/blog/our-evaluation-of-claude-mythos-previews-cyber-capabilities

Five companies — Google, Microsoft, Meta, Amazon, and Oracle — now control about two-thirds of the world’s compute, up slightly from ~60% at the start of 2024. Many AI labs (including OpenAI and Anthropic) depend almost entirely on these hyperscalers for access to their compute.
https://x.com/EpochAIResearch/status/2044154042541301870

I asked Jensen: “2 out of the top 3 models in the world, Claude and Gemini, were trained on TPU. What does that mean for Nvidia going forward?” After a long technical back and forth about what the right accelerator for AI looks like (see full episode), Jensen lays down the
https://x.com/dwarkesh_sp/status/2044468295957635392

Anthropic co-founder confirms the company briefed the Trump administration on Mythos | TechCrunch

First model from Anthropic, which openly acknowledges it isn’t the best model they have
https://x.com/nrehiew_/status/2044791293080121553

Internal Anthropic survey on Claude Mythos Preview: 12/18 people thought that Mythos can manage day-long ambiguous tasks; 8/18 thought that it can execute week-long tasks
https://x.com/scaling01/status/2044787521691742338

Nearly 1/3 of surveyed people in Anthropic now think entry-level engineers and researchers are likely replaced by Mythos within 3 months
https://x.com/arankomatsuzaki/status/2044808883928186936

Read OpenAI’s latest internal memo about beating the competition — including Anthropic | The Verge
https://www.theverge.com/ai-artificial-intelligence/911118/openai-memo-cro-ai-competition-anthropic

GPT-5.4 Pro solves Erdős Problem #1196! Very pleased with this result; definitely my favourite thus far! This problem has been thought about for some time which makes this reasonably impressive and meaningful (see Lichtman’s comments below). Formalisation is underway!
https://x.com/Liam06972452/status/2044051379916882067

In my doctorate, I proved the Erdős Primitive Set Conjecture, showing that the primes themselves are maximal among all primitive sets. This problem will always be in my heart: I worked on it for 4 years (even when my mentors recommended against it!) and loved every minute of it.
https://x.com/jdlichtman/status/2044298382852927894

More on GPT-5.4 Pro’s latest mathematical contribution: “The closest analogy I would give would be that the main openings in chess were well-studied, but AI discovers a new opening line that had been overlooked based on human aesthetics and convention.”
https://x.com/gdb/status/2044436998648193333

Paul Erdos had a concept of “Proofs from The Book”, meaning that the argument is so compact and elegant that this is the proof God would’ve written down in “The Book.” After reading the GPT5.4 proof of Erdos #1196, I would say this is a Book Proof of the result. The conjecture
https://x.com/jdlichtman/status/2044307082275618993

The recent story of Erdős problem 1196 is a great example of how AIs can be used to complement and enhance human research – a quick thread.
https://x.com/thomasfbloom/status/2044319103310021078

This is becoming a pattern in AI that makes talking about capabilities challenging. First, there are overstated claims (like the flubbed Erdos problems last year), then minor wins (AI helps with discovery) then breakthroughs. The first stage feels like (& often is) hype, but…
https://x.com/emollick/status/2044455311118074124

try the TurboTax app in ChatGPT:
https://x.com/gdb/status/2044292247898992924

The next evolution of the Agents SDK | OpenAI
https://openai.com/index/the-next-evolution-of-the-agents-sdk/

We started Hiro with the vision of building an AI personal CFO. Joining @OpenAI gives us the chance to pursue that vision at a much greater scale. Important dates: – Today: Hiro is no longer accepting new signups – April 20, 2026: The product will stop working, but data export
https://x.com/hirofinanceai/status/2043751090232144159

Codex for (almost) everything | OpenAI
https://openai.com/index/codex-for-almost-everything/

Codex for (almost) everything. It can now use apps on your Mac, connect to more of your tools, create images, learn from previous actions, remember how you like to work, and take on ongoing and repeatable tasks.
https://x.com/OpenAI/status/2044827705406062670

OpenAI develops unified Codex app and new Scratchpad feature
https://www.testingcatalog.com/openai-develops-unified-codex-app-and-new-scratchpad-feature/

OpenAI tests web browsing feature on Codex Superapp
https://www.testingcatalog.com/openai-tests-web-browsing-feature-on-codex-superapp/

Microsoft Secures Former OpenAI “Stargate” Site in Norway for AI Infrastructure | TheEnergyMag
https://theenergymag.com/news/market-news/microsoft-secures-former-open-ai-stargate-site-in-norway-for-ai-infrastructure

OpenAI to spend more than $20 billion on Cerebras chips, receive stake, The Information reports
https://finance.yahoo.com/sectors/technology/articles/openai-spend-more-20-billion-013150907.html

OpenAI Stargate Execs to Join Meta’s New Compute Unit — The Information
https://www.theinformation.com/briefings/openai-stargate-execs-join-metas-new-compute-unit

OpenAI StarGate People Move To Meta Amid Data Center Boom
https://www.forbes.com/sites/johnwerner/2026/04/15/openai-stargate-people-move-to-meta-amid-data-center-boom/

Sam Altman – Attacks
https://blog.samaltman.com/2279512

Suspect in Molotov cocktail attack on Sam Altman’s home identified | The San Francisco Standard
https://sfstandard.com/2026/04/10/sam-altman-russian-hill-molotov-cocktail/

Introducing GPT-Rosalind for life sciences research | OpenAI
https://openai.com/index/introducing-gpt-rosalind/

Trusted access for the next era of cyber defense | OpenAI
https://openai.com/index/scaling-trusted-access-for-cyber-defense/

Long-running agents are the future – we’re excited to partner with OpenAI as a sandboxing partner for their new Agents SDK launch! Get started:
https://x.com/CloudflareDev/status/2044467412607901877

Migrate a Legacy Codebase with Sandbox Agents
https://developers.openai.com/cookbook/examples/agents_sdk/sandboxed-code-migration/sandboxed_code_migration_agent

OpenAI has purchased access to the FrontierMath: Open Problems verifiers. This allows them to check the validity of solutions their models generate. Thread with details.
https://x.com/EpochAIResearch/status/2044227029978284471

A lot of our education on writing well focuses on logic, clarity, and argument. AI will force us to think more about style. The boredom that comes from everything on the internet reading Claude-y now, no matter how good the substance is, should make us appreciate variety more.
https://x.com/emollick/status/2042963501199597950

All | Search powered by Algolia
https://hn.algolia.com/?dateRange=all&page=0&prefix=false&query=claude+down&sort=byPopularity&type=story

Anthropic Mythos AI Rollout Coming to US Agencies – Bloomberg
https://www.bloomberg.com/news/articles/2026-04-16/white-house-moves-to-give-us-agencies-anthropic-mythos-access

Anthropic: Claude quota drain not caused by cache tweaks • The Register
https://www.theregister.com/2026/04/13/claude_code_cache_confusion/

Anthropic’s Mythos seeded some panic and added anxiety (@matthewberman, I’m looking at you ;), but let’s think a little and calmly discuss what it means in the short term and in the long term. Let me know your thoughts
https://x.com/TheTuringPost/status/2042363395962274075

Anthropic’s random system prompt blockers are getting weirder and weirder.
https://x.com/steipete/status/2042537771865104653

Claude Code is redesigning the IDE for agentic coding. As Andrej said: “We’re going to need a bigger IDE. The basic unit is not a file, but an agent.” Cursor now has to fight to define that future of IDE too.
https://x.com/Yuchenj_UW/status/2044133573326934384

Claude Mythos #2: Cybersecurity and Project Glasswing
https://thezvi.substack.com/p/claude-mythos-2-cybersecurity-and

Coding agents are such game-changers for linux. For almost anything that doesn’t work, in the past I would have spent the afternoon, or even whole weekend, scouring forums, trying many many things, before fixing it or giving up. Now I just point codex and claude it at (and,
https://x.com/giffmana/status/2043401612035559445

Currently, ChatGPT has the best way of viewing thinking traces, a short summary of steps in the main window, and a detailed audit in the sidebar if you want it. Claude does almost as well, but more summarized and harder to see calculations and code. It’s a big weak spot for Gemini.
https://x.com/emollick/status/2043408661603594740

Given the messy naming scheme used by all the AI companies, I caused a chart to be made showing the gain in GPQA per 0.1 version in model names (estimated, since model names skip version numbers). There has never been a more misnamed model than Claude 3.7, should have been 4.4.
https://x.com/emollick/status/2044200225653326269

ICYMI — `deepagents deploy` is an open alternative to claude managed agents!
https://x.com/LangChain/status/2044097913698091496

It looks like everyone is finally catching up with the fact that agent sessions in CLI mode can only get you so far. It makes sense that the new Codex app, Cursor, and Claude Code (desktop) feel and look pretty similar now. This UI convergence is not an accident. This is a
https://x.com/omarsar0/status/2044172949003911532

Jensen Huang on Anthropic, OpenAI, China, and demand for inference tokens
https://davefriedman.substack.com/p/jensen-huang-on-anthropic-openai

OpenAI should probably bite the bullet and just name their next set of models something more human sounding. Everyone anthropomorphizes their AIs anyway, and “Claude” is an easier name to refer to than ChatGPT. Also easier to make a gerund, “Clauding,” or adjective, “Claudy-y.”
https://x.com/emollick/status/2043190951632404760

We conducted cyber evaluations of Claude Mythos Preview and found that it is the first model to complete an AISI cyber range end-to-end. 🧵
https://x.com/AISecurityInst/status/2043683577594794183

Anthropic asked Christian leaders for advice on Claude’s moral future – The Washington Post
https://www.washingtonpost.com/technology/2026/04/11/anthropic-christians-claude-morals/

Distilled recap of the back-and-forth with Jensen on export controls: Dwarkesh: Wouldn’t selling Nvidia chips to China enable them to train models like Claude Mythos with cyber offensive capabilities that would be threats to American companies and national security? Jensen:
https://x.com/dwarkesh_sp/status/2044483393941848131

Just shipped **artifact-preview** for Hermes 🔥 Like Claude Artifacts, build dashboards, games, UIs, get a full interactive preview that instantly opens in a live browser. Real clickable code, smooth refreshes on prompt edits. cc @Teknium
https://x.com/ChuckSRQ/status/2044504539978465658

Jensen regrets that when Anthropic and OpenAI first needed billions to scale, Nvidia wasn’t in a position to invest. So these labs went to hyperscalers like Microsoft, Google, and Amazon instead, and in return committed to using their compute. “I’m not going to make that same
https://x.com/dwarkesh_sp/status/2044498492450869624

Qwen 3.6 is here, and open-source! Run it locally with improved agentic coding capabilities. Try it with Claude Code: ollama launch claude --model qwen3.6 Try it with OpenClaw: ollama launch openclaw --model qwen3.6 Run it: ollama run qwen3.6
https://x.com/ollama/status/2044779844672852465

@stochasticchasm yeah they tend to forget that releases are now monthly and now bi-annually
https://x.com/scaling01/status/2044795960224592329

Anthropic Changes Pricing to Bill Firms Based on AI Use as Demand Jumps — The Information
https://www.theinformation.com/articles/anthropic-changes-pricing-bill-firms-based-ai-use-amid-compute-crunch

Anthropic introduced xhigh reasoning effort
https://x.com/scaling01/status/2044785557058814059

Anthropic loses Claude Code trust in black-box fight
https://www.implicator.ai/claude-probably-wasnt-secretly-nerfed-anthropic-made-the-black-box-too-dark/

Anthropic tests Claude Code upgrade to rival Codex Superapp
https://www.testingcatalog.com/anthropic-tests-claude-code-upgrade-to-rival-codex-superapp/

anthropic? you mean the greedy token guzzler company?
https://x.com/dejavucoder/status/2044798065530528061

every engineer at anthropic has been using mythos for ~1.5 months. meanwhile, their uptime is horrendous, claude code still has rendering bugs, etc. one could conclude that it won’t be the end of software engineering.
https://x.com/benhylak/status/2042051048261722467

GitHub reports similar improvements
https://x.com/scaling01/status/2044792459125834029

OpenAI has released a plugin that lets you call Codex directly within Anthropic’s Claude Code environment. It turns Claude Code into a multi-agent setup with Codex as a specialized coding assistant. This gives you: – High-quality code reviews – Delegation of real tasks
https://x.com/TheTuringPost/status/2044561927905677558

So we now have a pretty good picture of the state of the frontier AI model makers. US closed source models continue to lead. Google, OpenAI, and Anthropic stand well ahead of the pack, and may have signs of recursive self-improvement. xAI has fallen from frontier status for now
https://x.com/emollick/status/2042088011748290750

The pace at which Anthropic is shipping Opus variants is a very new thing in the industry.
https://x.com/_arohan_/status/2044791678180167804

The pace at which useful things are shipping also seems to be accelerating. Model releases are coming faster, of course, but so are significant application and enterprise products (especially from Anthropic). Almost certainly faster than the market can track or absorb information
https://x.com/emollick/status/2042434850003534077

we were literally stuck at 80% SWE-Bench Verified for months and just jumped to almost 90% and you guys call it mid …
https://x.com/scaling01/status/2044790717722034511

Yeah folks, it’s gonna be harder in the future to ensure OpenClaw still works with Anthropic models.
https://x.com/steipete/status/2042615534567457102

GPT-5.4 Pro for making beautiful contributions to mathematics:
https://x.com/gdb/status/2044254201505611833

Agents need computers. And they need a lot of them. Modal is an official sandbox provider for the @OpenAI Agents SDK.
https://x.com/modal/status/2044469736483000743

Build long-running agents with more control over agent execution. New capabilities in the Agents SDK: • Run agents in controlled sandboxes • Inspect and customize the open-source harness • Control when memories are created and where they’re stored
https://x.com/OpenAIDevs/status/2044466699785920937

Codex for almost everything | Hacker News
https://news.ycombinator.com/item?id=47796469

Codex now helps with more of your work, from coding to staying on top of everything around it.
https://x.com/OpenAIDevs/status/2044828214867202519

Here’s how we use Codex to: > understand large codebases > review PRs faster > build macOS apps > turn Figma into code > automate bug triage > create a CLI as agent tools > analyze datasets > generate slide decks > coordinate new-hire onboarding > learn a new concept …and
https://x.com/gabrielchua/status/2043339151278506234

Improve agent performance with a harness that keeps long-running agents on track. It manages the agent loop across tools, context, and traces. The sandbox preserves working state across pauses, retries, and resumptions.
https://x.com/OpenAIDevs/status/2044466729712304613

OpenAI x E2B: build your agents with the new OpenAI Agents SDK, powered by E2B sandboxes. We’re excited to support OpenAI as a launch partner! The new @OpenAI Agents SDK will now get dedicated sandboxes – perfect for persistent, long-running agents. With E2B, you’ll get a
https://x.com/e2b/status/2044476275067416751

To show off what you can do with @OpenAI Agent SDK + @modal, we built an ML research agent (inspired by @karpathy). It can: – Spin up GPU sandboxes of any shape – Run a pool of subagents – Persist memory – Snapshot state for fork/resume Here it is playing Parameter Golf:
https://x.com/akshat_b/status/2044489564211880169

Today we launched a major update to the OpenAI Agents SDK to help developers build and deploy long-running, durable agents in production. You can now build your own Codex-style agents using powerful primitives for modern agents – file and computer use, skills, memory and
https://x.com/snsf/status/2044514160034324793

Top things we released in Codex today: > Computer use on Mac: Codex can see, click, and type across apps > In-app browser for faster frontend, app, and game iteration > Image generation with gpt-image-1.5 > 90+ new plugins across tools like JIRA, CircleCI, GitLab, Microsoft
https://x.com/reach_vb/status/2044830689313599827

Use Vercel Sandbox with the OpenAI agents SDK as an official extension. Build agents that can run code, read files, and analyze data safely inside isolated microVMs. Control the compute and data flow from your secure cloud environment.
https://x.com/vercel_dev/status/2044492058073960733

you can build a Python agent that accepts a coding task, executes it inside a Cloudflare Sandbox, and copies the output files to your local machine @OpenAIDevs x @CloudflareDev Check out our guide here:
https://x.com/whoiskatrin/status/2044477140662395182

Your agents need a sandbox, but you need a framework in which to create your agent. We’re excited to be a sandbox provider in the new @OpenAI Agents SDK. By combining the SDK and Daytona sandboxes, you get agent orchestration and secure code execution working together out of the
https://x.com/daytonaio/status/2044473859047313464

AIE Europe Keynotes & OpenClaw ft Deepmind, OpenAI, Vercel, @pragmaticengineer, @mattpocockuk – YouTube

Introducing GPT-Rosalind, our frontier reasoning model built to support research across biology, drug discovery, and translational medicine.
https://x.com/OpenAI/status/2044861690911850863

OpenAI released GPT-Rosalind today, a frontier reasoning model purpose-built for life sciences research, available in trusted-access preview to customers like Amgen, Moderna, and the Allen Institute. On the benchmarks, it’s competent rather than revolutionary, leading on
https://x.com/kimmonismus/status/2044867786099310949

Our Life Sciences model series is available as a research preview starting today for qualified customers including @Amgen, @moderna_tx, the @AllenInstitute, and @thermofisher Scientific through ChatGPT, Codex, and the API.
https://x.com/OpenAI/status/2044861695911477643

We’re expanding Trusted Access for Cyber with additional tiers for authenticated cybersecurity defenders. Customers in the highest tiers can request access to GPT-5.4-Cyber, a version of GPT-5.4 fine-tuned for cybersecurity use cases, enabling more advanced defensive workflows.
https://x.com/OpenAI/status/2044161906936791179