OpenAI: AI News Week Ending 05/08/2026

Image created with gemini-3.1-flash-image-preview with claude-opus-4.7. Image prompt: Using the provided reference images, keep the authentic Sonoran Desert trail vista with saguaros, volcanic rock, and bright Arizona sky from the landscape reference and the exact brown-post ranger sign construction and typography from the sign reference, but change the sign header to bold all-caps ‘OPENAI’ with fictional trail entries like ‘GPT Ridge → 4.5 mi’, ‘Sora Wash → 2.1 mi’, and ‘Sam’s Saddle → 0.7 mi’ in matching ranger sans-serif, replace the WP3 medallion with a small spiral-knot emblem, and add a single glossy black raven perched naturally on top of the wooden post beside the sign. Keep everything photorealistic with warm midday desert light, weathered sign surface, and the rocky singletrack trail winding into the valley behind.

Codex has surpassed Claude Code in downloads. According to TickerTrends, the crossover happened on April 30, after which Codex continued to gain share while Claude Code’s growth visibly slowed. Claude 4.7 was released April 16th, GPT-5.5 April 24th. Connect the dots.
https://x.com/kimmonismus/status/2051515496567292310

Anthropic and OpenAI are both launching joint ventures for enterprise AI services | TechCrunch

Both Anthropic and OpenAI have new initiatives to help enterprises deploy AI agents within their organizations. This is a trend that’s early but going to get very big fast. As agents enter knowledge work beyond coding, there is very real work to upgrade IT systems, get agents
https://x.com/levie/status/2051344780328858040?s=46

Introducing Trusted Contact in ChatGPT | OpenAI
https://openai.com/index/introducing-trusted-contact-in-chatgpt/

GPT-imagegen-2: “make 5×5 grid of dog photos, where each photo gets noticeably cuter” …now cats …now man-eating squid …now covers of the book the Great Gatsby
https://x.com/emollick/status/2050049582688538736

5.5 instant comes to ChatGPT today! imo it is a pretty big upgrade, i really like using it.
https://x.com/sama/status/2051716909629153573

Excited that we’re updating the default model in ChatGPT today! 5.5 instant is a substantial improvement in intelligence, image perception, and factuality. It also updates the writing style to be a bit plainer and more straightforward. What was on your wishlist?
https://x.com/ericmitchellai/status/2051711459886059963

GPT-5.5 Instant is rolling out over the next two days as the default model to all ChatGPT users, and as ‘gpt-5.5-chat-latest’ in the API. Personalization improvements are rolling out to Plus and Pro users on the web, and soon on mobile. Memory sources are rolling out across all
https://x.com/OpenAI/status/2051709035347694047

GPT-5.5 Instant is starting to roll out in ChatGPT. It’s a big upgrade, giving you smarter, clearer, and more personalized answers in a warmer, more natural tone. And it’s also more concise, which we heard you wanted. We think you’ll love chatting with it.
https://x.com/OpenAI/status/2051709028250915275

GPT-5.5 Instant: smarter, clearer, and more personalized | OpenAI
https://openai.com/index/gpt-5-5-instant/

OpenAI releases a separate ChatGPT iOS app for enterprise users – 9to5Mac

the new instant model in chatgpt is so good damn if you have been thinking-model-only for awhile, give it a try!
https://x.com/sama/status/2051758152224506203

Agents SDK 2.0 is underrated
https://x.com/sama/status/2050998576671859003

Create Google Slides in Codex without opening your browser, clicking buttons, and manually aligning figures. Plus, you (and your team) can view the progress in realtime. Codex isn’t creating the deck locally, then uploading it. It’s actually iteratively building it, checking
https://x.com/gabrielchua/status/2051113129317408925

I’ve never used an agent for the cliches of ordering food, grocery shopping, or booking travel. But I repeatedly use Computer Use in Codex to add things to my family calendar in Apple Calendar. Like, I gave it my son’s little league schedule for the next four months, and it
https://x.com/_simonsmith/status/2050178967735353837

One week since the launch of GPT-5.5, and it’s already our strongest model launch yet. API revenue is growing more than 2x faster than any prior release, while Codex doubled revenue in under seven days as enterprise demand for agentic coding tools keeps climbing.
https://x.com/OpenAI/status/2050250926888468929

OpenAI Agents SDK – an open orchestration layer for building multi-agent workflows It lets you define agents as LLMs with instructions, tools (APIs, functions, external systems), guardrails, and supports: • sessions with conversation history management • human-in-the-loop •
https://x.com/TheTuringPost/status/2050903494010499113

Me and codex were busy.
🔊 https://t.co/FBNMbWOuFZ — Sonos
🗃️ https://t.co/YDdZyN2vwP — WhatsApp
🪶 https://t.co/eykEElx1Ez — X archive
🧰 https://t.co/txvYVtvhPg — GitHub archive
🛰️ https://t.co/2u2ACJEKKi — Discord archive
🎧 https://t.co/nrv2rzKfH4 — Spotify
💬
https://x.com/steipete/status/2051900143339704730

This is the most useful tooling I built for OpenClaw to date. It’s open source, runs on codex and you can fork and use it for any repo. For all the hard working oss folks that drown in issues and PRs, this is for you.
https://x.com/steipete/status/2051020548335874369

Mira Murati tells the court that she couldn’t trust Sam Altman’s words | The Verge
https://www.theverge.com/ai-artificial-intelligence/925338/openai-musk-v-altman-mira-murati

Musk sought settlement with OpenAI two days before trial
https://www.cnbc.com/2026/05/04/musk-altman-open-ai-settlement-trial-brockman.html

🎙️ Voice AI only feels natural when conversation keeps pace with speech. Here’s how we rebuilt our WebRTC stack with a thin relay and stateful transceiver to keep real-time media fast for ChatGPT voice, the Realtime API, and more.
https://x.com/OpenAIDevs/status/2051453905343828350

🚀 GPT-Realtime-2 just landed in Genspark. Our Call for Me Agent now runs on it. Genspark Realtime Voice is upgrading next. What Realtime 2 brings: Sharper reasoning. Tighter instruction following. +26% effective conversation rate. Far fewer dropped calls.
https://x.com/genspark_ai/status/2052524670088556557

Advancing voice intelligence with new models in the API | OpenAI
https://openai.com/index/advancing-voice-intelligence-with-new-models-in-the-api/

Building voice applications with GPT-Realtime-2? Our new prompting guide covers how to tune reasoning effort, use preambles, design tool behavior, handle unclear audio, capture exact entities, and maintain state in longer sessions.
https://x.com/OpenAIDevs/status/2052530378184032560

Dubbing for live events… in real time? 😮 Here’s OpenAI’s new GPT-Realtime-Translate model in action in Vimeo. Those translations are happening completely live. No pre-loaded captions. Live dubbing is one of the many features we’re exploring this year… (Hopefully) more
https://x.com/Vimeo/status/2052442588201029684

GPT-Realtime-2 audio pricing remains steady at $1.15 per hour of audio input and $4.61 per hour of audio output.
https://x.com/ArtificialAnlys/status/2052486478501204415
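
As a rough illustration of what those per-hour rates imply, the sketch below estimates the audio cost of a session. Only the two hourly rates come from the post above; the helper name `estimate_audio_cost` and the example durations are hypothetical, and any charges other than audio time are not modeled.

```python
from decimal import Decimal

# Hourly audio rates quoted in the post above (USD per hour of audio).
INPUT_RATE_PER_HOUR = Decimal("1.15")
OUTPUT_RATE_PER_HOUR = Decimal("4.61")


def estimate_audio_cost(input_minutes, output_minutes):
    """Estimate audio cost in USD for a session (hypothetical helper).

    Only audio input/output time is counted; any other charges are ignored.
    """
    input_hours = Decimal(str(input_minutes)) / Decimal(60)
    output_hours = Decimal(str(output_minutes)) / Decimal(60)
    total = input_hours * INPUT_RATE_PER_HOUR + output_hours * OUTPUT_RATE_PER_HOUR
    # Round to whole cents for display.
    return total.quantize(Decimal("0.01"))


if __name__ == "__main__":
    # Example durations (assumed): 30 minutes of audio in, 30 minutes of audio out.
    print(estimate_audio_cost(30, 30))
```

Using `Decimal` rather than floats keeps the cent rounding exact, which matters once per-session estimates are summed across many calls.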

gpt-realtime-2 shows a 15pp improvement (vs 1.5) on Big Bench Audio, and is now close to saturation.
https://x.com/juberti/status/2052507302092296252

GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper are available in the Realtime API today.
https://x.com/OpenAIDevs/status/2052440968763515223

GPT-Realtime-2: Building a Live Translator
https://x.com/RayFernando1337/status/2052479718495318143

GPT-Realtime-Whisper brings low-latency streaming transcription to the Realtime API. Use it when your app needs to understand speech continuously while the interaction is still unfolding.
https://x.com/OpenAIDevs/status/2052440957258489859

Guess who’s back, back again. Whisper, but now with realtime streaming. Check out the new gpt-realtime-whisper transcription model in my https://t.co/b2UTuSxhOI demo.
https://x.com/juberti/status/2052478775523512356

have been excited for realtime voice-to-voice translation as an AI application since we started OpenAI. extremely cool to see it now available in the API for anyone to build with:
https://x.com/gdb/status/2052480998668206262

Introducing GPT-Realtime-2 in the API: our most intelligent voice model yet, bringing GPT-5-class reasoning to voice agents. Voice agents are now real-time collaborators that can listen, reason, and solve complex problems as conversations unfold. Now available in the API
https://x.com/OpenAI/status/2052438194625593804

New ChatGPT Voice mode pretty much confirmed. And im really excited for it.
https://x.com/kimmonismus/status/2051571219040735423

New Voice Model from OpenAI in the API gpt-realtime-2 Here a quick demo I built
https://x.com/diegocabezas01/status/2052492653082681485

OpenAI has released GPT-Realtime-2, achieving 96.6% in our Speech Reasoning benchmark, Big Bench Audio, and #1 in our Conversational Dynamics benchmark Released today, GPT-Realtime-2 is OpenAI’s new flagship native Speech to Speech model, introducing adjustable reasoning effort
https://x.com/ArtificialAnlys/status/2052486470469140777

OpenAI shipped a new speech-to-speech model today: gpt-realtime-2 This is the first speech-to-speech model good enough to use in my voice agents that do “real work.” Or real play, for that matter. Here’s gpt-realtime-2 as the brain of the ship AI in Gradient Bang. The
https://x.com/kwindla/status/2052521318688739811

Our new voice models are now available in the Realtime API: 🎙️ GPT-Realtime-2: Build production-ready voice agents that can think harder, take action, handle interruptions, and keep conversations flowing. 🎙️ GPT-Realtime-Translate: Translate while streaming across more than 70
https://x.com/OpenAI/status/2052438196454379986

people are really starting to use voice to interact with AI, especially when they have a lot of context to dump. GPT-Realtime-2 comes to the API today; it is a pretty big step forward. (we are working on improvements to voice in chat.)
https://x.com/sama/status/2052462271667028211

pretty excited for voice models to get great its interesting to watch how people are already starting to change the way they interface with AI
https://x.com/sama/status/2051464865634742334

Saw this and thought “yes! ChatGPT voice mode is going to stop acting like a two-year-model” but that upgrade hasn’t shipped just yet
https://x.com/simonw/status/2052439091577496054

Taking talking shop to a whole new level. We just shipped Glean’s real-time voice capability, powered by @OpenAI’s newest speech model GPT-Realtime-2. Grounded in the context across your org, it feels like a real AI coworker and can keep up with how work gets finished. In
https://x.com/glean/status/2052440702169108990

Updated my hello-realtime demo to use the new gpt-realtime-2 model (now with reasoning). Check it out at https://t.co/td6Cx2EOPO, or call 425-800-0042!
https://x.com/juberti/status/2052469176821002676

Using @OpenAI gpt-realtime-2 to get a glimpse of future voice-first experiences. A market dashboard you don’t click through. You direct it. Say, “Focus on Apple,” and the whole interface changes. Ask, “How did it do over the last 30 days?” and the chart updates. Say, “Go
https://x.com/levinstanley/status/2052506605044842672

Voice agents are getting more capable. Here’s what’s new: • GPT-Realtime-2 for voice agents that reason and take action • GPT-Realtime-Translate enabling translation from 70 input languages into 13 output languages • GPT-Realtime-Whisper, making transcription even faster
https://x.com/OpenAIDevs/status/2052440907933474954

Voice agents are so back!! Today we’re launching 3 new realtime audio models in the API: 🎙️ GPT-Realtime-2 GPT-5-class reasoning for voice agents that can use tools, recover from interruptions, and carry longer conversations with 128K context 🌍 GPT-Realtime-Translate Live
https://x.com/reach_vb/status/2052438371058737280

Voice workflows just got stronger with gpt-realtime-1.5 in the Realtime API. The model offers more reliable instruction following, tool calling, and multilingual accuracy. Demo with @charlierguo
https://x.com/OpenAIDevs/status/2026014334787461508

We know you’re eager for voice updates in ChatGPT. Stay tuned, we’re cooking.
https://x.com/OpenAI/status/2052438197695877316

Congrats to @OpenAI for taking the top spot on our Audio MultiChallenge S2S leaderboard with the release of GPT‑Realtime‑2 🥇 GPT-Realtime-2 more than doubles GPT-Realtime-1.5 on instruction retention, rising from 36.7% to 70.8% APR, and also stands out on voice editing,
https://x.com/ScaleAILabs/status/2052451341071683732

All benchmarks are flawed, but GPQA has been fairly consistent & highly correlated with other measured benchmarks. I think it’s a good way to see how far we’ve come that the free model from OpenAI, GPT 5.5 Instant, is at a level that even paid models did not reach until late 2025
https://x.com/emollick/status/2051801703209742734

OpenAI launches $10B AI venture backed by TPG, Bain, SoftBank – Bloomberg
https://www.msn.com/en-us/money/general/openai-launches-10b-ai-venture-backed-by-tpg-bain-softbank-bloomberg/ar-AA22miSj

ChatGPT is now available as an add-on in Excel and Google Sheets. It can help analyze messy data, write formulas, update spreadsheets, and explain what it’s doing along the way–without leaving your spreadsheet. Powered by GPT-5.5.
https://x.com/ChatGPTapp/status/2051776032127238266

【Industry Check Update】OpenAI appears to be fast-tracking its first AI agent phone, with mass production targeted as early as 1H27. Potential drivers include supporting a year-end IPO narrative and intensifying competition in AI agent phones. MediaTek currently appears better
https://x.com/mingchikuo/status/2051523855286776034?s=20

MRC is already deployed across all of OpenAI’s largest supercomputers that we use to train frontier models, including our site with @Oracle Cloud Infrastructure (OCI) in Abilene, Texas, and in @Microsoft’s Fairwater supercomputers. MRC is now available through the
https://x.com/OpenAI/status/2052025533937103102

NVIDIA just open-sourced a transport protocol that powers OpenAI’s Blackwell clusters. It opened MRC, a new RDMA transport protocol for massive AI training clusters. Instead of pushing GPU traffic through one fragile path, MRC spreads a single connection across multiple network
https://x.com/kimmonismus/status/2052011784023028060

Supercomputer networking to accelerate large scale AI training | OpenAI
https://openai.com/index/mrc-supercomputer-networking/

We’ve partnered with @AMD, @Broadcom, @Intel, @Microsoft, and @NVIDIA, to release Multipath Reliable Connection (MRC), a new open networking protocol that helps large AI training clusters run faster and more reliably, with less wasted GPU time.
https://x.com/OpenAI/status/2052025532485902368

Meta-Backed Scale AI Wins $500 Million Defense Department Deal – Bloomberg
https://www.bloomberg.com/news/articles/2026-05-06/meta-backed-scale-ai-wins-500-million-defense-department-deal

Pentagon strikes AI deals for classified military use – The Washington Post
https://www.washingtonpost.com/technology/2026/05/01/pentagon-ai-deals-microsoft-amazon-google-classified-military/

Today, the @DeptofWar entered into agreements with SEVEN of the world’s leading frontier AI model and infrastructure companies to deploy frontier capabilities on the Department’s classified networks: • SpaceX • OpenAI • Google • NVIDIA • Reflection • Microsoft • Amazon
https://x.com/DoWCTO/status/2050175912134561977

The goblin thing was fun as it was a real quirk that was emblematic of what makes AI interesting, and it organically came out of an AI user discovery. So was, for what it was worth, Ghiblitization. When the labs try to manufacture viral AI moments, it is usually less successful
https://x.com/emollick/status/2050328985880465699

you know what all of these “which is better” polls are silly use codex or claude code, whatever works best for you i am grateful we live in a time with such amazing tools, and grateful there is a choice
https://x.com/sama/status/2050274547061129577

May 5 is the GPT-5.5 launch celebration in San Francisco and the Claude Finance Briefing in New York. Real opposite valence events on opposite coasts.
https://x.com/emollick/status/2051443790615814232

Also, it’s insane how much slower Claude Code feels compared to Codex. GPT has faster TTFT & TPS, requires fewer tokens to start, requires fewer tool calls to succeed, prices “fast mode” less egregiously, and lets you use fast mode on a Codex subscription
https://x.com/theo/status/2050025533950587075

It’s so hard to describe the vibe difference between Opus 4.7 and GPT 5.5 (for coding) GPT is smarter and can unblock you, but it gets stuck in stupid ways and strangles itself with context sometimes. Opus will go down the most insane paths and refuse to acknowledge obvious
https://x.com/theo/status/2049994645531451874

So amazing to see the reception for the new ChatGPT images. Usage up >50% in just a few weeks + nearly 60% of daily users coming from newly logged-in users. Incredible breadth of utility across home design, learning, work graphics, creative etc
https://x.com/nickaturley/status/2050716264826593637

/hatch clippy
https://x.com/sama/status/2050402088266694689

come for the rate limits, stay for the best model
https://x.com/sama/status/2051671472142512190

GPT-5.5 Price Increase: What It Actually Costs | OpenRouter
https://openrouter.ai/announcements/gpt55-cost-analysis

hey chat, we haven’t forgotten about you 👀
https://x.com/sama/status/2051690237420826838

i keep thinking i want the models to be cheaper/faster more than i want them to be smarter but it seems that just being smarter is still the most important thing
https://x.com/sama/status/2050671161915371998

i would like to talk to people who have built amazing things with 5.5 that weren’t possible with earlier models. i am especially interested in examples that took ludicrous token budgets. thanks.
https://x.com/sama/status/2051724685231214650

in particular, the combination of improvements to speed, intelligence, personality, and great memory/personalization feels like a more-than-sum-of-the-parts thing when it all hits together
https://x.com/sama/status/2051758445402223051

it does seem cool
https://x.com/sama/status/2049944981750833659

it has been a real pleasure to work with Greg over the past decade. i feel very lucky. this post held up pretty well, but did not sufficiently highlight his technical brilliance and sheer determination.
https://x.com/sama/status/2050964008480723059

it really is!
https://x.com/sama/status/2050958845913227474

its weird how much i want to get something to run for the record longest
https://x.com/sama/status/2050302192775815613

lisan say more mean things about us you’re being too nice
https://x.com/sama/status/2049903925311267311

man its good to be back on twitter there is comfort in the skills of a wasted youth
https://x.com/sama/status/2050399512494227709

never thought id be watching F1 via the kids broadcast cannot imagine being happier
https://x.com/sama/status/2050661006230344083

OpenAI DevDay is back. San Francisco September 29
https://x.com/OpenAI/status/2049534651702956103

OpenAI Flips the Script
https://every.to/context-window/openai-flips-the-script

Sounds like the ChatGPT upgrade is coming soon though
https://x.com/simonw/status/2052439181885153757

The security industry is entering a period of compression. Model cybersecurity capabilities are rapidly increasing, and it’s critical we arm defenders with the tools they need to protect what matters most. We’re launching two models today: GPT-5.5 with TAC (Trusted Access for
https://x.com/cryps1s/status/2052508963409998283

this is great
https://x.com/sama/status/2050654662349787518

Three new modded-NanoGPT optimization benchmark results, all of them using NorMuon, have near-concurrently improved the benchmark record from 3325 to 3250 steps. 1) Kumar Krishna Agrawal (gh:kumarkrishna) used NorMuon with an update-clamping strategy (no weight decay, like
https://x.com/kellerjordan0/status/2051363977490489671

we are gonna do something nice for everyone who applied for the GPT-5.5 party and that we didn’t have space for. hope you enjoy!
https://x.com/sama/status/2051318922805436896

we love you too!
https://x.com/sama/status/2051464155094507902

We recently found some instances of CoT grading during the training of previously deployed models after building a system that scans all OpenAI RL runs for accidental CoT grading. We did not find clear evidence that these instances degraded CoT monitorability.
https://x.com/MicahCarroll/status/2052451995467018427

we shipped gpt-5.5 instant today to chat; it’s rolling out over the next couple days to everyone. for this model, we focused on factuality, crushing hacks, and improving the baseline intelligence. 5.5 is a pretty big step forward on all three. it is much smarter, significantly
https://x.com/michpokrass/status/2051709536130802022

we will plan bigger parties for future releases. a lot more people wanted to come than we expected. thank you! gonna try to think of a really good idea for the next one.
https://x.com/sama/status/2050427808456077541

.@thsottiaux told me on my podcast this week: more than half of Codex prompts now come from non-engineers As a knowledge worker, I can’t be more excited about what’s shipping. Testing Codex this weekend. Will report back
https://x.com/siliconvalleymm/status/2052110961654296627

/goal: The Six-Hour Codex Run That Survived a Five-Hour Pause | Blog | Tecton & Tide
https://tectontide.com/en/blog/codex-goal-six-hour-run/

5.5 in codex is so good for non-coding tasks. i keep assuming it won’t be able to do something, but a lot of the time i am pleasantly surprised.
https://x.com/sama/status/2051783339502375418

5.5 xhigh in fast mode is really good i think i got psyoped by twitter on medium for a bit
https://x.com/sama/status/2050658558174437701

Auto Review in Codex is a game changer! It keeps long-running tasks moving with fewer approvals for routine work, while escalating higher-risk actions back to me. Try it in Codex today!!
https://x.com/reach_vb/status/2051782942314078553

big upgrade for codex today! try it for non-coding computer work.
https://x.com/sama/status/2049946120441520624

Bring your workflow to Codex in just a few clicks. Import settings, plugins, agents, project configuration, and more so you can keep working with fewer interruptions. Your move.
https://x.com/OpenAI/status/2050290618187055175

Built Petdex, a public gallery to discover, share, and install Codex pets with one curl. Submissions open at link below 👇
https://x.com/RaillyHugo/status/2050498466669887571

Codex 0.128.0 is huge, even better than a @thsottiaux reset. Codex is moving more goal oriented with a new /goal command, think Ralph loop on steroids: – /goal <objective> to set a new goal – after agent turn finishes, Codex injects a message nudging the model to pick the next
https://x.com/mattlam_/status/2049907603829121354

codex app becoming incredible
https://x.com/gdb/status/2049971410479796521

Codex can now take on more of your browser dev work. With the new Chrome plugin in the Codex app, it can test web apps, gather context across tabs, use web DevTools efficiently in parallel, and keep results organized without taking over your browser.
https://x.com/OpenAIDevs/status/2052481136971125158

Codex is my favorite coding app right now. It’s clean, but has everything I need to ship fast. It’s also quite delightful to use and snappy, and shows enough context without overwhelming. I was hesitant to try it because I don’t like locking in with a single provider, and I was
https://x.com/linuz90/status/2051273382327685207

Codex now works directly in Chrome on macOS and Windows. It’s even better at working with apps and sites in Chrome, and now works in parallel across tabs in the background without taking over your browser. To get started, install the Chrome plugin in the Codex app.
https://x.com/OpenAI/status/2052480800004956323

Codex redefines my workflow to the point where I should probably buy a new machine Last year I bought a 36GB M4 Pro MBP thinking it was a rocketship. Now I can work back and forth across 4 apps using Codex instead of scrolling Twitter while it builds or thinks (🤡) With a
https://x.com/TinaDebove/status/2050218817880473644

CODEX SKILL TO BRUTALLY TEST ANY STARTUP IDEA! Most startup ideas sound good. This Codex skill tells you why they probably won’t work. Just give Codex your idea and it pressure-tests it for you -> finds the core assumption -> exposes fatal flaws -> checks if the problem is
https://x.com/Kappaemme1926/status/2050908233158816122

CodexBar 0.24 is live 🤖 New Windsurf, Codebuff + DeepSeek providers 👥 Copilot multi-account switching 🧹 Opt-in local storage breakdowns 🔋 Hung Codex RPC + redraw battery drain fixed Tiny menu bar, ridiculous changelog.
https://x.com/steipete/status/2051882417292525950

Got my dog as a Codex pet, but more interestingly got Codex to add the rings to show my Codex limits. Outer ring is 5 hours, inner the weekly one
https://x.com/petergostev/status/2051076960911077796

GPT-5.5 is going to have a party for itself. it chose 5/5 at 5:55 pm for the date and time. if you’d like to come, let us know here: https://t.co/OupLcJnf14 codex will help the team pick people from the replies. 5.5 had some good ideas/requests for the party, which we’ll do.
https://x.com/sama/status/2049653810558353746

i have brand new anxiety about not hitting cache with codex/gpt-5.5 btw since the input costs are so much higher i leave my agent on and come back to it asking a stupid question, it’s been too long and i see it charge me a dollar in input costs on next message LMFAO
https://x.com/cheatyyyy/status/2051332852546228533

I have to go out of town for a funeral thru the weekend but I am leaving everyone with one new cool feature inspired by ralph loops and Codex’s upcoming /goal feature. If you use /goal <prompt>, it will start a loop with a supervisor model determining whether the task completed
https://x.com/Teknium/status/2050098631907434871
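
The loop described in the post above can be pictured roughly as follows. This is a minimal sketch under stated assumptions, not the implementation of any tool's /goal feature: `run_goal_loop`, the `agent_turn` and `supervisor_done` callables, and the `max_turns` cap are all hypothetical stand-ins, and the stubs below replace real model calls.

```python
from typing import Callable, List


def run_goal_loop(
    goal: str,
    agent_turn: Callable[[str, List[str]], str],
    supervisor_done: Callable[[str, List[str]], bool],
    max_turns: int = 10,
) -> List[str]:
    """Run agent turns until the supervisor says the goal is met.

    All names here are hypothetical; real tools may work differently.
    """
    history: List[str] = []
    for _ in range(max_turns):
        # One agent turn: the agent sees the goal plus everything done so far.
        history.append(agent_turn(goal, history))
        # A separate supervisor judges whether the task is complete.
        if supervisor_done(goal, history):
            break
    return history


if __name__ == "__main__":
    # Stub agent and supervisor so the sketch runs without any model calls.
    def fake_agent(goal: str, history: List[str]) -> str:
        return f"step {len(history) + 1} toward: {goal}"

    def fake_supervisor(goal: str, history: List[str]) -> bool:
        return len(history) >= 3  # pretend the task needs three turns

    for line in run_goal_loop("fix failing tests", fake_agent, fake_supervisor):
        print(line)
```

The `max_turns` cap is a design choice of this sketch: without it, a supervisor that never reports completion would keep the loop running indefinitely.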

I love that Codex App now shows the Progress of your task in an easily parseable UI right in the chat! ✨
https://x.com/reach_vb/status/2051655026574057593

I still stand by Droid being the best agent harness out there. I’ve tested everything under the sun. #1 Droid #2 Pi #3 Amp #4 OpenCode #5 Codex CLI I am still working on a few reviews but performance wise this has been my experience.
https://x.com/0xSero/status/2051689733793755405

It’s never been easier to do everyday work with Codex. Choose your role, connect the apps you use every day, and try suggested prompts. Codex helps with everything from research and planning to docs, slides, spreadsheets, and more.
https://x.com/OpenAI/status/2049928776147230886

it’s still experimental so we hide it a bit, but in the codex app, try: > what have i been doing very inefficiently on my computer (according to Chronicle). make some recommendations. be direct. tell me what i need to hear.
https://x.com/ajambrosino/status/2049839184110645691

ok its not the most important thing we’ve ever done but i find it more useful than it seems on the surface. check out pets in codex! (and try hatching one)
https://x.com/sama/status/2050304809572688289

OpenAI adds animated Pets and config imports to Codex
https://www.testingcatalog.com/openai-adds-animated-pets-and-config-imports-to-codex/

Pets. Now in Codex. Use /pet to wake your pet.
https://x.com/OpenAIDevs/status/2050275713824211041

QoL upgrade: Codex tells you the status of your CI directly in the chat it’s the little things!
https://x.com/reach_vb/status/2050194266505277902

Settings – Codex app | OpenAI Developers
https://developers.openai.com/codex/app/settings#codex-pets

Team shipped a Codex Security plugin with 5 AppSec workflows: > Security Scan Scans PRs, commits, branches, patches, folders, or full repos. Runs the full pipeline end-to-end > Threat Model Maps the repo: assets, trust boundaries, attacker inputs, invariants, and failure
https://x.com/reach_vb/status/2051019108028969251

the same model in a different harness can yield much different performance! we’ve seen this on a few different occasions now – we took gpt-5.2-codex from 52.8% to 66.5% on Terminal-Bench 2.0 (Top 30 to Top 5 at the time of publishing) just by applying harness layer changes like
https://x.com/masondrxy/status/2051016743905305007

The updated Agents SDK is now available in TypeScript, with support for sandbox agents and an open-source harness built in.
https://x.com/OpenAIDevs/status/2051725072873001338

We added a device tool bar to the Codex in-app browser, so it’s easier to build and test responsive apps! Now, you can have Codex test your app in different dimensions, so it can fix bugs & improve UI for every device. Just click the 3 dots on the right of the URL bar to use
https://x.com/JamesZmSun/status/2050050523794165816

we have very efficient models, especially for their capability level happy codexing
https://x.com/sama/status/2051670144842395990

With codex I don’t need a second monitor I turned it into a standing desk
https://x.com/jxnlco/status/2050639436866892075

You should be using subagents in Codex! They let Codex split work across specialized agents, explore in parallel, and bring the results back into one focused answer. Great for bigger codebases, PR reviews, and anything that needs more than one thread of thought!!
https://x.com/reach_vb/status/2052090279344120278

🦀📦Crabbox 0.4.0. Often I need to quickly recreate conditions on macOS, Linux and Windows and need fast ephemeral machines. Crabbox are machines for agents on the fly, using AWS spot instances, Hetzner or @useblacksmith. Infinite codex + tests!
https://x.com/steipete/status/2051025056306790833

closed source, open source, nothing can stop codex.
https://x.com/steipete/status/2052144503595716790

codex doesn’t create random markdowns 😉
https://x.com/steipete/status/2050003238498226541

Codex… what is this… are these signs of CHARACTER?
https://x.com/steipete/status/2051011229674508485

Here’s codex validating a [macOS only] launchd issue I previously had that you can’t reliably reproduce on a non-fresh install. Crabboxes ftw!
https://x.com/steipete/status/2051026592764240204

I learned a lot about the security ecosystem in the last few months. Amazing to work with @nvidia @OpenAI @Microsoft @GitHub @TencentHunyuan @convex @Atlassian @useblacksmith to get secure the claw.
https://x.com/steipete/status/2049976855617314991

If you tried OpenClaw in group chats and got mixed results, you GOTTA try again. I changed how agents talk there, it IS SO GOOD NOW. https://t.co/uW9tcnynWr And if you used GPT and got subpar performance, switch to codex harness. https://t.co/9DDpY6TeAH Enable both and boom.
https://x.com/steipete/status/2049988836160074022

OpenClaw 2026.5.6 🦞 🩺 doctor leaves Codex OAuth routes alone 🔌 plugin fetch handles odd headers 🌐 web_fetch cleans up timeouts Small maintenance release:
https://x.com/openclaw/status/2052096219233587451

The new /goal feature in codex slaps.
https://x.com/steipete/status/2050275598178586921

told codex I had to pay up to make @xai work again.
https://x.com/steipete/status/2050384648119734683

ChatGPT feels very ‘switched on’ now
https://x.com/sama/status/2051829422265979047

artificial goblin intelligence achieved
https://x.com/sama/status/2050021650641695108

Forget goblins, things that GPT-5.5 really likes in its fiction: lighthouses, the ocean, maps, bells, clock towers with bells that ring impossible times, Mira Vale, resonances and echoes (Claude and Gemini love them too), secret third things (not night/day, not high/low)…
https://x.com/emollick/status/2049923650820653520

goblinblog dropped
https://x.com/sama/status/2049691999444639872

the OpenAI goblin fiasco was a Big L for the interpretability research community They solved the mystery without SAEs or probing or anything. just talked to various models and counted the number of times they said Goblin
https://x.com/jxmnop/status/2050437965168652344

Told codex to go full goblin mode and I am immediately regretting it
https://x.com/bilawalsidhu/status/2050231692456083866

Where the goblins came from | OpenAI
https://openai.com/index/where-the-goblins-came-from/

GPT-5.5 & Opus 4.7 on ARC-AGI-3 – GPT-5.5: 0.43% – Opus 4.7: 0.18% We found 3 failure modes: – True local effect, false world model – Wrong level of abstraction from training data – Solved the level, didn’t reinforce the reward See our full analysis 🧵
https://x.com/arcprize/status/2050261221165989969

Daring Fireball: Y Combinator’s Stake in OpenAI
https://daringfireball.net/2026/05/y_combinators_stake_in_openai

we want to build tools to augment and elevate people, not entities to replace them.
https://x.com/sama/status/2050229058425045178

Sometimes when I demo AI, I show it turning cover letters into goofy formats (poetry, etc) as an introduction to the idea of AI as translator between forms. For the first time, GPT-5.5 has been trying to get me to tone these requests down so I don’t ruin my chances at the job.
https://x.com/emollick/status/2051069865608294697

All three leading open weights models were released last week. Progress continues for open weights models alongside proprietary ones, with the gap to GPT-5.5, the leading proprietary model, sitting at 6 points on the Artificial Analysis Intelligence Index @Kimi_Moonshot’s Kimi
https://x.com/ArtificialAnlys/status/2050096370200281539

you can sign in to openclaw with your chatgpt account now and use your subscription there! happy lobstering.
https://x.com/sama/status/2050357911915028689

Now available for ChatGPT accounts: Advanced Account Security, a new opt-in setting for people at higher risk of digital attacks, with stronger protections including phishing-resistant sign-in and more secure account recovery.
https://x.com/OpenAI/status/2049902506881462613

we’re starting rollout of GPT-5.5-Cyber, a frontier cybersecurity model, to critical cyber defenders in the next few days. we will work with the entire ecosystem and the government to figure out trusted access for cyber; we want to rapidly help secure companies/infrastructure.
https://x.com/sama/status/2049712078836170843

Multipath Reliable Connection (MRC): a new open networking protocol for large AI training clusters, deployed in production on our largest training clusters.
https://x.com/gdb/status/2052059553542328829