Internet: AI News Week Ending 03/06/2026

Internet: AI News Week Ending 03/06/2026

March 6, 2026

Image created with gemini-3.1-flash-image-preview with claude-sonnet-4-5. Image prompt: Wide angle aerial photography of a person in freefall striking an ecstatic pose while holding a tangled bundle of bright ethernet cables streaming upward like a useless parachute, crisp blue sky background, earth visible far below, bold title typography reading ‘INTERNET’ integrated into the sky, bright daylight, dynamic action shot, clean simple composition, humorous optimistic mood.

ChatGPT users research products but won’t buy there, forcing OpenAI to rethink its commerce strategy https://the-decoder.com/chatgpt-users-research-products-but-wont-buy-there-forcing-openai-to-rethink-its-commerce-strategy/

GPT-5.4 is here. Native computer-use capabilities. Up to 1M tokens of context in Codex and the API. Best-in-class agentic coding for complex tasks. Scalable tool search across larger ecosystems. More efficient reasoning for long, tool-heavy workflows. https://x.com/OpenAIDevs/status/2029620984853188738

GPT-5.4 reportedly brings a million-token context window and an extreme reasoning mode https://the-decoder.com/gpt-5-4-reportedly-brings-a-million-token-context-window-and-an-extreme-reasoning-mode/

GPT-5.4 set a new record on FrontierMath, our benchmark of extremely challenging math problems! We had pre-release access to evaluate the model. On Tiers 1-3, GPT-5.4 Pro scored 50%. On Tier 4 it scored 38%. See thread for commentary and additional experiments.”” https://x.com/EpochAIResearch/status/2029626255776395425

Big GPT-5.4 updates (via TheInformation) – 1M token context window -New “Extreme reasoning mode” → more compute, deeper thinking – Parity with Gemini and Claude long-context models – Better long-horizon tasks (can run for hours) – Improved memory across multi-step workflows”” https://x.com/kimmonismus/status/2029213568155992425

BOOOOM! Introducing GPT-5.4 Thinking & Pro in Codex, API & ChatGPT 🔥 It combines GPT-5.3-Codex-level coding with stronger reasoning, better knowledge-work generation, native computer use for agents, and up to ~1M token context! > Tool search cuts token usage by ~47% on”” https://x.com/reach_vb/status/2029620416546017491

excited for GPT-5.3 Instant rolling out today! a lot of work went into improving the everyday chatgpt experience, things that don’t always show up in benchmarks: better tone, fewer unnecessary refusals, and stronger answers from search. we’ve also reduced hallucinations ↓”” https://x.com/christinahkim/status/2028900228196384978

GPT-5.2 Pro is a really solid fact checker. Put in anything you write into it and it hums away and gives you objections & caveats & “”well, actually”” qualifications, plus it checks your math Outside of narrow areas (Academic pubs, New Yorker articles) this was not possible pre-AI”” https://x.com/emollick/status/2029235053339804132

GPT-5.3 Instant is rolling out in ChatGPT starting today. We heard the feedback on 5.2 – sometimes too cautious, too many caveats, and conversations that didn’t flow as naturally as they should. 5.3 Instant tackles that with fewer unnecessary refusals, fewer defensive”” https://x.com/nickaturley/status/2028894581191000404

GPT-5.3-chat-latest now also in the API”” https://x.com/scaling01/status/2028906108291616773

GPT-5.4 also has a 1M context window, but their evals show that needle-in-a-haystack (MRCR v2) scores 97% at 16-32K tokens, drops to 57% at 256-512K, and just 36% at 512K-1M. So it’s a good idea to compact regularly!”” https://x.com/cline/status/2029642984351010874

GPT-5.4 coming soon (as first leaked by humble self) – exceeding 1 million context window – featuring an extreme reasoning mode”” https://x.com/scaling01/status/2029215437922169254

GPT-5.4 is a big step up in computer use and economically valuable tasks (e.g., GDPval). We see no wall, and expect AI capabilities to continue to increase dramatically this year.”” https://x.com/polynoamial/status/2029622090152956335

GPT-5.4 is launching, available now in the API and Codex and rolling out over the course of the day in ChatGPT. It’s much better at knowledge work and web search, and it has native computer use capabilities. You can steer it mid-response, and it supports 1m tokens of context.”” https://x.com/sama/status/2029622732594499630

GPT-5.4 only slightly better than GPT-5.3-Codex on SWE-Bench-Pro”” https://x.com/scaling01/status/2029620496627597364

GPT-5.4 Pricing MORE EXPENSIVE THAN GPT-5.2″” https://x.com/scaling01/status/2029619520860565648

GPT-5.4 Thinking is rolling out to ChatGPT. You can now interrupt it before it produces the final answer. That means you can steer the response while it’s still working instead of needing multiple back-and-forth turns. We also improved deep web research and long-context”” https://x.com/nickaturley/status/2029639058864099543

GPT-5.4-high is now in the Text Arena, tied with Gemini-3-Pro. Highlights: – Top 3 in Creative Writing, and top 10 in Instruction Following, Hard Prompts. – Top 6 for Occupational categories: Writing, Literature & Language, Entertainment, Sports & Media, Business, Management &”” https://x.com/arena/status/2029648008602857694

It’s GPT-5.4 day! The first general-purpose AI model that beats humans at operating a computer. 75% on OSWorld vs 72.4% for humans. It can navigate desktops, click through UIs, send emails, fill out forms all from screenshots. Additional nuggets: – 1M token context and”” https://x.com/TheRundownAI/status/2029625695593435286

It’s happening: GPT-5.4 landed in the arena. Release Thursday very likely”” https://x.com/kimmonismus/status/2029325405212070200

We also evaluated GPT-5.4 Pro on FrontierMath: Open Problems. It did not solve any problems. It made some novel observations on one problem, but of a form that the author had anticipated and characterized as relatively uninteresting. More here:”” https://x.com/EpochAIResearch/status/2029626331764605365

GPT-5.4 scores 83% on GDPval”” https://x.com/scaling01/status/2029618924375965992

GPT-5.4 and GPT-5.4 Thinking are now available in Perplexity for Pro and Max subscribers.”” https://x.com/perplexity_ai/status/2029629694489006347

SerpApi: Google Search API https://serpapi.com/