Image created with gemini-3.1-flash-image-preview with claude-sonnet-4-5. Image prompt: Photorealistic wide shot of an antique printing press half-embedded in a massive ice sheet on a frozen winter bay at dusk, with individual book pages frozen mid-motion within the surrounding ice surface, golden sunset light catching metal and paper through translucent blue ice, deep blue to orange gradient sky, National Geographic quality, 4K, landscape format with bold sans-serif ‘Publishing’ text overlay.
❤️ We are partnering with @MiniMax_AI to give Ollama users free usage of MiniMax M2.5 for the next couple of days! ollama run minimax-m2.5:cloud Use MiniMax M2.5 with OpenCode, Claude Code, Codex, OpenClaw via ollama launch! OpenCode: ollama launch opencode --model https://x.com/ollama/status/2022018134186791177
Eigent day 0 supports @MiniMax_AI M2.5! Try M2.5 on your open source cowork! With Chinese New Year (Horse) coming, we asked Eigent to generate 10 complete HTML/CSS/JS games (no libraries) across arcade, puzzle, runner, strategy, memory, idle and more. The Developer Agent called https://x.com/Eigent_AI/status/2021983494407069926
Introducing M2.5, an open-source frontier model designed for real-world productivity. – SOTA performance at coding (SWE-Bench Verified 80.2%), search (BrowseComp 76.3%), agentic tool-calling (BFCL 76.8%) & office work. – Optimized for efficient execution, 37% faster at complex https://x.com/minimax_ai/status/2021980761210134808
MiniMax M2.5 is live now on OpenRouter! @MiniMax_AI’s update to their powerful agentic model M2.1 comes with improved reliability and performance on long running tasks. It’s become a powerful general agent, capable of much more than writing code. https://x.com/OpenRouter/status/2021983955898315238
MiniMax M2.5: Built for Real-World Productivity. – MiniMax News | MiniMax https://www.minimax.io/news/minimax-m25
MiniMax’s new open M2.5 and M2.5 Lightning near state-of-the-art while costing 1/20th of Claude Opus 4.6 | VentureBeat https://venturebeat.com/technology/minimaxs-new-open-m2-5-and-m2-5-lightning-near-state-of-the-art-while
MiniMax-M2.5 is a surprising new step in open coding models. The first model where I’ve been able to independently confirm that it’s better than the most recent Claude Sonnet. It showed up in our benchmarks below, and in my vibe checks it felt strong and diverse. https://x.com/gneubig/status/2021988250240598108
80.2% on SWE-Bench Verified and 76.3% on BrowseComp is quite impressive. Try @MiniMax_AI M2.5 on @Eigent_AI https://x.com/guohao_li/status/2021984827923476922
M2.5 runs at 100 tokens per second. That’s 3x faster than Opus. At $0.06/M blended with caching, you can run subagents in the CLI and just leave them going. Fast models exist. Cheap models exist. Both at SOTA performance is new. https://x.com/cline/status/2022034678065373693
Can I get a six pack quickly? – YouTube https://www.youtube.com/watch?v=kQRu7DdTTVA
High speed FPV drones got the winter olympics looking like a real life video game. Really makes you appreciate whoever wrote the rendering stack for reality. https://x.com/bilawalsidhu/status/2021207946240078271
This will change the way we experience sports forever — watching the game from a gods eye view. Arcturus is building 4D gaussian splatting tech that can capture every angle of a sporting event and pushes the bar for volumetric video. I tested this in a headset and it makes 360 https://x.com/bilawalsidhu/status/2019831831110258883
Over 20% of YouTube videos are now “AI slop” says a new report Kapwing’s research found that 104 videos out of the first 500 recommended to them were identified as AI-generated, an additional 33% were classified as “brainrot” https://x.com/dexerto/status/2006330639960694808?s=46
An updated Gemini 3 Deep Think is out today: 📈 Achieves SOTA on ARC-AGI-2, MMMU-Pro, and HLE. 🥇Gold-medal level on Physics & Chemistry Olympiads. It turns out the best way to solve hard problems is still to think about them. Read more: https://x.com/NoamShazeer/status/2021988459519652089
Gemini 3 Deep Think (2/26) Semi Private Eval – ARC-AGI-1: 96.0%, $7.17/task – ARC-AGI-2: 84.6% $13.62/task New ARC-AGI SOTA model from @GoogleDeepMind https://x.com/arcprize/status/2021985585066652039
Gemini 3 Deep Think scores 84.6% on ARC-AGI-2 https://x.com/scaling01/status/2021981766249328888
Sundar buried the real story in the cost data. Gemini 3 Deep Think went from 45.1% to 84.6% on ARC-AGI-2 in under 3 months. That’s an 88% improvement on a benchmark specifically designed to resist brute-force scaling. The number that matters: $13.62 per task. The previous Deep https://x.com/aakashgupta/status/2022025020839801186
The new Gemini Deep Think is achieving some truly incredible numbers on ARC-AGI-2. We certified these scores in the past few days. https://x.com/fchollet/status/2021983310541729894
Thrilled to announce a big upgrade to Gemini 3 Deep Think that hits new records on the most rigorous benchmarks in maths, science & reasoning – including 84.6% on ARC-AGI-2, 48.4% Humanity’s Last Exam without tools, and 3455 Elo rating on Codeforces! https://x.com/demishassabis/status/2022053593910821164
Today, we updated Gemini 3 Deep Think to further accelerate modern science, research and engineering. With 84.6% on ARC-AGI-2 and a new standard on Humanity’s Last Exam, see how this specialized reasoning mode is advancing research & development 🧵↓ https://x.com/Google/status/2021982003818823944
We updated Gemini 3 Deep Think in @GeminiApp. Available for Ultra subscribers and slowly opening Gemini API access (fill out form below). – 48.4%, without tools on Humanity’s Last Exam. – 84.6% on ARC-AGI-2, verified by the ARC Prize Foundation. – Elo of 3455 on Codeforces. - https://x.com/_philschmid/status/2021989093110927798
An updated & faster Gemini 3 Deep Think is taking off! 🚀 Our smartest mode to date!™️ PhD-level reasoning to the most rigorous STEM challenges (models’ gotta think harder). Gold medal-level results on Physics & Chemistry Olympiads. 🧪💻 Full details: https://x.com/OriolVinyalsML/status/2021982720860233992
Anupam Pathak, a Google R&D lead in Google’s Platforms and Devices division, tested Deep Think’s ability to speed up the design of physical components. It’s proving that deep reasoning can translate directly into faster, more efficient prototyping. https://x.com/Google/status/2022007994897379809
At Duke University, the Wang Lab used Deep Think to optimize crystal growth for new semiconductors. Deep Think designed a recipe to grow thin films larger than 100 μm — hitting a precision target that previous methods had challenges to hit. https://x.com/Google/status/2022007988823973977
Gemini 3 Deep Think: AI model update designed for science https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-deep-think/
nano-banana is Gemini‑2.5‑Flash‑Image, beating Flux Kontext by 170 Elo with SOTA Consistency, Editing, and Multi-Image Fusion | AINews https://news.smol.ai/issues/25-08-26-nano-banana
The upgraded Gemini 3 DeepThink is now live! 🚀 We’re already seeing engineers and researchers leverage it as a partner in their design and development processes I love this example of Anupam Pathak using DeepThink to go from prompt to physical prototype–actually designing https://x.com/tulseedoshi/status/2021997867305775324
We’ve updated Gemini 3 Deep Think to better tackle the complexity of real-world research, science, and engineering. ♊ 🚀 It achieves gold-medal standards on the written portions of the Physics and Chemistry Olympiads, building on gold-level performance at IMO and ICPC and has https://x.com/JeffDean/status/2021989820604539250
We’ve upgraded our specialized reasoning mode Gemini 3 Deep Think to help solve modern science, research, and engineering challenges – pushing the frontier of intelligence. 🧠 Watch how the Wang Lab at Duke University is using it to design new semiconductor materials. 🧵 https://x.com/GoogleDeepMind/status/2021981510400709092
What’s ahead for commercial experiences in 2026 https://blog.google/products/ads-commerce/digital-advertising-commerce-2026/
people sleep on last week’s open multimodal releases > GLM-OCR: sota OCR model > MiniCPM-o-4.5: Gemini 2.5-flash level Omni model that runs on your phone > InternS1: efficient generalist VLM outperforming on science tasks all allow commercial use freely 🔥 https://x.com/mervenoyann/status/2021233480957304913
Gemini in Chrome: Your agentic browsing assistant – YouTube https://www.youtube.com/watch?v=5OR4c87Xt-E
Testing ads in ChatGPT | OpenAI https://openai.com/index/testing-ads-in-chatgpt/
Proud of the team for getting Pantheon and The Singularity is Near in the same Super Bowl ad https://x.com/sama/status/2020677993673433330
Our Super Bowl Ad, The Age of Orchestration, Walmart hits $1T, SaaSpocalpyse, Ken is Sick of Griftin – YouTube https://www.youtube.com/live/KVOBSXqhTco?t=5839s
we’re about to find out what happens when humans have to compete with optimized AI influencers that don’t sleep, always look perfect, and output 100x the content https://x.com/0xgaut/status/2013684399796023760?s=46
@MiniMax_AI M2.5 is now in Cline. + 80.2% SWE-Bench Verified. + 100 tps. $0.06/M blended cost. + 10B activated parameters. And it’s free in Cline for a limited time! https://x.com/cline/status/2022034591075512636
🚨Busy week for new models in the Arena: MiniMax M2.5 by @MiniMax_AI is now available in the Text and Code Arena. Bring your toughest prompts and see how it stacks up against the latest models in real-world use. In Battle mode, your votes power the leaderboards. Learn more https://x.com/arena/status/2021987555655422257
Honestly I wanna release this beast ASAP — I’m dying to go back to my hometown for Spring Festival 😂 But the more training compute we put in, the more it keeps rising. Painfully happy problem. We hear you guys. M2.5 soon. https://x.com/SkylerMiao7/status/2021587213230715306
Instant access to M2.5 on MiniMax Agent web/desktop! @MiniMax_AI https://x.com/MiniMaxAgent/status/2021595954143515106
MiniMax M2.5 is now live on BLACKBOX AI. A frontier model designed for real world execution with strong reasoning, reliable tool use, and complex multi step workflows. Engineered for demanding workloads. Ready for production scale orchestration. Switch instantly in the https://x.com/blackboxai/status/2022140484601225420
A glance of MiniMax 2.5, are you ready? https://x.com/SkylerMiao7/status/2021578926884053084
Congrats @MiniMax_AI! 🎉 Free for 3 days on Qoder, it’s time to put M2.5 through some serious coding sessions! https://x.com/qoder_ai_ide/status/2021983111161213365
MiniMax just dropped M2.5 and it’s on par with Opus 4.6 while being 20x cheaper and 3x faster??? https://x.com/shydev69/status/2021989925143597123
Meet Audiobooks in ElevenCreative. The complete toolkit to create, refine, and publish audiobooks using lifelike AI voices – from first draft to published audio. https://x.com/elevenlabsio/status/2020906310837870873?s=20
Can just a 4B model solve IMO-level proof problems at the level of much stronger LLMs like Gemini 3 Pro? Yes, if you can train the LLM to scale test-time compute well! We’re very excited to release our 4B model “QED-Nano”, built via an awesome open collab! Details below🧵⬇️ https://x.com/aviral_kumar2/status/2022057927368995097
Early testers of Gemini 3 Deep Think are already seeing results. We partnered with researchers to explore how this model could tackle rigorous, real-world applications — from spotting hidden flaws in research papers to optimizing semiconductor growth. Here’s how early testers https://x.com/Google/status/2022007977419415958
If you’re an Ultra subscriber, you can try the latest in the Gemini App, but we’re also making Deep Think available for the first time in the Gemini API! Request early access here: https://x.com/tulseedoshi/status/2021997870858350640
@GeminiApp Do people realize how crazy that thing is?? https://x.com/LexnLin/status/2021986194780041394
Codeforces results is “no tools”? So Gemini 3.0 Deep Think cannot write test cases to test its solution before submission? I guess even the top1 human can’t get 3455 under this condition. https://x.com/YouJiacheng/status/2021985843074994534
Gemini 3 Deep Think benchmarks look amazing! On Codeforces, it scored 3,455 Elo. Apparently, only 7 humans in the world have a higher coding Elo score! A friend just sent me an output about a cancer mechanism that was so great that I am now resubscribing to Ultra for DT access! https://x.com/DeryaTR_/status/2022030594037989493
Gemini 3 Deep Think can help make things. 🧠 Here’s our side project: We sketched a laptop stand and Deep Think coded that into an interactive prototyping tool. We used that tool to generate a STL file, which we sent to @fleet_ai. And now I have a new laptop stand! What will https://x.com/joshwoodward/status/2022001967795777996
Gemini 3 Deep Think is available now in the @GeminiApp for Google AI Ultra subscribers and via the Gemini API to select researchers, engineers and enterprises through our early access program. Learn more ↓ https://x.com/Google/status/2021982018679312829
Gemini 3 Deep Think is getting a significant upgrade. We’ve refined Deep Think in close partnership with scientists and researchers to tackle tough, real-world challenges. And it’s pushing the frontier across the most challenging benchmarks, achieving an unprecedented 84.6% on https://x.com/sundarpichai/status/2022002445027873257
Gemini 3 Deep Think now excels across scientific domains like chemistry and physics — achieving gold medal-level results on the written sections of the 2025 International Physics and Chemistry Olympiads. https://x.com/Google/status/2021982010739503138
Parsing PDFs at scale with LLMs is cost prohibitive. Newer models (e.g. gemini 3) are good at reading pdfs, but you burn unnecessary vision tokens even when the page is text heavy. We’ve built in a “cost-optimizer” within LlamaParse that will dynamically route pages to https://x.com/jerryjliu0/status/2021267495123140760
The upgraded Deep Think mode is rolling out now in the @GeminiApp for Google AI Ultra subscribers. For scientific researchers and developers, we’re opening a Vertex AI Early Access Program for the API. Start discovering → https://x.com/GoogleDeepMind/status/2021981517791342807
There are only 7 people on the planet who can beat Gemini 3 Deep Think in coding competitions. It has an Elo of 3455. A bit over a year ago the best systems were at 2727 (o3-preview). https://x.com/scaling01/status/2021983388442509478
Today, we’re releasing a significant upgrade to our specialized reasoning mode, Gemini 3 Deep Think. Deep Think is built to drive practical applications, enabling researchers to interpret complex data and engineers to model physical systems through code. With the updated Deep https://x.com/GeminiApp/status/2021985731577852282
We’re starting to roll out a test for ads in ChatGPT today to a subset of free and Go users in the U.S. Ads do not influence ChatGPT’s answers. Ads are labeled as sponsored and visually separate from the response. Our goal is to give everyone access to ChatGPT for free with https://x.com/OpenAI/status/2020936703763153010
Dead Internet Theory – Dmitry Kudryavtsev https://kudmitry.com/articles/dead-internet-theory/