Image created with gemini-3.1-flash-image-preview with claude-sonnet-4-5. Image prompt: Wide-angle interior of a decaying Chinese industrial warehouse with concrete pillars and fluorescent lighting, workers in plain clothes sorting paper documents on metal tables, a red-brown horse standing calmly at the far end of the room, muted desaturated colors, overcast natural light through dirty windows, observational realism, large white text overlay reading GOOGLE in poster style, documentary film still aesthetic

AI Makes Degrees Obsolete Former Google AI leader Jad Tarifi warns that long degrees like law, medicine, and even PhDs may become outdated before students graduate, as AI rapidly reaches PhD-level performance. With 70% (!) of AI PhDs now heading into private sector jobs (up”” https://x.com/kimmonismus/status/2023446044873560178

All tracks generated in Gemini are embedded with SynthID, our imperceptible watermark for identifying Google AI-generated content. We are also giving you more tools to help identify AI content, broadening our verification capabilities to include audio. Simply upload a file and”” https://x.com/GeminiApp/status/2024153548641177781

Is that track AI-generated? Now you can just ask @GeminiApp. We’ve broadened our verification tools so you can now upload audio files to Gemini to check for SynthID — our imperceptible watermark on AI-generated content. Just upload a file and ask: “Was this created using Google”” https://x.com/Google/status/2024172104711823678

We just dropped Lyria 3: our latest generative music model. 🔊 It can turn photos and text into dynamic tracks – complete with vocals and lyrics. 🧵”” https://x.com/GoogleDeepMind/status/2024153067654902014

Excited to launch Gemini 3.1 Pro! Major improvements across the board including in core reasoning and problem solving. For example scoring 77.1% on the ARC-AGI-2 benchmark – more than 2x the performance of 3 Pro. Rolling out today in @GeminiApp, @antigravity and more – enjoy!”” https://x.com/demishassabis/status/2024519780976177645

Gemini 3.1 Pro Benchmarks 77.1% ARC-AGI-2 80.6% SWE-Bench Verified”” https://x.com/scaling01/status/2024514798470181370

Gemini 3.1 Pro is here. Hitting 77.1% on ARC-AGI-2, it’s a step forward in core reasoning (more than 2x 3 Pro). With a more capable baseline, it’s great for super complex tasks like visualizing difficult concepts, synthesizing data into a single view, or bringing creative”” https://x.com/sundarpichai/status/2024516418855981298

Gemini 3.1 Pro landed today. This is based on the same model behind the agentic DeepThink released last week; it is now available to all Gemini users on many apps. This is a really good model especially in reasoning and multimodal understanding/generation. Try it out.”” https://x.com/mirrokni/status/2024525808501477568

Gemini 3.1 Pro: Announcing our latest Gemini AI model https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-pro/

Holy sh*t, thats what I call an improvement! Gemini 3.1 pro is insane: – Arc agi 2 77% – SWE verified 80% – HLE 44%/51%”” https://x.com/kimmonismus/status/2024521970184868000

To the Scientist, the Engineer, and the Developer: Gemini 3.1 Pro has arrived in @GeminiApp It’s a significant leap in complex reasoning (77.1% on ARC-AGI-2) so it’s great at agentic tasks, intricate coding, and data synthesis projects. You should see fewer errors, better”” https://x.com/joshwoodward/status/2024515741819842623

Today, we’re continuing to push the boundaries of AI with our release of Gemini 3.1 Pro. This updated model scores 77.1% on ARC-AGI-2, more than double the reasoning performance of its predecessor, Gemini 3 Pro. Check out the visible improvement in this side-by-side comparison,”” https://x.com/JeffDean/status/2024525132266688757

Gemini 3.1 Pro is here! It’s top 3 across Text and Vision Arena, and #6 in Code Arena, tied closely with Claude Opus 4.5. Highlights: ▪️Tied #1 in Text (scoring 1500), 4 pts from Opus 4.6 ▪️Top 3 in Arena Expert Leaderboard (scoring 1538), just behind Opus 4.6 ▪️#6 in Code”” https://x.com/arena/status/2024519891295089063

Gemini 3.1 Pro WebDev Arena results: – 6th place behind Opus 4.5/4.6 and GPT-5.2-high”” https://x.com/scaling01/status/2024522048312054142

Multimodal function calling is now available in the Gemini Interactions API, build agents that can see and process images natively. 🖼️ Tools return actual images, not text descriptions 👁️ Gemini 3 natively processes returned images 🛠️ Function results support mixed text and”” https://x.com/_philschmid/status/2022349886318928158

Update regarding Gemini 3.1 Pro: -Ranked #1 among all Gemini models released to date. -Ranked #1 among all models I have tested so far. (GPT-5.2 high 165.9 vs Gemini 3.1 Pro 166.6) However, please note that my testing has limitations due to budget constraints: -I have not”” https://x.com/Hangsiin/status/2024605310913216614

Introducing Lyria 3, our latest and most advanced music model, available in the Gemini App starting today : ) Go from idea, image, or video to music in seconds!”” https://x.com/OfficialLoganK/status/2024153948488118513

Meet Lyria 3, our latest music generation model from @GoogleDeepMind. 🎶 Now, you can create custom music tracks in the @GeminiApp — just by describing an idea or uploading an image or video.”” https://x.com/Google/status/2024154379838705920

We just launched Lyria 3! Our most advanced AI music model in the @GeminiApp 🎵 – Generates 30-second tracks from text or image prompts. – Support custom lyrics, vocals, and cover art. – Supports 8 languages including English, Japanese, and Korean. – All outputs watermarked with”” https://x.com/_philschmid/status/2024154542061805988

Use Lyria 3 to create music tracks in the Gemini app https://blog.google/innovation-and-ai/products/gemini-app/lyria-3/

News Alert: Today, the #FBI arrested three Silicon Valley engineers who are facing charges of conspiring to commit trade secret theft from Google and other leading technology companies, theft and attempted theft of trade secrets, and obstruction of justice. Samaneh Ghandali, 41,”” https://x.com/FBISanFrancisco/status/2024670479974363376

OpenAI and Anthropic are much further ahead than what benchmarks show. While you are token constrained they are blasting millions of tokens at 4x the API speed without batting an eye and they scaffold like they are trying to build a skyscraper.”” https://x.com/scaling01/status/2023837889478758495

Dario acknowledges the multi-trillion dollar robotics opportunity, yet Anthropic is not hiring robotics talent; even as OpenAI and Google DeepMind aggressively build out their own robotics teams.”” https://x.com/TheHumanoidHub/status/2022416551270662427

TLDR: Opus 4.6 demonstrates better reasoning and use of memory than Gemini 3.1 Pro and solves more levels. I’m now much more confident that current and future models will be able to solve ARC-AGI-3, given that they have access to harness with simple memory. My speculative take”” https://x.com/scaling01/status/2024642420177096769

> installed Antigravity > chose Gemini 3.1 Pro (High) > ask which model it is > telling me it’s powered by Claude 3.7 Sonnet Is the UI lying, or is the agent/model lying/hallucinating?”” https://x.com/Yuchenj_UW/status/2024721228842565851

Experimenting with new AI Studio vibe coding start screens today, New vs Old. What do you think?”” https://x.com/OfficialLoganK/status/2023101878087926103

Gemini 3.1 Pro is rolling out starting today. Here’s where you can find it: Consumers: @GeminiApp and @NotebookLM Developers: Start building with it in preview via the Gemini API in @GoogleAIStudio, @antigravity, @geminicli, and @AndroidStudio. Enterprise: Vertex AI and Gemini”” https://x.com/Google/status/2024519482383736841

Gemini 3.1 Pro Update! A upgrade to our best coding and agentic gemini model! 🚀 Here is all you need to know: – Same 1M context with 64k output, knowledge cut of Jan 2025. – Same $2 / $12 (<200k tokens); $4 / $18 (>200k tokens). – 2.5x better abstract reasoning (77.1% on”” https://x.com/_philschmid/status/2024516444847776209

Generate an SVG of a rollercoaster”” https://x.com/OriolVinyalsML/status/2024519610683576422

Installed Gemini CLI for the first time today. Waited all day, still no Gemini 3.1 Pro in the model list. Installed Antigravity for the first time too, hit multiple bugs. Requests failing, agent acting weird. Google needs to polish its coding tools, not just ship stronger”” https://x.com/Yuchenj_UW/status/2024708583829753909

It’s a good model”” https://x.com/andrew_n_carr/status/2024523689040183355

make it first person view (i want to see the rollercoaster in front of me)”” https://x.com/OriolVinyalsML/status/2024519612579422598

Starting today, Gemini 3.1 Pro is rolling out globally to the Gemini app, with higher limits for users with the Google AI Pro and Ultra plans. Learn more about these updates in our blog:”” https://x.com/GeminiApp/status/2024516782816710920

we’ve revamped our entire billing and dashboard experience to give you more control and insight when building and scaling your apps monitor your rate limits in real-time, filter costs by project, and diagnose traffic spikes with our new success and failure metrics plus, upgrade”” https://x.com/GoogleAIStudio/status/2022409735287537999

@OfficialLoganK Looks like Antigravity is working great now. Gemini CLI still doesn’t. Gemini Code Assist is still announcing it just got Flash 3 🤦‍♂️”” https://x.com/matvelloso/status/2024566224152383824

3.1 Pro can even generate website-ready, animated SVGs from a simple text prompt. Since these are built in pure code — not pixels — they stay crisp at any scale and keep file sizes tiny compared to traditional video. Go ahead, try generating an animated SVG of a pelican riding”” https://x.com/Google/status/2024519468395733477?s=20

Today, we’re releasing Gemini 3.1 Pro. It’s the same core intelligence that powers Gemini 3 Deep Think, now scaled for your practical applications. It’s a smarter model for your most complex tasks. See 3.1 Pro in action 🧵↓”” https://x.com/Google/status/2024519455389192204

Writing my latest guide on what AI to use made it really clear how confusing the Google AI situation is. Great models with radically different harnesses in different apps. Great AI products, mixed in with some bad ones. None of which seem to clearly connect or interact together.”” https://x.com/emollick/status/2023965642357907854

The model is a step forward in reasoning, designed for workflows where a simple answer isn’t enough. On ARC-AGI-2 – which tests for novel logic patterns – it more than doubles 3 Pro’s score. This means it can help you visualize complex topics, organize scattered data, and bring”” https://x.com/GoogleDeepMind/status/2024516467618656357

Earlier today I wanted to doom about Gemini 3.1 Pro completely failing ARC-AGI-3. Turns out this was due to a bug in the config introduced by GPT-5.3. It was still calling Gemini 3.0 Pro instead of 3.1. I fixed it, made the harness simpler and spend $120. Performance of Gemini”” https://x.com/scaling01/status/2024642220096442772

Gemini 3.1 is the faster horse. It’s like a horse with rocket fuel. Truly insane. Everyone else makes cars now.”” https://x.com/theo/status/2024808734053347608

Gemini 3.1 Pro on ARC-AGI Semi-Private Eval @GoogleDeepMind – ARC-AGI-1: 98%, $0.52/task – ARC-AGI-2: 77%, $0.96/task Gemini to push the Pareto Frontier of performance and efficiency”” https://x.com/arcprize/status/2024522812728496470

Gemini 3.1 Pro Preview scored highest in the Artificial Analysis Intelligence Index but its most significant advantage might be its price and token efficiency. Our evaluations cost <50% to run on Gemini 3.1 Pro Preview compared to Claude Opus 4.6 (max) and GPT-5.2 (xhigh) Gemini”” https://x.com/ArtificialAnlys/status/2024677979390169536

Gemini Pro 3.1 (& other frontier models) are still terrible at Connect 4. Yet smashing ARC-AGI-2 That is weird, right? ARC was built to be resistant to overfitting. I guess the fully generalised world of ARC AGI puzzles is still a very narrow slice of spatial reasoning”” https://x.com/paul_cal/status/2024748708223402120

I gave Gemini 3.1 Pro an ARC-AGI-2 challenge WITH solution and it bombed it … SVG’s might have been successfully sloptimized GPT-5.2 Thinking realizes it after 14s of Thinking that I gave it the solution in the input and just repeats it Gemini 3.1 Pro thought for 8 minutes”” https://x.com/scaling01/status/2024268831321993590

Loving Gemini 3.1 Pro! It made 3 huge improvements to my compiler and saw things that even ChatGPT 5.2 Pro Extended and Claude Opus 4.6 Extended couldn’t see.”” https://x.com/QuixiAI/status/2024545096532733967

oh and ARC-AGI-3 is crazy expensive to run”” https://x.com/scaling01/status/2024650634746610041

By the way, the recent Gemini 3.1 Pro is also a really good model for RLMs. Claude Opus 4.6 is the worst of the ones I tested. Probably not optimized for the type of decomposition that RLMs need. I am just impressed by GPT-5.2-Codex. The strategies it uses are brilliant.”” https://x.com/omarsar0/status/2024973182436831629

Claude Sonnet 5: The “Fennec” Leaks – Fennec Codename: Leaked internal codename for Claude Sonnet 5, reportedly one full generation ahead of Gemini’s “Snow Bunny.” – Imminent Release: A Vertex AI error log lists claude-sonnet-5@20260203, pointing to a February 3, 2026 release”” https://x.com/pankajkumar_dev/status/2018187650927349976?s=46

Gemini 3.1 Pro will be a massive step-up! There’s a decent chance it’s on par with Opus 4.6 and GPT-5.3. The main reason for that: similarly to Claude 4.6 and GPT-5.2/5.3 it thinks much longer than Gemini 3 Pro The same request on aistudio, tested multiple times, had 6”” https://x.com/scaling01/status/2024251668771066362

Google is once again the leader in AI: Gemini 3.1 Pro Preview leads the Artificial Analysis Intelligence Index, 4 points ahead of Claude Opus 4.6 while costing less than half as much to run @GoogleDeepMind gave us pre-release access to Gemini 3.1 Pro Preview. It leads 6 of the”” https://x.com/ArtificialAnlys/status/2024518545510662602

In Arena Expert, with expert level prompts, Gemini 3.1 Pro Preview lands in the top 3 (scoring 1538), just behind Claude Opus 4.6”” https://x.com/arena/status/2024519895623598423

Sonnet 4.6 crushes Gemini 3 and GPT-5.2 on Vending-Bench 2”” https://x.com/scaling01/status/2023833660546499053

Claude Sonnet 4.6 has landed #3 in Code and #13 in Text Arena! Highlights: ▪️+130 pts jump in Code Arena (#22 -> #3) compared to Sonnet 4.5, surpassing top-tier thinking models like Gemini-3.1 and GPT-5.2 ▪️Strong gains in Text categories: Math (#4) and Instruction Following”” https://x.com/arena/status/2024883614249615394

📊 Let’s dive deeper into Gemini 3.1 Pro gains. It ranks 13 points above Gemini 3 Pro overall. We see the largest rank gains for @GoogleDeepMind’s latest model in the following categories: Text: ▪️Coding (+5) ▪️Math (+4) ▪️Expert (+3) ▪️Instruction Following (+3) ▪️Multi-Turn”” https://x.com/arena/status/2024588456463389040

Check out the skills for the Gemini API! More soon!”” https://x.com/osanseviero/status/2022259577232785866

Context Arena Update: Added @Google’s Gemini 3.1 Pro Preview to the MRCR leaderboards (2-,4-,8-needle)! Meant to send this out earlier today. Thanks to @GoogleDeepMind and others over there for early access! Thinking budget barely matters on simpler retrieval – 2-needle AUC”” https://x.com/DillonUzar/status/2024655613293215855

Gemini 3.1 Pro has landed! Amazing performance / capabilities across the board. Beyond SOTA, the best are all the things that evals can’t measure. E.g. SVG has gotten so much better (see 🧵) https://x.com/OriolVinyalsML/status/2024519605570720185

Gemini 3.1 Pro in 1st place on the Artificial Analysis Leaderboard”” https://x.com/scaling01/status/2024517196727099847

Gemini 3.1 Pro is rolling out now in the @GeminiApp, and exclusively to Google AI Pro and Ultra users in @NotebookLM. Developers can access it in preview via the API in @GoogleAIStudio. Find out more → https://x.com/GoogleDeepMind/status/2024516471720743295

Gemini 3.1 Pro’s GDPval scores are concerning”” https://x.com/scaling01/status/2024515061163704336

Gemini Deep Think 3 is the world’s most capable model by many measures, huge amounts of progress on reasoning benchmarks and more. Available right now via the Gemini App for Ultra subscribers and in the API soon : )”” https://x.com/OfficialLoganK/status/2021996626144080015

Good news: Google AI Studio and the Gemini API are now live in Moldova, Andorra, San Marino, and Vatican City! 🌍”” https://x.com/OfficialLoganK/status/2022688445957820610

Google is back on the intelligence-cost frontier with Gemini 3.1 Pro”” https://x.com/scaling01/status/2024519007018373202

Google test NotebookLM integration for Opal workflows https://www.testingcatalog.com/google-test-notebooklm-integration-for-opal-workflows/

I would expect only a few models to make progress with this rather simple harness: GPT-5.2-xhigh, Opus 4.5 and Opus 4.6 and Gemini 3.1 Pro other models will have a very hard time”” https://x.com/scaling01/status/2024661145286557872

Last week we upgraded Gemini 3 Deep Think. Today, we’re shipping the core intelligence that makes those breakthroughs possible: Gemini 3.1 Pro. A noticeably smarter, more capable baseline for your hardest challenges. Available now: https://x.com/NoamShazeer/status/2024519946764734574

Multimodal Function Calling with Gemini 3 and Interactions API https://www.philschmid.de/interactions-multimodal-fc

My vibe is unchanged: Gemini 3.1 is a previous gen model. It naively lives in a context-universe engineered by the God-User. Opus is a friend-type AI. It sits with you in a KFC. 5.2 sees a vast expanse of thought. Below there’s a given context. A user makes some noise, perhaps.”” https://x.com/teortaxesTex/status/2024574416747671556

Saw Gemini 3.1 announcement, got super excited. Tried Google Antigravity… not available. Tried Gemini CLI… not available. Tried Gemini Code Assist… not available. @OfficialLoganK put AI Studio in an Electron Shell and just launch it. You will deliver these faster.”” https://x.com/matvelloso/status/2024548414198091922

Today we’re releasing a preview of Gemini 3.1 Pro and making it available to our users and developers. Very excited to bring the upgraded core we used in Deep Think to everyone. Learn more about Gemini 3.1 Pro: https://x.com/koraykv/status/2024517699595124902

We just made paying for the Gemini API 10x easier : ) You can now upgrade to a paid Gemini API account without leaving AI Studio, track your usage, filter spend by model, and much more to come!”” https://x.com/OfficialLoganK/status/2022409335465480346

We made a skill for the Gemini API!”” https://x.com/OfficialLoganK/status/2022123808296251451

Here are some useful prompting tips to get the most out of our new music generation model in Gemini, Lyria 3 ↓”” https://x.com/GeminiApp/status/2024167107538407783

Introducing Lyria 3, our new music generation model in Gemini that lets you turn any idea, photo, or video into a high-fidelity track with custom lyrics. From funny jingles to lo-fi beats, you can create custom 30-second soundtracks for any moment. See how it works. 🧵”” https://x.com/GeminiApp/status/2024152863967240529

Learn How To Build a Gemini-Powered Robotics Simulator in the Browser with MuJoCo WASM. MuJoCo (WebAssembly) + Three.js + Gemini ER Thanks for sharing, @osanseviero! 📍Tutorial: https://t.co/OpIb74K2Od Demo: https://t.co/hp4TJLED02 —- if it matters in AI or Robotics you’ll”” https://x.com/IlirAliu_/status/2023835606791356428

Generate an SVG of a pelican riding a car in France with a cat sitting beside it. Background has Eiffel tower.”” https://x.com/OriolVinyalsML/status/2024519608833810496
