RT @yawnxyz: Kimi K2 is **INCREDIBLE** at using tools. I built a chrome extension to chat with Google Maps, but I never posted it. All th…”” / X https://x.com/bigeagle_xd/status/1945087963408351728

New from our security teams: Our AI agent Big Sleep helped us detect and foil an imminent exploit. We believe this is a first for an AI agent – definitely not the last – giving cybersecurity defenders new tools to stop threats before they’re widespread.https://x.com/tulseedoshi/status/1945113799297536313

OpenAI’s Windsurf deal is off — and Windsurf’s CEO is going to Google | The Verge https://www.theverge.com/openai/705999/google-windsurf-ceo-openai

RT @jordihays: Here is most of what I’ve gathered on the Windsurf / Google Deal The founders and dozens of engineers are going to Google.…”” / X https://x.com/_arohan_/status/1944203727059226784

The Next Stage of Windsurf https://windsurf.com/blog/windsurfs-next-stage

The Windsurf Dynamics: On the need for a social contract, an analysis of the potential payouts / cap table math, what a better outcome might have looked like instead, and why –– maybe? –– the Windsurf founders and board might have actually done the right thing, leaving a graceful https://x.com/haridigresses/status/1944406541064433848

Dia Browser has a built in AI Chat tab. You can reference any tab you have open and even make comparisons between them. Dia is able to understand the page you’re on and give answers. It’s pretty cool! https://x.com/jerrod_lew/status/1933132174921961807

Dia Skills are one of the things that makes @diabrowser so powerful. Brave doesn’t have this in Leo, and no, you can’t just “”get a Chrome extension to do this for you”” 🫠 Here’s how @joshm and team started with Skills and some of the rad things you can do with them today, a Dia https://x.com/morganlinton/status/1942589297200390165

My top 4 features from @browsercompany’s new Dia Browser so far: ⚡ CMD+T → chat with AI instantly 🧠 CMD+E → ask Dia about the current page (no more copy-paste into GPT) ✍️ Select text → CMD+E → ask to revise my writing → replace 🔗 Type @ → pull context from other tabs https://x.com/zineanteoh/status/1909618736199598276

New Dia browser came out to be a great tool to keep stay updated with the latest dev drama without watching the whole 40 minute video 😅 Simple prompt for video summary and boom, you saved yourself 40 minutes. https://x.com/vasilije_luka/status/1942900540397998574

Quickly chat with any pdf with Dia browser open any pdf in with Dia ask anything like you’d do it with any llm + you can use any custom skill you have from Dia to speed up more https://x.com/pugni_vito/status/1942964581825200293

Talk to your youtube videos with AI, straight from your browser! Love this new AI Dia Browser @diabrowser https://x.com/diegocabezas01/status/1934066414257860610

This AI browser just made watching YouTube videos obsolete. It literally reads your screen and does everything for you ✨ How Dia AI Browser is changing everything: ✅ Summarizes entire YouTube videos in seconds ✅ Creates custom automation skills with one command ✅ Manages https://x.com/JulianGoldieSEO/status/1942795852474360068

Trying out pair browsing with the DIA browser for the first time—writing this post as part of the experiment! https://x.com/cleeeeeeeeement/status/1932861729664377103

Using Dia Browser is a super power. Just used it to quickly summarize spending just by having account info open in a tab. Mind blown. 🤯 @browsercompany”” / X https://x.com/talkaboutdesign/status/1933120237282472337

The @browsercompany team had it all: millions of users, Chrome’s former lead, Silicon Valley darling status. They threw it away to build Dia—a browser that learns from every tab you open. They shared the story with @danshipper on AI &I. https://x.com/every/status/1940427109467570430

Google to invest $25 billion in data centers, AI infrastructure in PJM https://www.cnbc.com/2025/07/15/google-to-invest-25-billion-in-data-centers-ai-infrastructure-in-pjm.html

CDAO Announces Partnerships with Frontier AI Companies to Address National Security Mission Areas > Chief Digital and Artificial Intelligence Office > PR-View https://www.ai.mil/Latest/News-Press/PR-View/Article/4242822/cdao-announces-partnerships-with-frontier-ai-companies-to-address-national-secu/

A new agentic browser just shipped from Perplexity and it’s pretty wild. Watch this video of @PerplexityComet taking over my LinkedIn tab and taking actions on my part. Interesting UX where the tab glows blue as it’s taking actions. I like the integration of agentic actions https://x.com/ryancarson/status/1942962447369036201

AI-powered browsers like Perplexity’s Comet promise to do your web surfing for you. But do they really save time, or just add more noise? 🌐 https://x.com/fdaudens/status/1945121374063698080

Ask Comet to book a meeting or send an email. Comet transforms entire sessions into single, seamless interactions. https://x.com/PerplexityComet/status/1943026179960873207

asked @PerplexityComet to load up our brand colors in @MeetGamma then shifted my focus to building the actual content of the deck https://x.com/jennysvng/status/1943074383091671529

Been using @PerplexityComet, and there are soo many new use cases for it, but this has got to be one of my favs: I received a verification link sent to my Gmail, and I asked Comet Assistant to click it and verify me on my behalf. And it did it! Simple yet useful ^_^ https://x.com/_Matskuu/status/1942977239974400170

BREAKING 🚨: Comet Browser can now control an open web page from a sidecar! Now it can simply take it over and click around. Making Comet to publish a blog post for me 👀 https://x.com/testingcatalog/status/1928546603448562087

Browse at the speed of thought. https://x.com/PerplexityComet/status/1942968195419361290

Comet browser applying for a job for me 👀 Soon, you will be able to execute such things on a schedule. https://x.com/testingcatalog/status/1926043202684854674

Comet has become a natural extension of all my workflows, ideas, and content since I started using it. I can easily recall any saved information and connect to all of my personal knowledge management tools. Effortless networked intelligence. Proud of this team! https://x.com/camerontstow/status/1943047355944833153

Comet… is nuts. I asked it to go find the subreddits that people would ask cooking questions on. Then, find common questions and come up with ad angles for those questions for Hexclad. For kicks, I asked it to make a static ad for me with my fav angle Results. Are. Insane. https://x.com/NathanSnell/status/1943095214932943291

cool query on my comet browser for handling my X addiction. https://x.com/AravSrinivas/status/1912592179291385896

First test of Perplexity’s new agentic browser, Comet 👇 Comet authenticates into your accounts (e.g. email, calendar) to take actions on your behalf. It pulled a list of all my email newsletters, and unsubscribed from the specific ones I asked it to 🤯 https://x.com/omooretweets/status/1943078090718220653

Hooolllyyy crap. Perplexity’s comet browser is insane. Operator was a total dud. Manus is better but meh. Videos coming. I asked it to duplicate a meta campaign for me. No problem. All automated. Anyone want me to try anything specific? https://x.com/NathanSnell/status/1943062637656338805

How to watch YouTube on Comet https://x.com/AravSrinivas/status/1946240617031606672

I feel like I’m living in the future right now. Been using the new browser called Comet from @perplexity_ai (thanks @AravSrinivas for getting me access!) Like millions of others, I spend hours and hours a day in a browser. Specifically, Chrome. And, Chrome hasn’t”” / X https://x.com/dharmesh/status/1943084541733933189

Let Comet handle the customer support reps for you. Customer support is already a lot of AI anyway. So let your AI talk to the other AIs while you watch YouTube or do some work :-)”” / X https://x.com/AravSrinivas/status/1944778316323717437

Memory is magic when it works. Comet is “memory-native” – the closest approximation of truly understanding the user there is. https://x.com/AravSrinivas/status/1944078543324844077

Perplexity Comet https://comet.perplexity.ai/

Perplexity Comet vs ChatGPT Agent”” / X https://x.com/AravSrinivas/status/1946076236683624616

PERPLEXITY COMET WORKS ON DUNE FOR CONTENT IDEATION!!!! SO COOL! https://x.com/0xDataWolf/status/1943265415322595630

Perplexity is testing new feature with Comet browser which will be able to just go out there and do things for you via prompts. Exciting times ahead https://x.com/AIProductPM/status/1940108252559081764

Prime Day Shopping with Comet. User saves $280 in less than 5 minutes by asking Comet to compare prices.”” / X https://x.com/AravSrinivas/status/1944183680915714548

RT @itsPaulAi: Perplexity Comet can automate any task in your browser This is the first time you REALLY have an AI agent working autonomou…”” / X https://x.com/denisyarats/status/1945321982725382170

RT @PerplexityComet: Clean up your inbox. Ask Comet to unsubscribe you from spam and unwanted emails. https://x.com/AravSrinivas/status/1945232153609978273

RT @rowancheung: Perplexity Comet is not like other agents I’ve been testing it all week, and it’s starting to actually *stick* Having in…”” / X https://x.com/AravSrinivas/status/1945620938068037633

The Cursor for Web Browsing, is here. And it’s better than Comet at turning your open tabs and bookmarks into a codebase. Here is a full breakdown of how i’m using @diabrowser Exploring the Future of Browsing with DIA Browser: Essential Features for Content Creators & https://x.com/rileybrown_ai/status/1943041778304847889

The most interesting thing about Perplexity Comet is that it can actually do things in Cal / Gmail Ex. I asked it to reschedule a 1:1 – it moved the invite and sent an email Neither Google nor OpenAI have done this in their agents…maybe for safety reasons, but it’s limiting 🤔 https://x.com/omooretweets/status/1943116119243416009

The TAM for Comet is bigger than Perplexity because it appeals to people who don’t even want AI. Just the best core browser in the market at the end of the day.”” / X https://x.com/AravSrinivas/status/1946035102150238475

USE CASE 2: Cross-tab product comparison If you’re looking for a new product or looking for flights, Comet can compare tabs in real time It’s surprisingly fast and analyzes the reviews of the tabs too https://x.com/rowancheung/status/1945524017915674879

USE CASE 3: Summarize any YT video with a click You can summarize + chat with any long YT video and get key moments This is also possible in Gemini, but having it in the browser means you can watch the video AND chat/learn with Comet in the side tab at the same time https://x.com/rowancheung/status/1945524019681480992

Vibe coding with @PerplexityComet – asked the browser agent to build me a simple (locally run) yt-dlp wrapper. It navigated to github,created the repo, wrote/committed/pushed the code. You can even make changes to your code from the sidecar, feels like an AI IDE lmao 😂 https://x.com/killuaz0ldyck07/status/1942976067075281248

When you’re on Comet, you’re operating at an abstraction above which AI to use and how to pull in relevant context. Agents are powerful and operate like a human would to complete the task. You go from chat turns to end-to-end workflows. https://x.com/AravSrinivas/status/1944024356138758367

New AI features in Google Search: Call a business or do research
https://blog.google/products/search/deep-search-business-calling-google-search/

We’re bringing Gemini 2.5 Pro to AI Mode: giving you access to our most intelligent AI model, right in @Google Search. With its advanced reasoning capabilities, watch how it can tackle incredibly difficult math problems, with links to learn more ↓ https://x.com/GoogleDeepMind/status/1945515683451736246

Google and Brookfield strike $3bn hydro power deal https://www.ft.com/content/d8bef8a3-5988-4080-ad7d-61bc9885e6ba

Google just inked a $3B deal for hydro power to run its AI data centers. Big Tech is scrambling for clean, reliable energy as AI’s appetite explodes. ⚡ https://x.com/fdaudens/status/1945121372465754471

Google’s latest AI security announcements https://blog.google/technology/safety-security/cybersecurity-updates-summer-2025/

Walmart revealed details of Element, an internal platform that lets its engineers build AI apps for internal use based on shared resources without spending time evaluating tools or risking vendor lock-in. Element runs on Google Cloud, Microsoft Azure, or Walmart data centers https://x.com/DeepLearningAI/status/1945257067389821399

Diffusion video models but now – **realtime**! Simple video filters are real-time but can only do basic re-coloring and styles. Video diffusion models (Veo and friends) are magic, but they take many seconds/minutes to generate. MirageLSD is real-time magic. Unlike simple video”” / X https://x.com/karpathy/status/1945979830740435186

🎥 Want the text from any YouTube video? Now you can — no plugins, no installs. Just drop the link, and our YouTube MCP turns it into text instantly. Try it now with this Agent: https://x.com/OmniMCP/status/1942855673324397021

I built an MCP server for editing videos It takes Google Drive video links and editing instructions You can export the output to any video editor https://x.com/itstundealao/status/1939675731077796099

NotebookLM introduces curated featured notebooks with partners https://blog.google/technology/google-labs/notebooklm-featured-notebooks/

Coming off @Google IO, we’ve made it possible to build AI Agents with real-time data from verified sources via Google ADK + Dappier 🧠⚡ – Define agents and tools using Google ADK – Plug into Dappier for web search + latest data for stocks, sports, news, and more https://x.com/DappierAI/status/1928430036257759269

MedGemma is a really interesting model – very small, multimodal, open, and does quite well in out-of-distribution medical tasks compared to much larger models. Would love to see more work thinking about how to improve & deploy this sort of LLM to support medical professionals https://x.com/emollick/status/1943142004537393456

I made my first internet dollar(s) this weekend🤯 Been releasing free resources (tutorials, source code) for a while now, esp. on YouTube. Decided to release a Builder Pack that helps accelerate agent development with Google’s ADK/A2A So grateful & humbled for the support ❤ https://x.com/chongdashu/status/1934599562959720607

Just dropped: ADK-Agent-Examples! Build AI agents with @Google’s Agent Development Kit + Nebius AI Studio. From multi-agent pipelines to tool integrations, it’s a playground for LLM-powered apps. 🧠 Powered by @AIatMeta Llama 3, @Alibaba_Qwen 3, @deepseek_ai R1, via Nebius https://x.com/nebiusaistudio/status/1927759640961466404

Host MCP servers on Cloud Run! Need a hosting platform to support the tools and resources your AI agents interact with? Deploy MCP servers to Cloud Run to take advantage of Cloud Run’s pay-per-use, automatic scaling infrastructure with GPU instances → https://x.com/GoogleCloudTech/status/1940825366936813846

Built something fun: Dia Browser insertion cursor but works everywhere https://x.com/naveennaidu_m/status/1889554727362593215

so I integrated the google_search tool for my agent to be able to search the internet. Here, I asked ‘tory’ my agent, what infoFi was, and it told me! if this isn’t cool, I don’t know what is. https://x.com/islathebuilder/status/1930913771284738112

Holy Shit! Dia Browser (@diabrowser) killed 𝕏’s X Pro feature with their “”Split View Pane”” feature. 🤯 https://x.com/MehulFanawala/status/1940640193008288021

RT @pfau: Just saw the phrase “”Big Token”” to describe OAI/Anthropic/GDM/xAI/Meta and now I can’t stop thinking about it.”” / X https://x.com/zacharynado/status/1945585062109417899

Fine-tune Gemma3n on videos with audios inside with Colab A100 🔥 Just dropped the notebook where you can learn how to fine-tune Gemma3n on images+audio+text at the same time! https://x.com/mervenoyann/status/1945481841298813403

Google to pay $2.4 billion in deal to license tech of Windsurf, WSJ reports | Reuters https://www.reuters.com/business/google-pay-24-billion-deal-license-tech-windsurf-wsj-reports-2025-07-12/

RT @deedydas: Google DeepMind just dropped this new LLM model architecture called Mixture-of-Recursions. It gets 2x inference speed, reduc…”” / X https://x.com/algo_diver/status/1945397388946104742

🚑 Built a health chatbot that suggests the right drugs based on your symptoms like a mini-doctor in your pocket. Powered by Google ADK. Just smart automation. 📷Demo below 👇 #TechAndVibes #NaijaToTheWorld #CodeAndCulture #DeveloperLifestyle https://x.com/ali_ogochu2581/status/1934663525022007341

I’m building some Java and Python agents with ADK, and bumping into issues that I can’t (yet) find Google results for. Thankfully, we shipped llms.txt files for the ADK ( https://x.com/rseroter/status/1929292162593706321

USE CASE 1: Email and Calendar management With the Google Cal and Gmail integrations, it can summarize emails, schedule meetings, and manager calendar events directly from the browser It’s weird that Google hasn’t done this yet https://x.com/rowancheung/status/1945524016103796993

I built a scraper with @n8n_io to scrape tiktok videos and breaks down best performing viral hooks on tiktok account All you have to do is: 1. simply give it a profile link 2. Save hooks data in your google sheet 3. it will break down their top performing hooks for you Amazing https://x.com/heart_yi/status/1940054596694761623

[2507.06261] Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities https://arxiv.org/abs/2507.06261

Building a Multi-Agent Deep Researcher with Gemini 2.5 Pro 🧑‍🔬📑 We’re excited to collaborate with @_philschmid and the @googleaidevs team on a brand-new tutorial 🧑‍🏫 : Build a multi-agent system with a researcher, writer, review agent that can search the web, record input, https://x.com/jerryjliu0/status/1944882346731430127

Google engineers shifted to a sparse mixture‑of‑experts transformer that picks only the needed mini‑networks per token, so compute stays low while total capacity rises. —- Paper – arxiv. org/abs/2507.06261 Paper “”Gemini 2.5: Pushing the Frontier with Advanced https://x.com/rohanpaul_ai/status/1944022179869241354

Google’s Gemini 2.5 paper has 3295 authors https://x.com/hardmaru/status/1944385851435205035

New Guide! Learn how to build a multi-agent “Deep Research” system with Gemini 2.5 and @llama_index. It dynamically searches the web, takes notes, and writes a comprehend research report with a feedback loop 🚀 🔍 Search the web with google 📝 Take notes with a dedicated https://x.com/_philschmid/status/1944835088039977124

Today we are rolling out our first Gemini Embedding model, which ranks #1 on the MTEB leaderboard, as a generally available stable model. It is priced at $0.15 per million tokens and ready for at scale production use! https://x.com/OfficialLoganK/status/1944806630979461445

Tried out Google’s ADK(agent development kit) and legit, the inbuilt UI with Gemini Free API is wild 🤯 So easy to use and looks sick! #ADK #GoogleAI #GeminiAPI https://x.com/027_Priyanshu/status/1934106038632153243

What if you had a smart personal assistant living in your watch that could share info and manage tasks for you when your hands are full? 🧠 You’re about to find out. Meet Gemini, rolling out now on Wear OS 4+ watches: https://x.com/WearOSbyGoogle/status/1942961942693359894

Build your first AI agent + MCP Server in Python. Here is everything you need to build your first AI agent in less than 20 minutes. About the code you’ll see here: 1. I used Google ADK with Gemini Flash to power the agent 2. The agent connects to an MCP server 3. It also https://x.com/svpino/status/1929881755915366772

Gemini CLI can automate your computer using MCP 🔥 Add Windows MCP (or macOS MCP) to Gemini CLI and you can tell it what to do autonomously. Gemini then takes control of your entire system to achieve the goal you’ve set. Links below https://x.com/itsPaulAi/status/1940903613888696776

Someone vibe coded an Al Agent that can use your phone on its own. He outlines using this as a ChatGPT-like interface, except things actually get done automatically. The person built this using Google ADK and the Gemini API 💀 Credits: Tyrange-D via r/singularity https://x.com/DigestibleAICo/status/1924218874678960504

RT @OfficialLoganK: Today we are rolling out our first Gemini Embedding model, which ranks #1 on the MTEB leaderboard, as a generally avail…”” / X https://x.com/demishassabis/status/1944870402251219338

Gemini-CLI is bad compared to Claude code in very fixable ways codex-cli is bad in odd ways. Feels unfriendly, unlike the GUI version of Codex and unusual for product-strong OpenAI”” / X https://x.com/kylebrussell/status/1945242558487044118

18/ How are you using comet? Any use cases that I missed? Follow @AtomSilverman and @AgentOpsAI for everything AI agent-related Have you tried the @AgentOpsAI MCP server? Link in bio. Last week’s thread: https://x.com/AtomSilverman/status/1944456541169762363

a fresh batch of comet invites just went out”” / X https://x.com/AravSrinivas/status/1945669970618421699

Looks like Grok 4 is 10^27 FLOPs given their graphs? HLE score is 26% without tools, Gemini 2.5 is 21.6% without tools. Curious what the tool piece is.”” / X https://x.com/emollick/status/1943162710725657055

LLMs for IMO 2025: gemini-2.5-pro (31.55%), o3 high (16.67%), Grok 4 (11.90%). https://x.com/denny_zhou/status/1945887753864114438

Gemini generates the best prompts for Veo 3. Full code below. ““python import time from google import genai from google.genai import types client = genai.Client() operation = client.models.generate_videos( model=””veo-3.0-generate-preview””, prompt=””””””{ “”character_name””: https://x.com/_philschmid/status/1945898590821584989

Generate videos with Veo 3  |  Gemini API  |  Google AI for Developers https://ai.google.dev/gemini-api/docs/video

Start building with Veo 3: our state-of-the-art video generation model now available in paid public preview via the Gemini API and @Google AI Studio. 🎨 Here’s how to try it → https://x.com/GoogleDeepMind/status/1945886603328778556

RT @GeminiApp: A new Gemini feature just dropped and everything is alive?! Now you can turn photos into videos with sound in Gemini.”” / X https://x.com/demishassabis/status/1944939563170062804

Just shipped our first fully-automated newsletter! An experiment in merging editorial quality with AI workflows. Here’s how we built it: – The Funnel filters signal from noise → RSS feeds, Google Scripts, OpenAI – The Editor curates and cleans → @gumloop_ai – The Delivery https://x.com/kazsatamai/status/1933196696781214064

It’s going to be a decade long grind. But, success is not guaranteed for anyone yet. Not even Google. Fun times ahead.”” / X https://x.com/AravSrinivas/status/1944895074774737130

Google’s Gemini 2.5 paper has 3295 authors
https://x.com/hardmaru/status/1944385851435205035

Hello from #ICML2025! 👋 Together with @GoogleResearch, we’re presenting over 140 papers, as well as hosting workshops, talks and demo sessions. Check out our schedule. → https://x.com/GoogleDeepMind/status/1945126704785011188

Veo 3 filling the hold time. https://x.com/emollick/status/1943159044052603100

Trending

Discover more from Ethan B. Holland

Subscribe now to keep reading and get access to the full archive.

Continue reading