Ethan B. Holland

Over 51,300 manually organized AI links and counting

Security: AI News Week Ending 10/31/2025

October 31, 2025

Image created with gemini-2.5-flash-image with claude-sonnet-4-5. Image prompt: Cinematic 80s film still of a neighborhood watch volunteer in reflective vest standing under warm streetlamp glow on autumn suburban street at dusk, holding clipboard and walkie-talkie, watching costumed children trick-or-treat among decorated houses with jack-o’-lanterns and orange lights, fallen leaves on sidewalk, protective vigilant mood, Spielberg-era cinematography

Devin now has full computer use capabilities and can share screen recordings. You can control desktop apps, build and QA mobile apps, and automate tedious work. Here are some examples that blew our team away: 1. Making a desktop game https://x.com/cognition/status/1983983151157563762

Taking Bold Steps to Keep Teen Users Safe on Character.AI https://blog.character.ai/u18-chat-announcement/

14/ @svpino shared 8 rules improving AI coding agents. It automates checks for security and quality using Codacy integration. https://x.com/AtomSilverman/status/1981855890358858159

Introducing Aardvark, our agentic security researcher:”” / X https://x.com/gdb/status/1983971650531160319

Proud to introduce Aardvark, our agentic security researcher powered by GPT-5. Aardvark hunts for vulnerabilities the way a security engineer would: by reading and analyzing code, writing and running tests, and proposing patches. Now in private beta. https://x.com/embeddedsec/status/1983956550239842474

We reviewed Anthropic’s unredacted report and agreed with its assessment of sabotage risks. We want to highlight the greater access & transparency into its redactions provided, which represent a major improvement in how developers engage with external reviewers. Reflections: 🧵”” / X https://x.com/METR_Evals/status/1983248509752213526

Exclusive first look at Shield AI’s X-Bat AI-piloted fighter drone https://www.cnbc.com/2025/10/21/exclusive-first-look-at-shield-ais-x-bat-ai-piloted-fighter-drone.html

The government does, in fact, use SQL https://x.com/stupidtechtakes/status/1984124850575962280

“OpenAI is now able to release open weight models that meet requisite capability criteria.” let’s gooooo”” / X https://x.com/reach_vb/status/1983167809975922845

GPT OSS 120B | Model library https://www.baseten.co/library/gpt-oss-120b/

gpt-oss-safeguard lets developers use their own custom policies to classify content. The model interprets those policies to classify messages, responses, and conversations. These models are fine-tuned versions of our gpt-oss open models, available under Apache 2.0 license. Now”” / X https://x.com/OpenAI/status/1983507394316710039

Now in research preview: gpt-oss-safeguard Two open-weight reasoning models built for safety classification. https://x.com/OpenAI/status/1983507392374641071

ollama run gpt-oss-safeguard Ollama is partnering with @OpenAI and robust open online safety tools (ROOST) to bring the latest gpt-oss-safeguard reasoning models to users for safety classification tasks. 20B: ollama run gpt-oss-safeguard:20b 120B: ollama run”” / X https://x.com/ollama/status/1983509776530039014

ROOST is also launching the ROOST Model Community, bringing together T&S practitioners and researchers to share best practices for implementing open source AI models into safety workflows. https://x.com/OpenAIDevs/status/1983508959505084849

We just added 4 new models to Tinker from the gpt-oss and DeepSeek-V3.1 families. Sign up for the waitlist: https://x.com/thinkymachines/status/1983041573517701327

Now in private beta: Aardvark, an agent that finds and fixes security bugs using GPT-5. https://x.com/OpenAI/status/1983956431360659467