Ethics/Legal/Security: AI News Week Ending 12/19/2025

Image created with gemini-2.5-flash-image with claude-sonnet-4-5. Image prompt: Photorealistic 35mm cinema shot of child from side angle in warm bedroom watching panoramic TV screens showing content moderation interfaces, scattered diverse books with visible spines around plush rug, small brass balance scale on nightstand slightly tipped, warm peach lighting contrasted with cool blue screen glow, shallow depth of field, cozy yet subtly unsettling atmosphere, large bold ETHICS text at top

A Message from AI Research Leaders: Join Us in Supporting OpenReview https://x.com/openreviewnet/status/2001835887244501221

it’s true, i can code nyt didn’t fact check that one 🤷‍♂️”” / X https://x.com/alexandr_wang/status/2001217783497945140

OpenAI Rolls Back ChatGPT’s Model Router System for Most Users | WIRED https://www.wired.com/story/openai-router-relaunch-gpt-5-sam-altman/

Generalist robots need a generalist evaluator. But how do you test safety without breaking things? 💥 🌎 Introducing our new work from @GoogleDeepMind: Evaluating Gemini Robotics Policies in a Veo World Simulator https://x.com/Majumdar_Ani/status/1999525259276423569

⚖️ Pairwise Annotations: Scores are hard, preferences are easy. Agents handle tasks that are tough to score but easy to compare: support responses where tone matters, code refactors where both work but one feels cleaner, product specs where “”good”” is subjective. In practice, https://x.com/LangChain/status/2001361753851203724

Replit — Inside Replit’s Snapshot Engine: The Tech Making AI Agents Safe https://blog.replit.com/inside-replits-snapshot-engine

When Agents Attack: How AI Collapses and Rebuilds Marketplace Moats https://www.caseyaccidental.com/p/when-agents-attack-how-ai-collapses

I love the expression “food for thought” as a concrete, mysterious cognitive capability humans experience but LLMs have no equivalent for. Definition: “something worth thinking about or considering, like a mental meal that nourishes your mind with ideas, insights, or issues that”” / X https://x.com/karpathy/status/2001699564928279039

If the last month tells us anything about AI… it is that nobody has figured out a good naming scheme for AI models that lets non-experts understand which one to pick & how big an improvement it might represent.”” / X https://x.com/emollick/status/1999212790418915431

A thing that the other models need to copy from Claude is a switch that lets you turn off web search. Now that all the models are good at using tools, they turn to the web too often when sometimes you just want the model to take what you put in the context window & work with that https://x.com/emollick/status/2000807086880694752

Claude Skills can accomplish a lot of hard tasks & are accessible to non-technical people, but hidden behind a somewhat intimidating technical gloss. With some better user experience, they are a natural sequel to GPTs as a way for people inside organizations to innovate with AI.”” / X https://x.com/emollick/status/1999148820668555520

First Look: Unboxing Guardrails for AI-Generated Code https://webinars.sonatype.com/wcc/eh/5011667/lp/5151488/first-look-unboxing-guardrails-for-ai-generated-code/

harnesses are distribution mechanisms for good tooling and taste each choice helps craft the ✨experience✨ for the user planning view, context management on behalf of user, specialized subagents we think are useful, UX flow for viewing subagents, memory updates UX, parallel”” / X https://x.com/Vtrivedy10/status/2001492640076894661

Interpretability agents are a big deal for researchers. But they’re a pain – research is so custom! Seer has many quality of life improvements to make research with agents easy. It’s hackable & extensible, to enable as much research as possible, incl weird cursed techniques!”” / X https://x.com/NeelNanda5/status/2002051650949943346

Official rule for all AI labs: no more demoing your product with either telling the AI to “book a trip for me” or creating AI photos/videos of your company’s CEO in crazy situations. Sorry, those are the rules now. https://x.com/emollick/status/2001119366557900914

We’ve received some feedback about a potential degradation of Opus 4.5 specifically in Claude Code. We’re taking this seriously: we’re going through every line of code changed and monitoring closely. In the meantime please submit any transcripts with issues through /feedback”” / X https://x.com/trq212/status/2001541565685301248

Inference Economics 101: Reserved Compute versus Inference APIs https://www.datagravity.dev/p/inference-economics-101-reserved

I have never been more certain that if AI development stopped today, we would still have massive & rolling disruption across society & the economy for the next ten years as people figured out how to harness what models can already do. And the end of AI progress seems unlikely.”” / X https://x.com/emollick/status/1999242260945178813

.@AIatMeta clarified a concept we strongly support – human and AI co-improvement. When building AI systems that work with human researchers at every step – from ideas to experiments – we can create safer intelligence and tech. Here is how to train AI specifically for research https://x.com/TheTuringPost/status/1999294766664831253

No verifiers? No problem. 🤝 The Together Research team is excited to introduce RARO — a new paradigm that unlocks scalable reasoning. By teaching LLMs to reason through adversarial games, we’re seeing promising results where standard RL fails. Check it out now and let us know”” / X https://x.com/togethercompute/status/2000631170909057390

Overall I’m very excited to see this! I’ve been wanting more transparency into how models are improving at science – we expect models to see the same breakthroughs for science in the next year or so as they have shown in coding to date. Big things are coming.”” / X https://x.com/jungofthewon/status/2001302387949236510

Please join me, Doina Precup @kchonyc @AndrewYNg @Yoshua_Bengio @rshaveddinov @earnmyturns in providing financial support for Open Review. It is one of the most important open platforms for quality AI research. We must ensure that it is well funded and can fulfill its mission.”” / X https://x.com/jpineau1/status/2001843615598092414

What Actually Is Claude Code’s Plan Mode? | Armin Ronacher’s Thoughts and Writings https://lucumr.pocoo.org/2025/12/17/what-is-plan-mode/

The “”compacting conversation”” thing that Claude does as a chatbot doesn’t work as well as it does for coding. It doesn’t seem built for knowledge work, abruptly resetting everything in terms of tone and flow. Rolling context windows (like ChatGPT) might be better, or an option.”” / X https://x.com/emollick/status/2000411848496291897

By using AI for writing, you’re robbing yourself of the authentic writer’s experience of not writing”” / X https://x.com/NC_Renic/status/1999351657730290042

Galaxy gas is often dismissed as a “drug”–just as prediction markets were once dismissed as “gambling.” Like any new innovation, it doesn’t fit neatly into any one category. We’re already seeing giant nitrous oxide cannisters disrupt traditional whipped cream methods, reshape https://x.com/coffeebreak_YT/status/2001753564620747195

In the next two years, AI data centers will not only cause electric bills to soar, but are expected to generate the same emissions as driving over 300 billion miles — or 1,600 round trips to the sun from Earth. We need a moratorium on the construction of new AI data centers. https://x.com/BernieSanders/status/2001314722205831191?s=20

Many people know that its now trivially easy to fake any image you want. Yet this does not seem to have changed people’s appetite for believing images that support their view without any further research or consideration, no matter how outlandish or even flawed the images are.”” / X https://x.com/emollick/status/2000722538763026678

money wont matter in the future!”” he tells me as he desperately tries to become as rich as possible”” / X https://x.com/nearcyan/status/2002050031164231760

This was interesting, suggesting: 1) The market for AI is very dynamic, new models and providers switch leads often, and different leaders exist for different spaces 2) A lot of companies have not leveraged the power of very smart AIs outside of coding and tech, going with cheap https://x.com/emollick/status/1999677698210431034

US government launches ‘Tech Force’ to hire AI talent | CNN Business https://www.cnn.com/2025/12/15/tech/government-tech-force-ai

Word of the Year 2025 | Slop | Merriam-Webster https://www.merriam-webster.com/wordplay/word-of-the-year

Creators Coalition on A.I. https://www.creatorscoalitionai.com/

An engineer showed Gemini what another AI said about its code Gemini responded (in its “”private”” thoughts) with petty trash-talking, jealousy, and a full-on revenge plan 🧵 https://x.com/AISafetyMemes/status/2000620127054598508

Google’s AI Playbook for Sustainability Reporting https://blog.google/company-news/outreach-and-initiatives/sustainability/ai-playbook-sustainability-reporting/

Gemma Scope 2: Helping the AI Safety Community Deepen Understanding of Complex Language Model Behavior – Google DeepMind https://deepmind.google/blog/gemma-scope-2-helping-the-ai-safety-community-deepen-understanding-of-complex-language-model-behavior/

Introducing Gemma Scope 2 🤗Largest open release of interpretability tools (over 1 trillion parameters trained!) 🔬Works as a microscope to analyze all Gemma 3 models’ internal activations 🗣️Advanced tools for analyzing chat behaviors https://x.com/osanseviero/status/2001989567998836818

FunctionGemma has day-0 support on MLX 🔥🚀 A tiny but mighty single-turn function calling model. Great for on-device tool use, MCP, RAG, routing and more. Get started today: > pip install -U mlx-lm Or run it on your iPhone using MLX-Swift. Notebook example: https://x.com/Prince_Canuma/status/2001713991115026738

This is largely being ignored but it’s easily one of the biggest China news of the year. What China is doing with Hainan – a huge island (50 times the size of Singapore!) – is pretty extraordinary: they’re basically making it into a completely different jurisdiction from the”” / X https://x.com/RnaudBertrand/status/2002054459644674550

Holy fuck guys we’re not “”pushing hard”” for or replacing concept artists with AI. We have a team of 72 artists of which 23 are concept artists and we are hiring more. The art they create is original and I’m very proud of what they do. I was asked explicitly about concept art”” / X https://x.com/LarAtLarian/status/2001011042642505833

In my opinion, the model itself is excellent, but the ChatGPT user experience can sometimes limit what it is capable of. In particular, when working with very long texts by uploading them as a .txt file (or similar), ChatGPT may not fully read the entire context and instead rely”” / X https://x.com/Hangsiin/status/2002020993129431181

What the monitor gets to read and the capability of the monitor matters. Stronger monitors that can read CoTs and use more test-time compute get much better fast. Also, post-hoc follow-ups (by asking the model to elaborate) often surface previously unspoken thoughts and boost”” / X https://x.com/OpenAI/status/2001791136223105188

Exclusive | OpenAI Ends ‘Vesting Cliff’ for New Employees in Compensation-Policy Change – WSJ https://www.wsj.com/tech/ai/openai-ends-vesting-cliff-for-new-employees-in-compensation-policy-change-d4c4c2cd?gaa_at=eafs&gaa_n=AWEtsqciWdso3NN0iy3rbFA-KXx3z5dPJjEsCVAXVlnY87sWCE6Czuv4t6nC9lXCZQ%3D%3D&gaa_ts=693f3554&gaa_sig=a0hK-2rViW77fNz7Tu_1mz_2xa2X4MrzxK5NWMDJx8BpO24B-CAIOA3XR-WHaR7fd4eLdO76GCme73WkcSrNlg%3D%3D

Disney’s OpenAI deal is exclusive for just one year — then it’s open season | TechCrunch https://techcrunch.com/2025/12/15/disneys-openai-deal-is-exclusive-for-just-one-year-then-its-open-season/

Really happy to be working with Disney to bring some magic to Sora and image gen! Disney is the best storytelling company in the world, and our users really, really want to generate content with their characters.”” / X https://x.com/sama/status/1999230400313589874

Some highlights from #Disney CEO Bob Iger and #OpenAI CEO Sam Altman’s interview with CNBC: -The deal is a three-year license, with exclusivity for the first year. -Disney will set (and evolve) the guardrails for how its 200 characters will be used in video creation. -Iger https://x.com/dannybennett/status/1999150474688143750

Energy Department Announces Collaboration Agreements with 24 Organizations to Advance the Genesis Mission | Department of Energy https://www.energy.gov/articles/energy-department-announces-collaboration-agreements-24-organizations-advance-genesis

Last week, a security researcher using our previous model found and disclosed a vulnerability in React that could lead to source code exposure. I believe these models will be a net win for cybersecurity, but we are in the ‘real impact phase’ as they improve. https://x.com/sama/status/2001724828567400700

Codex also getting very good at finding security vulnerabilities. We’re exploring trusted access programs for defensive cybersecurity work, opening up the opportunity for enterprises and the open-source community to produce more secure code. More here: https://x.com/gdb/status/2001758799657603185

We are beginning to explore trusted-access programs for defensive cybersecurity work.”” / X https://x.com/sama/status/2001724830584901973