Ethics/Legal/Security: AI News Week Ending 12/12/2025

Image created with gemini-2.5-flash-image with claude-sonnet-4-5. Image prompt: Black and white photograph of a massive dark cumulonimbus storm cloud advancing from left with sharp shadow line dividing bright sky from darkness, bold white sans-serif text reading ETHICS positioned in clear sky area, high contrast cinematic composition, film grain texture, dramatic atmospheric depth, square format

We tested one of the most common prompting techniques: giving the AI a persona to make it more accurate We found that telling the AI “”you are a great physicist”” doesn’t make it significantly more accurate at answering physics questions, nor does “”you are a lawyer”” make it worse. https://x.com/emollick/status/1998063517681799418

Yes, there is a leak. I had investigated this. Some of the ARC-AGI-1 public evaluation examples can be found in the ARC-AGI-2 training examples. So training on both ARC-AGI-1 and ARC-AGI-2 training data is cheating as it leads to crazy good accuracy for ARC-AGI-1.”” / X https://x.com/jm_alexia/status/1998487516182467055

Trump signs executive order seeking to block state laws on AI https://www.nbcnews.com/tech/tech-news/trump-signs-executive-order-seeking-ban-state-laws-ai-rcna248741

Nvidia Gets US Approval for H200 AI Chip Exports to China – Bloomberg https://www.bloomberg.com/news/articles/2025-12-08/nvidia-set-to-win-us-approval-to-export-h200-ai-chips-to-china

Trump: Nvidia can sell H200 AI chips to China if U.S. gets 25% cut https://www.cnbc.com/2025/12/08/trump-nvidia-h200-sales-china.html

We have just used the @Nvidia H100 onboard Starcloud-1 to train the first LLM in space! We trained the nano-GPT model from Andrej @Karpathy on the complete works of Shakespeare and successfully ran inference on it. We have also run inference on a preloaded Gemma model, and we https://x.com/AdiOltean/status/1998769997431058927

LLMs Make Legal Advice Lossy — /dev/lawyer https://writing.kemitchell.com/2025/12/07/LLMs-Make-Legal-Advice-Lossy

New York Times sues AI startup for ‘illegal’ copying of millions of articles | AI (artificial intelligence) | The Guardian https://www.theguardian.com/technology/2025/dec/05/new-york-times-perplexity-ai-lawsuit

Nvidia-backed Starcloud trains first AI model in space, orbital data centers https://www.cnbc.com/2025/12/10/nvidia-backed-starcloud-trains-first-ai-model-in-space-orbital-data-centers.html

Agentic AI Foundation — advancing open-source agentic AI:”” / X https://x.com/gdb/status/1998897086079832513

Agentic AI Foundation (AAIF) https://aaif.io/

Anthropic is donating the Model Context Protocol to the Agentic AI Foundation, a directed fund under the Linux Foundation. In one year, MCP has become a foundational protocol for agentic AI. Joining AAIF ensures MCP remains open and community-driven. https://x.com/AnthropicAI/status/1998437922849350141

Block – Block, Anthropic, and OpenAI Launch the Agentic AI Foundation https://block.xyz/inside/block-anthropic-and-openai-launch-the-agentic-ai-foundation

Donating the Model Context Protocol and establishing the Agentic AI Foundation \ Anthropic https://www.anthropic.com/news/donating-the-model-context-protocol-and-establishing-of-the-agentic-ai-foundation

Linux Foundation Announces the Formation of the Agentic AI Foundation (AAIF), Anchored by New Project Contributions Including Model Context Protocol (MCP), goose and AGENTS.md https://www.linuxfoundation.org/press/linux-foundation-announces-the-formation-of-the-agentic-ai-foundation

We’re donating MCP to the @linuxfoundation and launching the Agentic AI Foundation with @OpenAI, @blocks, @AWS, @Bloomberg, @Cloudflare, @Google, and @Microsoft. MCP went from internal project to industry standard in a year. Now it gets the long-term stewardship it deserves.”” / X https://x.com/mikeyk/status/1998456026136457532

The GPT-5 Auto router casts a long shadow over AI perceptions. So many examples of “”ChatGPT got X wrong”” are really “”ChatGPT-5 Instant got things wrong,”” leading to beliefs about the state of AI that aren’t true. Which model you get could be clearer &better explained for all.”” / X https://x.com/emollick/status/1998838007609119010

ChatGPT’s ‘Adult Mode’ Is Coming in 2026 https://gizmodo.com/chatgpts-adult-mode-is-coming-in-2026-2000698677

Disney has signed a deal with OpenAI & invested $1 billion into the company Sora will now be able to AI generate videos based on animated, masked & creature characters from Disney, Marvel, Pixar & Star Wars Curated selections of AI generated videos will be released on Disney+ https://x.com/DiscussingFilm/status/1999121515678208153

Disney investing $1 billion in OpenAI, will allow characters on Sora https://www.cnbc.com/2025/12/11/disney-openai-sora-characters-video.html

https://t.co/HngrXph6kU “”The Walt Disney Company and OpenAI reach landmark agreement to bring beloved characters from across Disney’s brands to Sora”” https://x.com/TheRealAdamG/status/1999118075879129140

The Walt Disney Company and OpenAI Reach Agreement to Bring Disney Characters to Sora | The Walt Disney Company https://thewaltdisneycompany.com/news/disney-openai-sora-agreement/

we’re partnering with @Disney to bring 200+ characters from disney, pixar, marvel, and star wars to sora and image generation we are also excited to welcome disney as an investor, and deploy openai models and products alongside the disney team https://x.com/bradlightcap/status/1999177616860020788

AI created visual ads got 20% more clicks than ads created by human experts as part of their jobs… unless people knew the ads are AI-created, which lowers click-throughs to 31% less than human-made ads Importantly, the AI ads were selected by human experts from many AI options https://x.com/emollick/status/1997363286015180856

The War Department Unleashes AI on New GenAI.mil Platform > U.S. Department of War > Release | U.S. Department of War https://www.war.gov/News/Releases/Release/Article/4354916/the-war-department-unleashes-ai-on-new-genaimil-platform/

Horses https://andyljones.com/posts/horses.html

Debugging misaligned completions with sparse-autoencoder latent attribution https://alignment.openai.com/sae-latent-attribution/

Accenture and Anthropic launch multi-year partnership to move enterprises from AI pilots to production \ Anthropic https://www.anthropic.com/news/anthropic-accenture-partnership

We’re expanding our partnership with @Accenture to help enterprises move from AI pilots to production. The Accenture Anthropic Business Group will include 30,000 professionals trained on Claude, and a product to help CIOs scale Claude Code. Read more: https://x.com/AnthropicAI/status/1998412600015769609

Historian Thomas Hughes argued that technologies are malleable when young, then harden. Right now we’re still shaping AI, or at least it is being shaped by our institutions, norms & use cases Eventually these systems build a momentum of their own. That is why choices now matter https://x.com/emollick/status/1998184719817793788

Google Online Security Blog: Architecting Security for Agentic Capabilities in Chrome https://security.googleblog.com/2025/12/architecting-security-for-agentic.html

I meet a lot of very smart AI critics who never seriously try to make AI work for them by spending a couple of hours with a frontier model. People can be (and should be & are) critical after realizing what AI can do, but experience leads to better-informed and sharper critiques.”” / X https://x.com/emollick/status/1998398372986736777

Alignment Is Capability https://www.off-policy.com/alignment-is-capability/

Made this video to explain evals https://x.com/HamelHusain/status/1998452926935695649

Prediction: AI will make formal verification go mainstream — Martin Kleppmann’s blog https://martin.kleppmann.com/2025/12/08/ai-formal-verification.html

Interesting study, but this is somewhat unexpected. (green is programming, yellow is role playing) https://x.com/emollick/status/1996758326877868268

Today we’re introducing OfficeQA, a new benchmark grounded in ~89,000 pages of U.S. Treasury Bulletins that reflects the complex, document-heavy tasks enterprises actually face. Unlike existing benchmarks, OfficeQA measures economically valuable, real-world reasoning: parsing https://x.com/databricks/status/1998424470881525822

So after all these hours talking about AI, in these last five minutes I am going to talk about: Horses. Engines, steam engines, were invented in 1700. And what followed was 200 years of steady improvement, with engines getting 20% better a decade. For the first 120 years of https://x.com/andy_l_jones/status/1998060552565002721

A.I. Is About to Solve Loneliness. That’s a Problem | The New Yorker https://www.newyorker.com/magazine/2025/07/21/ai-is-about-to-solve-loneliness-thats-a-problem

This is incorrect. LLMs can call tools to get info and change about the outside world, including viewing videos, moving robotic arms, etc. The pic below shows a minimal falsifying example. The value of this number squared is new information–it’s never been documented before. https://x.com/jeremyphoward/status/1998177975376986575

Gemini 3 Pro scores 69% trust in blinded testing up from 16% for Gemini 2.5: The case for evaluating AI on real-world trust, not academic benchmarks | VentureBeat https://venturebeat.com/ai/gemini-3-pro-scores-69-trust-in-blinded-testing-up-from-16-for-gemini-2-5

EU investigates Google over AI-generated summaries in search results https://www.bbc.com/news/articles/crl95eg33k1o

US accuses 2 men of smuggling Nvidia chips to China amid Trump AI announcement https://www.axios.com/2025/12/09/us-doj-nvidia-chips-smuggling-china-ai

The New York Times is suing Perplexity for copyright infringement | TechCrunch https://techcrunch.com/2025/12/05/the-new-york-times-is-suing-perplexity-for-copyright-infringement/

DeepSeek is Using Banned Nvidia Chips in Race to Build Next Model — The Information https://www.theinformation.com/articles/deepseek-using-banned-nvidia-chips-race-build-next-model

Some highlights from #Disney CEO Bob Iger and #OpenAI CEO Sam Altman’s interview with CNBC: -The deal is a three-year license, with exclusivity for the first year. -Disney will set (and evolve) the guardrails for how its 200 characters will be used in video creation. -Iger https://x.com/dannybennett/status/1999150474688143750

we are investing in cybersecurity preparedness:”” / X https://x.com/gdb/status/1998882274847461423

Introducing our latest breakthrough in AI search and retrieval: Rerank 4! It’s the most advanced set of reranking models on the market, with best-in-class performance across search relevance, speed, deployment flexibility, multilingual support, and domain-specific understanding. https://x.com/cohere/status/1999162791966745079

Introducing Rerank 4: Cohere’s most powerful reranker yet https://cohere.com/blog/rerank-4

Robot policies fail on the hard parts of manipulation. The moment contact, friction, or force uncertainty shows up, the success rate drops fast. CR DAgger shows a very different path. You take a pre trained policy. You let a human correct it in the real world for a short https://x.com/IlirAliu_/status/1996871611392069708

Department of Energy, PNNL Partner to Power the Nation’s Bioeconomy | News Release | PNNL https://www.pnnl.gov/news-media/department-energy-pnnl-partner-power-nations-bioeconomy

NDAA would mandate new DOD steering committee on artificial general intelligence | DefenseScoop https://defensescoop.com/2025/12/08/fy26-ndaa-dod-ai-artificial-intelligence-futures-agi-steering-committee/

I have been pretty frustrated with the current focus of interpretability research. Promising to see the focus on scalability and generalization. Without these two properties, works often end up being neuron interpretation overfit to a single model and not particularly”” / X https://x.com/sarahookr/status/1997795206096429415

Large scale-experiments in UK, US & Poland where people chatted with LLMs about political topics found AI is very good at persuasion, primarily by providing lots of fact-based claims Plus, AI is getting more persuasive as models grow bigger & persuasion effects lasted over time. https://x.com/emollick/status/1996770000389169205

Chris Olah’s talk is happening right now at the NeurIPS mech interp workshop, room 30, top floor. Called “”reflections on interpretability””! Followed by invited lightning talks at 16:00 https://x.com/NeelNanda5/status/1997812818788467157