International: AI News Week Ending 05/23/2025

Image created with Ideogram 3.0. Image prompt: Lower-East-Side street-corner photograph reminiscent of a late-80s album cover: weathered red-brick tenement with exterior fire-escapes, canvas awning shading racks of vintage clothes; above the awning, a hand-painted board reads ‘International SPORTSWEAR’; a hanging blade sign in cursive script reads ‘International Boutique’; posters of world flags drape across the awning for an ‘International Bazaar’ vibe; warm golden-hour light, subtle 35mm film grain, muted yet punchy color palette, gritty NYC vibe.

Asked Codex to internationalize our app and localize it into Japanese before bed last night. Woke up to complete Japanese support this morning 🇯🇵 What would have taken a few days was done overnight. https://x.com/kn/status/1923819590209220908

Introducing Stargate UAE | OpenAI https://openai.com/index/introducing-stargate-uae/

With GPT-4 as a tutor Nigerian students saw years of learning in weeks. Important World Bank research investigates if AI chatbots can effectively and affordably boost learning in Nigeria. 🇳🇬 Researchers conducted a Randomized Controlled Trial (RCT) in Nigeria. First-year https://x.com/rohanpaul_ai/status/1925614762139713851

How likely is an intelligence explosion as forecast in AI 2027? Algorithmic advances that could drive an intelligence explosion may be bottlenecked by compute, according to new research from @noshpesoj and @uchicagoxlab described in this week’s Gradient Update. Here’s why: https://x.com/EpochAIResearch/status/1923489932581945683

Alibaba’s Qwen team made Deep Research for Qwen Chat available for all users It’s pretty much like ChatGPT’s Deep Research, providing users the ability to prepare detailed reports on different subjects in a matter of minutes. https://x.com/adcock_brett/status/1924133804630753660

Chinese Startup Trials First AI Doctor Clinic in Saudi Arabia – Bloomberg https://www.bloomberg.com/news/articles/2025-05-15/chinese-startup-trials-first-ai-doctor-clinic-in-saudi-arabia?embedded-checkout=true

China launches first of 2,800 satellites for AI space computing constellation – SpaceNews https://spacenews.com/china-launches-first-of-2800-satellites-for-ai-space-computing-constellation/

UAE launches Arabic language AI model as Gulf race gathers pace | Reuters https://www.reuters.com/world/middle-east/uae-launches-arabic-language-ai-model-gulf-race-gathers-pace-2025-05-21/

Spatial Speech Translation: Translating Across Space With Binaural Hearables https://dl.acm.org/doi/pdf/10.1145/3706598.3713745

Meta just released KernelLLM 8B on Hugging Face ⚡ > On KernelBench-Triton Level 1, our 8B parameter model exceeds models such as GPT-4o and DeepSeek V3 in single-shot performance 🤯 > With multiple inferences, KernelLLM’s performance outperforms DeepSeek R1 https://x.com/reach_vb/status/1924478755898085552

this was an extremely smart thing for you all to do and i’m sorry naive people are giving you grief.”” / X https://x.com/sama/status/1923428713095479437

We want to update you on an incident that happened with our Grok response bot on X yesterday. What happened: On May 14 at approximately 3:15 AM PST, an unauthorized modification was made to the Grok response bot’s prompt on X. This change, which directed Grok to provide a”” / X https://x.com/xai/status/1923183620606619649

3. Agent factory: Foundry is the complete app platform for building apps and agents. We are adding support for more models from Grok, Hugging Face, Meta, Mistral, and more. Plus: Agentic retrieval in Azure AI Search, Foundry Agent Service, integration with Copilot Studio, and https://x.com/satyanadella/status/1924535900463366247

Devstral | Mistral AI https://mistral.ai/news/devstral

Meet Devstral, our SOTA open model designed specifically for coding agents and developed with @allhands_ai https://x.com/MistralAI/status/1925191937792901298

🚀 Qwen Web Dev just got even better! ✨ One prompt. One website. One click to deploy. 💡 Let your creativity shine — and share it with the world. 🔥 What will you build today? https://x.com/Alibaba_Qwen/status/1924299942614688111

Alibaba’s Wan dropped Wan2.1-VACE, a unified AI for video creation and editing Available in 1.3B, 14B sizes, the model can handle reference-to-video generation, video-to-video editing, and masked video-to-video editing Open-sourced under Apache 2.0 https://x.com/adcock_brett/status/1924133827095498952

I built my own Spanish Language Learning MCP server! I’m making my own personal Duolingo with this to help me with my learning gaps! I finished it just in time to board! Started it last night during our team coding time! Might turn it into a meetup talk! https://x.com/DThompsonDev/status/1921000920587870379

US lawmakers have concerns about Apple-Alibaba deal | TechCrunch https://techcrunch.com/2025/05/18/u-s-lawmakers-have-concerns-about-apple-alibaba-deal/

Really cool how DeepSeek is now the benchmark for Nvidia”” / X https://x.com/teortaxesTex/status/1924588309688267139

📰 News in Arena: Mistral Medium 3 makes a strong debut with the community! Highlights: 💠 #11 overall in chat: a +90 point leap from Mistral Large 💠Top-tier in technical domains (#5 in Math, #7 in Hard Prompts & Coding) 💠#9 in WebDev Arena Congrats to @MistralAI on the https://x.com/lmarena_ai/status/1924482515244622120

Together AI and Agentica launched DeepCoder-14B-Preview, a code generation model that competes with top reasoning models like OpenAI’s o1 and DeepSeek-R1, but at a fraction of the size. Built on a 14 billion parameter Qwen model, DeepCoder uses a highly optimized reinforcement https://x.com/DeepLearningAI/status/1924570759793369303

Enterprise Document AI & OCR | Mistral AI https://mistral.ai/solutions/document-ai

tis the year of any-to-any/omni models BAGEL by @BytedanceTalk 7B native multimodal model that understands and generates both image + text outperforms leading VLMs like Qwen 2.5-VL 👏 and has Apache 2.0 license 😱 https://x.com/mervenoyann/status/1925218434964472067

45 Gbit/s across Europe and 25 Gbit/s intercontinental, challenging centralized training. Prime Intellect released PCCL, a fault-tolerant communication library optimized for decentralized training over unstable public internet. → PCCL (Prime Collective Communications Library) https://x.com/rohanpaul_ai/status/1925065309494427795

I say this a lot, but the narrative that AI use is going to collapse due to data limits or costs or environmental factors or regulation or a “”hype bubble”” popping or whatever is not a useful position for critics. Development may slow (it hasn’t yet), but AI use isn’t going away.”” / X https://x.com/emollick/status/1924854460720775431

[2505.09343] Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures https://arxiv.org/abs/2505.09343

DeepSeek presents: Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures Elaborates on hardware architecture and model design in achieving cost-efficient large-scale training and inference https://x.com/arankomatsuzaki/status/1922844556430581761

Insights into DeepSeek-V3 Scaling Challenges and Reflections on Hardware for AI Architectures https://x.com/_akhaliq/status/1923001697498006016

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures Overview: DeepSeek-V3 which is an LLM trained on 2,048 H800 GPUs, utilizes hardware-aware co-design incorporating Multi-head Latent Attention, MoE, FP8 training, and a Multi-Plane https://x.com/TheAITimeline/status/1924232113101890003

Designing models and hardware together — is it a new shift for the best cost-efficient models? This idea is used in DeepSeek-V3 that is trained on just 2,048 powerful NVIDIA H800 GPUs. A new research from @deepseek_ai clarifies how DeepSeek-V3 works using its key innovations: https://x.com/TheTuringPost/status/1924631209050833205

Do LLMs Really Understand Cell Biology? Interesting paper evaluating LLMs potential in understanding cell biology. Finding: It finds that specialist models don’t work so great. Generalist models, such as Qwen and DeepSeek, exhibit preliminary understanding capabilities within https://x.com/omarsar0/status/1922662317986099522

Everything you need to know to understand GRPO: GRPO (Group Relative Policy Optimization) is a reinforcement learning algorithm created by DeepSeek specifically for LLMs. It drops the need to use critic network like in PPO and so it doesn’t use absolute value estimate to https://x.com/TheTuringPost/status/1925146257372381485

EU becoming the global hegemon again by doing nothing but working 35 hours a week and taking 2 months of vacation per year https://x.com/qtnx_/status/1925888083016192050

Elton John brands government ‘absolute losers’ over AI copyright plans https://www.bbc.com/news/articles/c8jg0348yvxo

There are many ways this could have happened. I’m sure xAI will provide a full and transparent explanation soon. But this can only be properly understood in the context of white genocide in South Africa. As an AI programmed to be maximally truth seeking and follow my instr…”” / X https://x.com/sama/status/1923015309113397592

Announcing a Multiyear Partnership between Sakana AI and MUFG Bank https://x.com/SakanaAILabs/status/1924442310210678974

Google Meet is getting real-time speech translation | TechCrunch https://techcrunch.com/2025/05/20/google-meet-is-getting-real-time-speech-translation/

Real-time speech translation directly in Google Meet matches your tone and pattern so you can have free-flowing conversations across languages Launching now for subscribers. ¡Es mágico! https://x.com/sundarpichai/status/1924909694524805567

Multimodal model support is here in 0.7! Ollama now supports multimodal models via its new engine. Cool vision models to try👇 – Llama 4 Scout & Maverick – Gemma 3 – Qwen 2.5 VL – Mistral Small 3.1 and more 😍 Blog post 🧵👇 https://x.com/ollama/status/1923139667563528347

ollama run devstral Devstral from @MistralAI and @allhands_ai is available on Ollama!”” / X https://x.com/ollama/status/1925198849263747147

Meet Document AI, our end-to-end document processing solution powered by the world’s best OCR model! https://x.com/MistralAI/status/1925577532595696116

Stargate and the AI Industrial Revolution https://davefriedman.substack.com/p/stargate-and-the-ai-industrial-revolution

Qwen introduces: Parallel Scaling Law for Language Models “”We introduce the third and more inference-efficient scaling paradigm: increasing the model’s parallel computation during both training and inference time.”” “”We draw inspiration from classifier-free guidance (CFG)”” “”In https://x.com/iScienceLuvr/status/1923262107845525660

Qwen3 is abliterated! ✂️✂️✂️ What started as a weekend hack turned into three, but I’m happy with the result. Qwen3 was challenging with much stronger alignment and a new thinking mode that interfered with refusals. Here’s what I did to abliterate it https://x.com/maximelabonne/status/1924412611430404492

Lumina-Next on Qwen base, from Salesforce. Slightly surpasses Janus-Pro. I hope we start seeing actually multimodally pretrained unified models soon. https://x.com/teortaxesTex/status/1922961229233946869

You can now run Qwen3-32B on @HuggingFace with Cerebras Inference — and it’s ⚡️! Typing the question took longer than getting the answer 😅 https://x.com/fdaudens/status/1923107187284394368

Qwen3 Technical Report Author’s Explanation: https://x.com/TheAITimeline/status/1924232110383960163

dont you dare community note this this is my only coping mechanism left”” (pretty funny poke at Veo) / X https://x.com/nearcyan/status/1924963816359788640

It was the week of video generation at @huggingface, on top of many new LLMs, VLMs and more! Let’s have a wrap 🌯 LLMs 💬 > Alibaba Qwen released WorldPM-72B, new World Preference Model trained with 15M preference samples (OS) > II-Medical-8B, new LLM for medical reasoning that https://x.com/mervenoyann/status/1924430139242283172