Image created with gemini-3.1-flash-image-preview with claude-sonnet-4-5. Image prompt: Flat cartoon illustration of cute coral-red lobster mascot centered on dark charcoal background, holding white speech bubble with ‘Alibaba’ text in black Helvetica font, standing before simplified geometric Chinese gate in orange outlines, minimal flat cloud and shopping bag icons in background, kawaii style, high contrast, web interface aesthetic, clean lines, no gradients.

🎉 Congrats to @Alibaba_Qwen on releasing Qwen3-Coder-Next, and day-0 support is ready in vLLM 0.15.0! An 80B MoE with only 3B active params, matching models 10-20x larger. Built for coding agents and local development. Verified on NVIDIA GPUs. Recipe below 👇 https://x.com/vllm_project/status/2018742511502856568

🚀 Introducing Qwen3-Coder-Next, an open-weight LM built for coding agents & local development. What’s new: 🤖 Scaling agentic training: 800K verifiable tasks + executable envs 📈 Efficiency-Performance Tradeoff: achieves strong results on SWE-Bench Pro with 80B total params and… https://x.com/Alibaba_Qwen/status/2018718453570707465

Qwen releases Qwen3-Coder-Next. 💜 The new 80B MoE model excels at agentic coding & local use. With 256K context, it delivers similar performance to models with 10-20× more active parameters. Run on 46GB RAM or less. Guide: https://t.co/kFrY9qi5co GGUF: https://x.com/UnslothAI/status/2018718997584474191

🏆 Agent-Centric Benchmark Results 🟣 SWE-Bench Verified: Qwen3-Coder-Next >70% with the SWE-Agent scaffold 🟣 Efficient but strong: Despite a small active footprint, it matches or exceeds several much larger open-source models on a range of agent benchmarks https://x.com/Alibaba_Qwen/status/2018719026558664987

[2601.21337] Qwen3-ASR Technical Report https://arxiv.org/abs/2601.21337

> – 1T param with 512 experts!!!!! (22B active) Reminder that Very High Sparsity is the new meta, we’re well into the post-V3 regime. Qwen, Xiaomi are already doing 512 experts in 80B models, with tiny intermediate dim. I think at this rate V4 will have, like, 2048 (16-24 active). https://x.com/teortaxesTex/status/2019245564232364231
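
For a rough sense of the sparsity figures quoted in these posts, the short sketch below computes what share of each model's parameters is active per token. It uses only the numbers stated in the posts (80B total with 3B active, and 1T total with 22B active). The dictionary labels and the `active_fraction` helper are illustrative names, and treating "1T" as exactly 1000B is an assumption.

```python
# Parameter counts in billions, taken from the posts quoted above.
# Labels are illustrative; "1T" is assumed to mean exactly 1000B.
models = {
    "Qwen3-Coder-Next (80B total, 3B active)": {"total_b": 80, "active_b": 3},
    "1T-parameter model from the quoted post (22B active)": {"total_b": 1000, "active_b": 22},
}


def active_fraction(total_b: float, active_b: float) -> float:
    """Return the share of parameters that are active per token."""
    return active_b / total_b


for name, params in models.items():
    frac = active_fraction(params["total_b"], params["active_b"])
    print(f"{name}: {frac:.2%} of parameters active per token")
```

By this arithmetic both models activate only a few percent of their weights per token, which is the "very high sparsity" the post refers to.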

Qwen https://qwen.ai/blog?id=qwen3-coder-next

Qwen-Image-Edit-3D-Lighting-Control app, featuring 8× horizontal and 3× elevational positions for precise 3D multi-angle lighting control. It enables studio-level lighting with fast Qwen Image Edit inference, paired with Multi-Angle-Lighting adapter. Try it now on @huggingface.🤗 https://x.com/prithivMLmods/status/2019084493210992884
