“QwQ-32B evals on par with Deep Seek R1 680B but runs fast on a laptop. Delivery accepted. Here it is running nicely on a M4 Max with MLX. A snippet of its 8k token long thought process: https://x.com/awnihannun/status/1897394318434034163

“The team behind Manus partnered with Alibaba’s Qwen to develop a Chinese version of its autonomous agent The collaboration will integrate Manus with Qwen’s open-source models and computing infra This comes just after Manus went viral last week! https://x.com/rowancheung/status/1899713389439377678

“Alibaba’s QwQ-32B, a 32.5-billion-parameter language model, demonstrates reasoning abilities comparable to much larger models like DeepSeek-R1. Fine-tuned with reinforcement learning in two stages, it excels in math, coding, and other forms of problem-solving while remaining” / X https://x.com/DeepLearningAI/status/1900351166086537659

“Alibaba’s Qwen team dropped QwQ-32B, a reasoning AI that matches or surpasses DeepSeek-R1 at much less cost —20x smaller than DeepSeek-R1 —Priced $0.20 per million input and output tokens —Open-sourced under Apache 2.0 Here it is running on M4 Max: https://x.com/rowancheung/status/1897554323489325517

“Folks, we have set up a github repo for QwQ, specifically providing evaluation scripts for you to easily test the benchmark performance of reasoning models, and also reproduce our reported results. We provide step-by-step guidance for you to run the evaluation, and we hope this” / X https://x.com/Alibaba_Qwen/status/1900595120053047452

“👋 Introducing the Enhanced Qwen Chat We are pleased to announce the latest update to Qwen Chat, designed to deliver a seamless, versatile, and user-centric experience. Explore the key features below and visit https://x.com/Alibaba_Qwen/status/1899497336889659775

Trending

Discover more from Ethan B. Holland

Subscribe now to keep reading and get access to the full archive.

Continue reading