A cartoon of Australia, angry at Facebook, shaking its fist.
“In 2012 CUDA was very important. You can’t build anything without it. In 2024 90% of AI developers are actually web developers – and they build off Llama, not CUDA.
“WILD speed at SambaNova – They just launched World’s fastest API 🤯 👉 Llama 3.1 405B @ 132 tokens/sec 👉 Llama 3.1 70B @ 570 tokens/sec @SambaNovaAI
“📣 Announcing the world’s fastest AI platform SambaNova Cloud runs Llama 3.1 405B @ 132t/s at full precision ✅ Llama 3.1 405B @ 132 tokens/sec ✅ Llama 3.1 70B @ 570 tokens/sec ✅ 10X Faster Inference than GPUs Start developing #FastAI ➡️
“The ecosystem around Llama is continuing to push the limits. SambaNova Cloud is setting a new bar for inference on 405B and it’s available for developers to start building today.” / X
“First distilled Llama 3.1 released by @arcee_ai! 🦙 SuperNova is a distilled reasoning Llama 3.1 70B & 8B! 👀 Arcee distilled @AIatMeta Llama 3.1 405B using offline knowledge distillation and combined it with RLHF and model merging to create new #1 open LLMs. SuperNova 70B is
“LLaMA-Omni, a new model for speech interaction 🦙Based on Llama 3.1 8B Instruct ⚡️Low-latency speech 🚀Simultaneous text and speech generation 🤏Trained with 4 GPUs in less than 3 days Model:
Meta Will Soon Get a 100,000 GPU Cluster Too; What’s Life At Character Like Now? — The Information
Facebook admits to scraping every Australian adult user’s public photos and posts to train AI, with no opt-out option – ABC News




