“🚀 Introducing Hymba-1.5B: a new hybrid architecture for efficient small language models! ✅ Outperforms Llama, Qwen, and SmolLM2 with 6-12x less training ✅ Massive reductions in KV cache size & good throughput boost ✅ Combines Mamba & Attention in a Hybrid Parallel …”
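To see why KV cache reductions matter so much for small models, it helps to work out how quickly the cache grows for a plain attention-only transformer. The sketch below uses the standard KV-cache sizing formula; the layer/head/sequence numbers are illustrative assumptions for a roughly 1.5B-class model, not Hymba's actual configuration.

```python
def kv_cache_bytes(n_layers: int, n_kv_heads: int, head_dim: int,
                   seq_len: int, bytes_per_elem: int = 2) -> int:
    """Size of a standard transformer KV cache in bytes.

    Stores one K and one V tensor per layer, each of shape
    (n_kv_heads, seq_len, head_dim), at fp16 (2 bytes) by default.
    """
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_elem

# Illustrative (assumed) config: 24 layers, 8 KV heads, head_dim 64,
# an 8192-token context, fp16 entries.
size = kv_cache_bytes(n_layers=24, n_kv_heads=8, head_dim=64, seq_len=8192)
print(f"{size / 2**20:.0f} MiB")  # hundreds of MiB for one sequence
```

Because the cache scales linearly with sequence length and layer count, replacing some attention layers with Mamba-style state-space blocks (whose state is constant-size in sequence length) is exactly the kind of change that shrinks this footprint.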

SmolVLM – a Hugging Face Space by HuggingFaceTB

“🚀 Mind-blown by SmolVLM – a tiny but mighty vision language model! ✨ Key specs: – 2.25B parameters – Only 5GB GPU RAM needed – Apache 2.0 license – Fine-tunable on Google Colab free tier #AI #MachineLearning”
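The quoted specs hang together: a back-of-the-envelope check shows that 2.25B parameters at half precision already account for most of the 5GB figure, with the remainder going to activations and runtime overhead. A minimal sketch of that arithmetic (the fp16 assumption is mine, not stated in the quote):

```python
def param_memory_gib(n_params: float, bytes_per_param: int = 2) -> float:
    """Memory taken by model weights alone, in GiB.

    bytes_per_param=2 assumes fp16/bf16 weights; use 4 for fp32,
    1 for int8 quantization.
    """
    return n_params * bytes_per_param / 2**30

# SmolVLM's 2.25B parameters at fp16 -> ~4.2 GiB of weights,
# consistent with the quoted "only 5GB GPU RAM needed".
print(f"{param_memory_gib(2.25e9):.2f} GiB")
```

The same function also explains why the Colab free tier (a ~15GB T4) can fine-tune it: even with optimizer state on top of the weights, a 2.25B model at reduced precision fits where a 7B+ model would not.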

SmolVLM – small yet mighty Vision Language Model

Why Small Language Models Are The Next Big Thing In AI

Discover more from Ethan B. Holland

Subscribe now to keep reading and get access to the full archive.

Continue reading