a map lays on the ground in a forest with a trail sign that reads “Open Source” –ar 5:3 –style raw

This week’s category cover theme is a sign in a forest. Each category image prompt is a derivative of the formula “an [category themed object] in a forest with a trail sign that reads “[category name]”. Using a theme each week takes the cover creation time down to about 20 minutes, rather than several hours.

“📰New open models this week: multilinguality, long contexts, and VLMs 🔥 – CogVLM2: multimodal conversational – Yi 1.5 long context – M2-BERT-V2, long-context encoder models – Phi 3 small and medium + vision – Falcon VLM – Mistral 7B 0.3 – Aya 23: multilingual

📰New open models this week: multilinguality, long contexts, and VLMs 🔥

– CogVLM2: multimodal conversational
– Yi 1.5 long context
– M2-BERT-V2, long-context encoder models
– Phi 3 small and medium + vision
– Falcon VLM
– Mistral 7B 0.3
– Aya 23: multilingual pic.twitter.com/0txqbuStMV
— Omar Sanseviero (@osanseviero) May 24, 2024

“BREAKING: California’s newly passed AI bill 📌 “Covered models” trained with over 10^26 FLOPS must be incapable of enabling certain critical harms like creation of WMDs, even if fine-tuned. 📌 Developers must implement the capability to promptly enact a full shutdown of covered

BREAKING: California’s newly passed AI bill

📌 "Covered models" trained with over 10^26 FLOPS must be incapable of enabling certain critical harms like creation of WMDs, even if fine-tuned.

📌 Developers must implement the capability to promptly enact a full shutdown of covered… pic.twitter.com/zhS9WdqA6N
— Rohan Paul (@rohanpaul_ai) May 22, 2024

“This California Bill Makes Zero Sense And Is Targeted At Banning Open-Source AI The bill stipulates that if you train a “model” with some arbitrary amount of compute, then a whole bunch of rules and restrictions apply. Imagine if I stopped training just short of that compute

This California Bill Makes Zero Sense And Is Targeted At Banning Open-Source AI

The bill stipulates that if you train a "model" with some arbitrary amount of compute, then a whole bunch of rules and restrictions apply.

Imagine if I stopped training just short of that compute… pic.twitter.com/U8yjRM7ujn
— Bindu Reddy (@bindureddy) May 22, 2024

IBM makes more AI models open source and lands Saudi Arabia deal

https://finance.yahoo.com/news/ibm-makes-more-ai-models-070402158.html

Open LLM Leaderboard: DROP deep dive

https://huggingface.co/blog/open-llm-leaderboard-drop

“6/ on the national security front, the US has won by leading. restricting open-source AI won’t stop determined adversaries, only slow U.S. innovation and cede that leadership. openness keeps us on the offense. we must shape AI with western values and norms.” / X

6/ on the national security front, the US has won by leading. restricting open-source AI won't stop determined adversaries, only slow U.S. innovation and cede that leadership. openness keeps us on the offense. we must shape AI with western values and norms.
— sarah guo // conviction (@saranormous) May 22, 2024

“strong disagree. 1/ open source isn’t a charity, it’s a strategy for both building and selling. I grew up on linux. the kernel has over 20,000 individual contributors and more than 1,300 contributing companies. linux’s large community helps it remain robust, secure & versatile” / X

strong disagree.
1/ open source isn't a charity, it's a strategy for both building and selling. I grew up on linux. the kernel has over 20,000 individual contributors and more than 1,300 contributing companies. linux's large community helps it remain robust, secure & versatile https://t.co/erDF1XQIIb
— sarah guo // conviction (@saranormous) May 22, 2024

“Reminder: open-source is the foundation of all AI (including closed-source AI)!” / X

Reminder: open-source is the foundation of all AI (including closed-source AI)!
— clem 🤗 (@ClementDelangue) May 23, 2024

“What a week in the ML world! Here is a recap thread on all the exciting open ML updates!🔥 VLMs: Salesforce, Kosmos 2.5, PaliGemma, Cumo LLM: Yi 1.5, Falcon 2, DeepSeek v2 lite Diffusion: HunyuanDiT, Lumina next Keep reading 👇” / X

What a week in the ML world! Here is a recap thread on all the exciting open ML updates!🔥

VLMs: Salesforce, Kosmos 2.5, PaliGemma, Cumo
LLM: Yi 1.5, Falcon 2, DeepSeek v2 lite
Diffusion: HunyuanDiT, Lumina next

Keep reading 👇
— Omar Sanseviero (@osanseviero) May 19, 2024

“I am a little confused about the reason for the plethora of open weights models being offered by providers right now. Given that a few models dominate the leaderboards across many skills, come in multiple sizes & are getting cheaper, fast, what is the point of using the others?

I am a little confused about the reason for the plethora of open weights models being offered by providers right now. Given that a few models dominate the leaderboards across many skills, come in multiple sizes & are getting cheaper, fast, what is the point of using the others? pic.twitter.com/7L7Uedpyvn
— Ethan Mollick (@emollick) May 21, 2024

“LLMs are plateauing and the gap between closed vs. open is almost closed!! If you are look at MMLU open-source is caught up to closed source and we are seeing the LLMs plateau It’s time to move on to different benchmarks that measure LLM capabilities on hard problems The key

LLMs are plateauing and the gap between closed vs. open is almost closed!!

If you are look at MMLU open-source is caught up to closed source and we are seeing the LLMs plateau

It's time to move on to different benchmarks that measure LLM capabilities on hard problems

The key… pic.twitter.com/ywfrzwaz9v
— Bindu Reddy (@bindureddy) May 24, 2024

“You can now fine-tune models using an AI assistant! And it works with any open-source model. No-code fine-tuning and deployment is as good as it gets! Seriously, it’s mind-blowing how far we’ve come. I recorded a video to show you how to use a GPT to fine-tune a model. You

You can now fine-tune models using an AI assistant!

And it works with any open-source model. No-code fine-tuning and deployment is as good as it gets!

Seriously, it's mind-blowing how far we've come.

I recorded a video to show you how to use a GPT to fine-tune a model. You… pic.twitter.com/NyKlXClSyF
— Santiago (@svpino) May 22, 2024

Cohere

Aya | Cohere For AI

https://cohere.com/research/aya

Cohere For AI Launches Aya 23, 8 and 35 Billion Parameter Open Weights Release

https://cohere.com/blog/aya23

@romainhuet Cohere just launched Aya 23 — a family of multilingual LLMs with open weights and support for 23 different languages. Access to top models will soon become crucial for many parts of the world, so democratizing access is a massive step forward.

Cohere just launched Aya 23 — a family of multilingual LLMs with open weights and support for 23 different languages.

Access to top models will soon become crucial for many parts of the world, so democratizing access is a massive step forward. pic.twitter.com/xFWGa7EfAS
— Rowan Cheung (@rowancheung) May 24, 2024

“Switching from French to German to Chinese in the same discussion 😅 Impressive to see @CohereForAI’s new Aya model multilingual capabilities. – C4AI Aya 23 is a research open weights release – 8 and 35 billion parameter models – 23 languages supported You can try it out here:

Switching from French to German to Chinese in the same discussion 😅

Impressive to see @CohereForAI's new Aya model multilingual capabilities.

– C4AI Aya 23 is a research open weights release
– 8 and 35 billion parameter models
– 23 languages supported

You can try it out here:… pic.twitter.com/wMvLO8o5Qj
— Florent Daudens (@fdaudens) May 23, 2024

Gemma

PaliGemma: Open Source Multimodal Model by Google

https://blog.roboflow.com/paligemma-multimodal-vision

Grok

“Wow this is powerful. Grok is able to accurately give me the last closing price of a stock option, and correctly explain the reason for this price. Congrats @grok team, your RAG capabilities are very impressive and useful, Grok will be my exclusive personal assistant from now on

Wow this is powerful. Grok is able to accurately give me the last closing price of a stock option, and correctly explain the reason for this price.
Congrats @grok team, your RAG capabilities are very impressive and useful, Grok will be my exclusive personal assistant from now on… pic.twitter.com/hCO7Va8Wj4
— Julien Salinas (@JulienSalinasEN) May 22, 2024

Elon Musk’s xAI is working on making Grok multimodal – The Verge

https://www.theverge.com/2024/5/21/24161764/elon-musk-xai-grok-multimodal-ai

Open Release of Grok-1

https://x.ai/blog/grok-os

Hugging Face

“No cloud, no cost, no data sent to anyone, no problem. Welcome to local AI on Hugging Face!

No cloud, no cost, no data sent to anyone, no problem. Welcome to local AI on Hugging Face! pic.twitter.com/DtLeLGePKh
— clem 🤗 (@ClementDelangue) May 20, 2024

Hugging Face commits $10 million in free shared GPUs – The Verge

https://www.theverge.com/2024/5/16/24156755/hugging-face-celement-delangue-free-shared-gpus-ai

Experimental Moondream WebGPU – a Hugging Face Space by Xenova

https://huggingface.co/spaces/Xenova/experimental-moondream-webgpu

Meta/Llama

“The first open-source implementation of the paper that will change automatic test generation is now available! In February, Meta published a paper introducing a tool to automatically increase test coverage, guaranteeing improvements over an existing code base. This is a big

The first open-source implementation of the paper that will change automatic test generation is now available!

In February, Meta published a paper introducing a tool to automatically increase test coverage, guaranteeing improvements over an existing code base.

This is a big… pic.twitter.com/mNxgUx1ADA
— Santiago (@svpino) May 21, 2024

“Build a Full-Stack Job Search Assistant with @gokoyeb, @MongoDB, and @llama_index 🧑‍💼🔎 This is a comprehensive end to end tutorial by @rishi_raj_jain_ on building a RAG-powered assistant that streams its response in real-time but can also continuously update its internal

Build a Full-Stack Job Search Assistant with @gokoyeb, @MongoDB, and @llama_index 🧑‍💼🔎

This is a comprehensive end to end tutorial by @rishi_raj_jain_ on building a RAG-powered assistant that streams its response in real-time but can also continuously update its internal… pic.twitter.com/US5vSd69TE
— LlamaIndex 🦙 (@llama_index) May 23, 2024

“Let’s build a crew of AI agents to scrape the web and write blog posts for you, powered by Llama-3 (100% local):” / X

Let's build a crew of AI agents to scrape the web and write blog posts for you, powered by Llama-3 (100% local):
— Akshay 🚀 (@akshay_pachaar) May 23, 2024

“Welcome CogVLM 2 ⚡ > Beats GPT4 V/ Gemini Pro on TextVQA, DocVQA and ChartQA – by a decent margin 🔥 > 19B params > Llama 3 8B (Instruct) text backbone > Supports 8K context length > Upto 1344 X 1344 resolution supported > Works with both Chinese and English > Open access with

Welcome CogVLM 2 ⚡

> Beats GPT4 V/ Gemini Pro on TextVQA, DocVQA and ChartQA – by a decent margin 🔥
> 19B params
> Llama 3 8B (Instruct) text backbone
> Supports 8K context length
> Upto 1344 X 1344 resolution supported
> Works with both Chinese and English
> Open access with… pic.twitter.com/iOjvH06KVA
— Vaibhav (VB) Srivastav (@reach_vb) May 20, 2024

What’s up with Llama 3? Arena data analysis | LMSYS Org

https://lmsys.org/blog/2024-05-08-llama3

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

https://huggingface.co/blog/wolfram/llm-comparison-test-llama-3

Should Meta open-source Llama 3 400B or not?

“Meta open-sourcing Llama-3 400b will make them the biggest hero of our age! Nothing is more important or more urgent!” / X

Meta open-sourcing Llama-3 400b will make them the biggest hero of our age!

Nothing is more important or more urgent!
— Bindu Reddy (@bindureddy) May 23, 2024

“Meta plans to not open the weights for its 400B model. The hope is that we would quietly not notice / let it slide. Don’t let it slide.” / X

Meta plans to not open the weights for its 400B model.

The hope is that we would quietly not notice / let it slide.

Don’t let it slide.
— Jimmy Apples 🍎/acc (@apples_jimmy) May 22, 2024

“Head of Ted is calling Meta reckless for releasing Open source models. While its a fact that Meta is really the leader for OSS contributions. Beyond Llama-3 – we have all the below from them (and this not an exhaustive list) – React – PyTorch – React Native – GraphQL – Jest –

Head of Ted is calling Meta reckless for releasing Open source models.

While its a fact that Meta is really the leader for OSS contributions.

Beyond Llama-3 – we have all the below from them

(and this not an exhaustive list)

– React
– PyTorch
– React Native
– GraphQL
– Jest
-… pic.twitter.com/ulQFSBSWGQ
— Rohan Paul (@rohanpaul_ai) May 22, 2024

Mistral

“New @MistralAI 7B base and instruct 🔥 No magnet link, but a @huggingface repository! 🤗👀 🔡 Extended Vocabulary from 32000 to 32768 🔨 Function calling support 🔓 Apache 2.0 license ❌ No evaluation details Base:

New @MistralAI 7B base and instruct 🔥 No magnet link, but a @huggingface repository! 🤗👀

🔡 Extended Vocabulary from 32000 to 32768
🔨 Function calling support
🔓 Apache 2.0 license
❌ No evaluation details

Base: https://t.co/WqJPkfz693
Instruct: https://t.co/Bh3R4OFUs6

🤗
— Philipp Schmid (@_philschmid) May 22, 2024

“Made a free Colab for Mistral v3! You can QLoRA finetune 2x faster, use 70% less VRAM with no accuracy degradations with @UnslothAI! You can export to vLLM, GGUF, HF inference is 2x faster & we support 4x longer context windows than FA2 (24GB= 56K vs 14K)

Made a free Colab for Mistral v3! You can QLoRA finetune 2x faster, use 70% less VRAM with no accuracy degradations with @UnslothAI!

You can export to vLLM, GGUF, HF inference is 2x faster & we support 4x longer context windows than FA2 (24GB= 56K vs 14K)https://t.co/h03flYPgK7
— Daniel Han (@danielhanchen) May 22, 2024

“Checkout the new Mistral v0.3 models with MLX LM. Pre-quantized models in the 🤗 MLX community

Checkout the new Mistral v0.3 models with MLX LM.

Pre-quantized models in the 🤗 MLX community https://t.co/dUgErUXnM3 h/t @Prince_Canuma !

Generating 512 tokens at 107 toks/sec with the 4-bit model on an M2 Ultra. Models got better but just as fast as ever: pic.twitter.com/NHJ50x22X0
— Awni Hannun (@awnihannun) May 22, 2024

“Let’s fucking go! Mistral just released 7B v0.3 🔥 > Base + Instruct model checkpoints released > Extended vocabulary to 32768 > Supports new v3 Tokenizer > Supports function calling > Uncensored (no moderation tactics used during fine-tuning) Thanks for the sweet surprise,” / X

Let's fucking go! Mistral just released 7B v0.3 🔥

> Base + Instruct model checkpoints released
> Extended vocabulary to 32768
> Supports new v3 Tokenizer
> Supports function calling
> Uncensored (no moderation tactics used during fine-tuning)

Thanks for the sweet surprise,…
— Vaibhav (VB) Srivastav (@reach_vb) May 22, 2024

“BREAKING : Mistral-7B v0.3 has been released 🎇 – Extended vocabulary to 32768 – Supports v3 Tokenizer – Supports function calling Their github repo is claiming that Mixtral 8x7B Instruct and Mixtral 8x7B will be updated soon, probably also in the same fashion as Mistral 7B

BREAKING : Mistral-7B v0.3 has been released 🎇

– Extended vocabulary to 32768
– Supports v3 Tokenizer
– Supports function calling

Their github repo is claiming that Mixtral 8x7B Instruct and Mixtral 8x7B will be updated soon, probably also in the same fashion as Mistral 7B… pic.twitter.com/Sj1HbuErtu
— Rohan Paul (@rohanpaul_ai) May 22, 2024

Mistral AI and Harvey Partnership

https://www.harvey.ai/blog/mistral-announcement

Phi

Microsoft Phi-Silica: 3.3B small AI model made for Copilot+ PC NPUs | VentureBeat

Microsoft introduces Phi-Silica, a 3.3B parameter model made for Copilot+ PC NPUs

New models added to the Phi-3 family, available on Microsoft Azure | Microsoft Azure Blog

New models added to the Phi-3 family, available on Microsoft Azure

“Phi-3 small & medium are now available under the MIT license! 🚀@Microsoft has just launched Phi-3 small (7B) and medium (14B) 🤯. The Phi-3 small model claims to outperform @AIatMeta’s Llama 3 and @MistralAI, and the Phi-3 medium model GPT-3.5 and @cohere Command R+. 🤔 TL;DR:

Phi-3 small & medium are now available under the MIT license! 🚀@Microsoft has just launched Phi-3 small (7B) and medium (14B) 🤯. The Phi-3 small model claims to outperform @AIatMeta's Llama 3 and @MistralAI, and the Phi-3 medium model GPT-3.5 and @cohere Command R+. 🤔

TL;DR:… pic.twitter.com/UsOsKvvFdz
— Philipp Schmid (@_philschmid) May 21, 2024

“Small Models Are Improving Exponentially – Phi-3 14B Is Phenomenal The new Phi-3 14B model scores phenomenally on all benchmarks. On key numbers, it seems to be pretty close to Llama-3-Instruct 🤯🤯 As small models become more and more powerful, we will see 7b-sized GPT-4 class

Small Models Are Improving Exponentially – Phi-3 14B Is Phenomenal

The new Phi-3 14B model scores phenomenally on all benchmarks. On key numbers, it seems to be pretty close to Llama-3-Instruct 🤯🤯

As small models become more and more powerful, we will see 7b-sized GPT-4 class… pic.twitter.com/Xd1XR5ob4v
— Bindu Reddy (@bindureddy) May 21, 2024

“False alarm on the phi-3 models (did very poorly on a few offline benchmarks I have), still using llama-3 fine tuned models for a few specialized services. The phi-3 models seem very sensitive to prompts (not a good thing imo)” / X

False alarm on the phi-3 models (did very poorly on a few offline benchmarks I have), still using llama-3 fine tuned models for a few specialized services. The phi-3 models seem very sensitive to prompts (not a good thing imo)
— anton (@abacaj) May 21, 2024

“Phi-3-vision with 4.2B parameters

Phi-3-vision with 4.2B parameters pic.twitter.com/0iAJWBp9sI
— Rohan Paul (@rohanpaul_ai) May 21, 2024

“The more I look at these numbers the more magical it looks. 😯🔥 Phi-3-small with only 7B parameters beats GPT-3.5T across a variety of language, reasoning, coding, and math benchmarks. Next, GPT4 level model in my pocket GPY by this year end 😯

The more I look at these numbers the more magical it looks. 😯🔥

Phi-3-small with only 7B parameters beats GPT-3.5T across a variety of language, reasoning, coding, and math benchmarks.

Next, GPT4 level model in my pocket GPY by this year end 😯 pic.twitter.com/rIm8J2SyOq
— Rohan Paul (@rohanpaul_ai) May 23, 2024

Other Open Source News

“Great Yi-1.5-34B with much longer context window.” / X

Great Yi-1.5-34B with much longer context window. https://t.co/hdRVXcC5UO
— Rohan Paul (@rohanpaul_ai) May 20, 2024

“You asked for longer contexts🎤 and we heard you!👂 The following models are now available on @huggingface by popular demand: ✅Yi-1.5-34B-32K ✅Yi-1.5-34B-Chat-16K ✅Yi-1.5-9B-32K ✅Yi-1.5-9B-Chat-16K Happy building!

You asked for longer contexts🎤 and we heard you!👂

The following models are now available on @huggingface by popular demand:
✅Yi-1.5-34B-32K
✅Yi-1.5-34B-Chat-16K
✅Yi-1.5-9B-32K
✅Yi-1.5-9B-Chat-16K

Happy building! https://t.co/bwAz0YnYpS https://t.co/oFt08sHkd7
— Yi-01.AI (@01AI_Yi) May 20, 2024

Heads up! You’ve scrolled to the end of this category. There may have been just one or two links (above), so go back up and double check to be sure you didn’t quickly scroll down past it.

Be Sure To Read This Week’s Main Post:

This week’s executive overview and top links are here:

AI News #34: Week Ending 05/24/2024 with Executive Summary and Top 47 Links

The post you just read is an deep dive extension of my weekly newsletter, This Week In AI, an executive summary of the top things to know in AI. Each week, I create an accessible overview for laypeople to feel confident they are conversant with the week’s AI developments. I include a curated list of must-click links of the week, to offer everyone a hands-on opportunity to explore the most intriguing updates in artificial intelligence across various categories, including robotics, imagery, video, AR/VR, science, ethics, and more. Beyond the overview, I post these topic-based deeper dives (below). If you haven’t read this week’s overview, I recommend starting there.

Credits/Sources

Most of these weekly links come from just a few prolific oversharing sources. Please follow them, as they work hard to find the news each week and they make it a lot easier for me to compile.