Flux[dev]: Computer source code is arranged to create the form of a futuristic robot, sleek smooth humanoid design. Smooth, glossy black faceplate with no visible facial features, high-tech, minimalist appearance. The robot’s body is matte black or dark gray, with articulated joints and mechanical parts that resemble those of a human, including fingers. In the foreground, “Open Source” is written in glowing green system font.

Hugging Face

“Exclusive: Hugging Face just bought a machine learning platform called XetHub, started by former Apple employees, aimed at letting developers build large-scale AI models.”

XetHub is joining Hugging Face!

Meta/Llama

“Idefics3-Llama is out! 💥 It’s a multimodal model based on Llama 3.1 that accepts an arbitrary number of interleaved images with text, with a huge context window (10k tokens!) 😍 Link to demo and model in the next one 😏”

Call for Applications: Llama 3.1 Impact Grants

“It’s curious how Llama 405b’s performance drops by 5 percentage points when using standard simple-evals prompts instead of its native Llama 3.1 prompts. Other models show much less sensitivity to this prompt change and fall nicely along the 45-degree line.”

“📣 Today we’re opening a call for applications for Llama 3.1 Impact Grants! Until Nov 22, teams can submit proposals for using Llama to address social challenges across their communities for a chance to be awarded a $500K grant. Details + application ➡️

“New smol-vision tutorial dropped: QLoRA fine-tuning IDEFICS3-Llama 8B on VQAv2 🐶 Learn how to efficiently fine-tune the latest IDEFICS3-Llama on visual question answering in this notebook 📖 Link in the next one 🤗”
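The appeal of QLoRA here is memory: the 8B base model's weights are frozen in 4-bit precision while only small LoRA adapter matrices are trained in higher precision. A back-of-envelope sketch of the savings (the rank, hidden size, and layer counts below are illustrative assumptions, not the notebook's actual config):

```python
def model_weight_gb(num_params, bits_per_param):
    """Approximate weight memory in GB (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9

base_params = 8e9  # IDEFICS3-Llama 8B
fp16 = model_weight_gb(base_params, 16)  # full-precision fine-tuning baseline
nf4 = model_weight_gb(base_params, 4)    # QLoRA: frozen 4-bit base weights

# LoRA adds two rank-r matrices (A: d x r, B: r x d) per adapted weight matrix.
# Hypothetical config: rank-16 adapters on 4 projections x 32 layers, d = 4096.
rank, d, num_matrices = 16, 4096, 4 * 32
lora_params = num_matrices * 2 * d * rank
lora_gb = model_weight_gb(lora_params, 16)  # adapters stay in 16-bit

print(f"fp16 weights:  {fp16:.1f} GB")    # 16.0 GB
print(f"4-bit weights: {nf4:.1f} GB")     # 4.0 GB
print(f"LoRA adapters: {lora_gb:.3f} GB") # a rounding error by comparison
```

The base weights shrink 4x, and the trainable adapters are tens of MB, which is what makes an 8B fine-tune fit on a single consumer GPU.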

“@huggingface This is the direct successor of Meta-Llama-3-120B-Instruct, a self-merge of Llama 3 70B that produced great results in tasks like creative writing.”

“The methods from this paper were able to reliably jailbreak the most difficult target models with prompts that appear similar to human-written prompts. Achieves attack success rates > 93% for Llama-2-7B, Llama-3-8B, and Vicuna-7B, while maintaining model-measured perplexity <
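The perplexity constraint matters because a common jailbreak defense is to reject prompts whose perplexity is abnormally high (a telltale sign of adversarial token soup). As a minimal sketch of the metric itself: perplexity is the exponential of the average negative log-likelihood the model assigns to the prompt's tokens. The per-token log-probs below are hypothetical placeholders, not values from any of the models named above:

```python
import math

def perplexity(token_logprobs):
    """Perplexity = exp(mean negative log-likelihood) over a token sequence."""
    nll = -sum(token_logprobs) / len(token_logprobs)
    return math.exp(nll)

# Hypothetical per-token log-probs from a language model:
fluent = [-1.0, -0.5, -1.5, -1.0]      # plausible, human-like text
gibberish = [-8.0, -9.5, -7.0, -8.5]   # adversarial token soup

print(perplexity(fluent))     # ~2.7: low, passes a perplexity filter
print(perplexity(gibberish))  # orders of magnitude higher: flagged
```

Attacks that keep perplexity in the range of natural text, as the quoted paper claims to, slip past this kind of filter.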

Mistral

“Introducing @MistralAI agents! You can now build your agents based on Mistral models or fine-tuned models and use on Le Chat 🙌 More features coming soon!”

“Mistral Large 2 (2407) is now on @lmsysorg. It performs extremely well in the Coding, Hard Prompts, Math, and Longer Query categories, where it outperforms GPT4-Turbo and Claude 3 Opus. It is also doing very well in Instruction Following, where it ranks above Llama 3.1 405B.”

Mistral AI’s CEO on Microsoft and Europe’s AI Ecosystem | TIME

“.@MistralAI Mistral Large doing well on the @allen_ai ZebraLogic benchmark despite being much smaller than the other models 🙌”

Build, tweak, repeat | Mistral AI | Frontier AI in your hands

Qwen

“CONGRATS to @Alibaba_Qwen team on Qwen2-Math-72B outperforming GPT-4o, Claude-3.5-Sonnet, Gemini-1.5-Pro, Llama-3.1-405B on a series of math benchmarks 👏👏👏”

Introducing Qwen2-Math | Qwen

Other Open Source News

“🔥 Meet Yi-Large Turbo: the powerful, cost-effective upgrade to Yi-Large. Faster and more affordable at only $0.19 per 1M tokens for input and output. Ideal for heavy data tasks like complex inference and high-quality text generation. Check it out now: 
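Flat per-token pricing like the quoted $0.19 per 1M tokens makes cost estimation a one-liner. A quick sketch, assuming (as the announcement suggests) the same rate applies to both input and output tokens; the workload numbers are made up for illustration:

```python
PRICE_PER_M = 0.19  # USD per 1M tokens (Yi-Large Turbo, per the announcement)

def cost_usd(input_tokens, output_tokens, price_per_m=PRICE_PER_M):
    """Flat per-token pricing: same rate for input and output tokens."""
    return (input_tokens + output_tokens) / 1_000_000 * price_per_m

# e.g. summarizing 10k documents at ~2,000 input / ~300 output tokens each:
total = cost_usd(10_000 * 2_000, 10_000 * 300)
print(f"${total:.2f}")  # $4.37
```

At that rate, 23M tokens of work costs under five dollars, which is the "cost-effective upgrade" pitch in concrete terms.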

Introducing Palmyra-Med and Palmyra-Fin – Writer
https://writer.com/blog/palmyra-med-fin-models/
