Lumana launches new AI video surveillance system | Security Systems News – https://www.securitysystemsnews.com/article/lumana-launches-new-ai-video-surveillance-system 

Bad Haircut? A Hot Chinese App Is Giving Americans Blunt Advice – WSJ – https://www.wsj.com/tech/personal-tech/bad-haircut-a-hot-chinese-app-is-giving-americans-blunt-advice-b82e67e2?st=5apwke3zcycch1t&reflink=desktopwebshare_permalink 

“Introducing Universal-1, our most powerful speech recognition model to date. Trained on over 12.5 million hours of multilingual audio data, Universal-1 achieves best-in-class speech-to-text accuracy across English, Spanish, French, and German.  https://twitter.com/AssemblyAI/status/1775527556042629437

“Are We on the Right Way for Evaluating Large Vision-Language Models? Large vision-language models (LVLMs) have recently achieved rapid progress, sparking numerous studies to evaluate their multi-modal capabilities. However, we dig into current evaluation works and identify  https://twitter.com/_akhaliq/status/1774669369869508743 

Claude

“I used Anthropic Claude 3 Opus model to analyse all 63 Podcasts from Dwarkesh Patel’s channel and extracted useful book recommendations, career advices, interesting ideas, learning tips etc 1.49Million tokens processed, costs $23 ( luckily Anthropic Claude 3 has provided me…  https://twitter.com/arunprakashml/status/1774989084307624144

Gemini

“AIs are really impressive at figuring out context from video. I uploaded a minute of me playing Balatro, a new roguelike deckbuilding poker game, and Gemini 1.5 was able to figure out the core gameplay loop from watching me play once. Surprisingly good insights.  https://twitter.com/emollick/status/1775712704738545940

Heads up! You’ve scrolled to the end of this category. There may have been just one or two links (above), so go back up and double check to be sure you didn’t quickly scroll down past it.

Be Sure To Read This Week’s Main Post:

This week’s executive overview and top links are here:

AI News #27: Week Ending 04/05/2024 with Executive Summary and Top 48 Links

The post you just read is an deep dive extension of my weekly newsletter, This Week In AI, an executive summary of the top things to know in AI. Each week, I create an accessible overview for laypeople to feel confident they are conversant with the week’s AI developments. I include a curated list of must-click links of the week, to offer everyone a hands-on opportunity to explore the most intriguing updates in artificial intelligence across various categories, including robotics, imagery, video, AR/VR, science, ethics, and more. Beyond the overview, I post these topic-based deeper dives (below). If you haven’t read this week’s overview, I recommend starting there.

Credits/Sources

Most of these weekly links come from just a few prolific oversharing sources. Please follow them, as they work hard to find the news each week and they make it a lot easier for me to compile.

For previous issues, please visit the archives!

Thanks for reading!

One response to “Multimodality News: Week Ending 04/05/2024”

  1. […] Multimodal AI News of the Week: This is a broad topic for an single AI model that demonstrates an ability to interact with more than one modality (imagery, video, audio, text). Often multimodal news will end up in one of these categories. I’m playing it by ear on a case by case basis. Please be patient with my organizational challenges.This week’s multimodal AI news: https://ethanbholland.com/2024/04/06/multimodality-news-week-ending-04-05-2024/ […]

Leave a Reply

Trending

Discover more from Ethan B. Holland

Subscribe now to keep reading and get access to the full archive.

Continue reading