Image created with GPT Image 1. Image prompt: high-contrast monochrome portrait silhouette on cream backdrop, Low-Life monochrome palette, minimalist graphic design inspired by New Order’s ‘Low-Life’, metaphor for retrieval arrows looping documents, flat color, subtle texture, 1980s Saville typography style

Amazon launched Nova Sonic, a real-time speech-to-speech model with bidirectional streaming, tool calling, and RAG support, delivering low-latency, expressive voice output at top-tier price-performance. → Nova Sonic handles real-time, interactive conversations with human-like https://x.com/rohanpaul_ai/status/1920972570595127640

VLMS 2025 UPDATE 🔥 We just shipped a blog on everything latest on vision language models, including 🤖 GUI agents, agentic VLMs, omni models 📑 multimodal RAG ⏯️ video LMs 🤏🏻 smol models ..and more! find it on the next one ⤵️ https://x.com/mervenoyann/status/1921962750353301986

We’ve built an AI agent that can not only perform highly accurate extraction from the most complex PDFs/Powerpoints/etc., but as of today also give back precise citations and reasoning back to the source element 🧐📜 Some of the most complex documents (insurance policies, https://x.com/jerryjliu0/status/1920182045042749691

Text-based RAG is already outdated. The real competitive edge in AI is building systems that can actually understand charts, graphs, and images – not just text. If you’re not learning Visual RAG now, you’re already behind the curve in enterprise AI development.”” / X https://x.com/jxnlco/status/1922003672701018219

Here are the top AI Papers of the Week (May 5 – 11): – ZeroSearch – Discuss-RAG – Absolute Zero – Llama-Nemotron – The Leaderboard Illusion – Reward Modeling as Reasoning Read on for more:”” / X https://x.com/dair_ai/status/1921606662214787114

Github 👨‍🔧: Scalable Multi-modal RAG → Ingests diverse unstructured data (PDFs, video, text) with intelligent parsing and automatic chunking/embedding. → Implements advanced Retrieval Augmented Generation (RAG) using multi-modal embeddings (ColPali) and integrated Knowledge https://x.com/rohanpaul_ai/status/1922276643520811308

🚀 Very Excited to Introduce FedRAG! Today, I’m pleased to share my latest open-source project that I have been working on since joining @VectorInst this past January. FedRAG is a framework for fine-tuning RAG systems. It endeavours to simplify the fine-tuning of RAG systems https://x.com/_nerdai_/status/1922732119706698118

Trending

Discover more from Ethan B. Holland

Subscribe now to keep reading and get access to the full archive.

Continue reading