Ethan B. Holland

Over 56,600 manually organized AI links and counting

AGI: AI News Week Ending 04/11/2025

April 11, 2025

Image created with Flux Pro Ultra. Image prompt: A Minecraft screenshot showing an enormous redstone computer with glowing circuits, surrounded by strange mechanical constructs building perfect replicas of villages, with “AGI” written in pixelated Minecraft font across the top

AI 2027 https://ai-2027.com/

AI DISRUPT | Hasura | PromptQL https://hasura.io/events/ai-disrupt-promptql

“Let’s take AI predictions from blog posts, podcasts and tweets and move them to betting markets, our state of the art in truth. My struggle has been coming up with good, concrete, resolvable predicates. Ideally, predicates related to industry metrics and macroeconomics. Eg” / X https://x.com/karpathy/status/1908109168952676855

“One way of measuring how good “vibecoding” is with AI is the average number of ambitious requests you can make of the AI to create/edit a codebase before the system starts making errors that it cannot recover from without some experienced guidance. Number has been creeping up.” / X https://x.com/emollick/status/1908155246439731700

“The Jagged Frontier that we first wrote about two years ago continues to define AI. In any real-world, high-end workflow there will be things that AI can’t do & those weaknesses are hard for most to predict in advance Experienced humans can both identify & work around those gaps” / X https://x.com/emollick/status/1908579281917051295

““You will see wonders beyond your imagination, nod, think “that’s a cool wonder”, and become inured to it” A thought-provoking essay on reconstructing meaning in a world of AI content, and the role of our agency in finding meaning & wonder for ourselves. https://x.com/emollick/status/1908924545672585242

“PyTorch just released an awesome tool to visualize matrices and what’s happening inside them. Matrix multiplications (matmuls) are the building blocks of today’s models. It can even run in browser. https://x.com/LiorOnAI/status/1908233269998403980

“Memory is the next scaling laws paradigm shift” / X https://x.com/EdwardSun0909/status/1910384097786290497

“Google Gemini 2.5 is the first public AI model to definitively beat the level of performance of human PhDs with access to Google on hard multiple choice problems inside their field of expertise (around 81%). All AI tests are flawed, but GPQA Diamond has been a pretty good one.” / X https://x.com/emollick/status/1907737487176286418

“I’ve been saying that DeepSeek will expand from verifiable to general domains, and expected a paper. Here is that paper. Self-Principled Critique Tuning. rule-based online RL. Gemma-2 27b is enough to match R1. This is roughly what Google does for Gemma 3 and likely Geminis. https://x.com/teortaxesTex/status/1907987423377666538

“Researchers at UC San Diego demonstrated that AI systems can consistently pass Alan Turing’s famous test of machine intelligence OpenAI’s GPT-4.5 was mistaken for human nearly three-quarters of the time in controlled trials https://x.com/adcock_brett/status/1908913665706721489

“I might be experiencing a rare moment with my AI-powered IDE. It doesn’t feel like luck, I think it’s a glimpse of the future. I’ve also experienced this with products like ChatGPT Canvas. In many instances, it predicted what I was thinking 1 step ahead, and it felt magical.” / X https://x.com/omarsar0/status/1910409193737027639

“New preprint: we evaluated LLMs in a 3-party Turing test (participants speak to a human & AI simultaneously and decide which is which). GPT-4.5 (when prompted to adopt a humanlike persona) was judged to be the human 73% of the time, suggesting it passes the Turing test (🧵) https://x.com/camrobjones/status/1907086860322480233

[2503.23674] Large Language Models Pass the Turing Test https://arxiv.org/abs/2503.23674

“If AGI is about AI transforming our economy—how close are we, really? What’s still missing, and how do we get there? OpenAI’s new Strategic Deployment team tackles exactly these questions. We push frontier models to be more capable, reliable, and aligned—then deploy them to” / X https://x.com/aleks_madry/status/1909686225658695897

“If you wanted to see how little attention folks are paying to the possibility of AGI (however defined) no matter what the labs say, here is an official course from Google Deepmind whose first session is “we are on a path to superhuman capabilities” It has less than 1,000 views. https://x.com/emollick/status/1907810677470712090

“Google made its all-new Gemini 2.5 Pro Experimental model available to all, including free users of the Gemini app The company also shared its safety strategy for AGI, which it expects to be a reality by 2030. https://x.com/adcock_brett/status/1908913530662682956

5/Apr/2025 – ASI checklist item #5, Llama 4 Behemoth 2T on ~30T tokens, 1X NEO AGI – YouTube https://www.youtube.com/watch?v=oipjJwRVW20&t=888s

““We are entering the era of labor abundance.” Bernt Bornich of 1X believes that robots need to grow up in the home, like humans do, with data diversity. Factories have been designed to reduce diversity, so they are a deprived birthplace for physical AGI. https://x.com/FutureJurvetson/status/1909995466588295400

“Figure is the ultimate deployment vector for AGI https://x.com/adcock_brett/status/1909266695933599923

AI Index 2025: State of AI in 10 Charts | Stanford HAI https://hai.stanford.edu/news/ai-index-2025-state-of-ai-in-10-charts