Image created with GPT Image 1. Image prompt: A glowing trench coat made from iridescent archival scrolls and quantum-blue ink flows behind a Black mystic-seeker whose third eye shines with lattice light, walking through an infinite corridor of neural diagrams and burning paper lanterns — knowledge, hunger, and fire embodied.

DeepSeek released Prover-V2, an open-source AI combining informal math reasoning with theorem proving. With 671B params, the model solves 88.9% of problems on MiniF2F. It does a ‘cold-start’ to break down proofs into subgoals before formal verification. https://x.com/adcock_brett/status/1919060364655800684

We just released DeepSeek-Prover V2. – Solves nearly 90% of miniF2F problems – Significantly improves the SoTA performance on the PutnamBench – Achieves a non-trivial pass rate on AIME 24 & 25 problems in their formal version Github: https://x.com/zhs05232838/status/1917600755936018715

Episode 167: Overnight Agent. We share the results of our first overnight agent run. We fed DeepSeek R1 a summary of the new @Cloudflare agents SDK and asked it to think every 15 minutes about the entire conversation history and reflect on new ideas that extend the ideas https://x.com/OpenAgentsInc/status/1901964880594313542

2/ Cloudflare Agent SDK Summary: @OpenAgentsInc fed DeepSeek R1 a summary of the new Cloudflare agents SDK and asked it to think every 15 minutes about the entire conversation history and reflect on new ideas that extend the ideas further. https://x.com/AtomSilverman/status/1918047663800631794

DeepSeek quietly released Prover-V2, an open-source AI combining informal math reasoning with theorem proving —671B params —Solves 88.9% of problems on MiniF2F —Does ‘cold-start’ to break down complex proofs into subgoals before formal verification https://x.com/rowancheung/status/1917844254648324388

🚨This week’s top AI/ML research papers: – DeepSeek-Prover-V2 – The Leaderboard Illusion – Phi-4-reasoning Technical Report – Mem0 – X-Fusion – Softpick – RL for Reasoning in LLMs with One Training Example – ReasonIR – RL for LLM Reasoning Under Memory Constraints – https://x.com/TheAITimeline/status/1919155696655843474
