Image created with gemini-3.1-flash-image-preview with claude-sonnet-4-5. Image prompt: Flat cartoon illustration of a friendly coral-red lobster mascot character centered on dark charcoal background, one claw replaced with small silver mechanical robotic arm with visible joints, holding large white speech bubble containing ‘ROBOTS’ text in black Helvetica font, minimal geometric circuit board patterns in faint cyan in background, clean outlines, kawaii mascot style, high contrast, web interface aesthetic.

Introducing DreamZero 🤖🌎 from @nvidia > A 14B “World Action Model” that achieves zero-shot generalization to unseen tasks & few-shot adaptation to new robots > The key? Jointly predicting video & actions in the same diffusion forward pass Project Page: https://x.com/jang_yoel/status/2019083437265867057

New milestone: we trained a robot foundation model on a world model backbone, and enabled zero-shot, open-world prompting capability for new verbs, nouns, and environments. If the world model can “”dream”” the right future in pixels, then the robot can execute well in motors. We”” https://x.com/DrJimFan/status/2019112603637920237

📢 New paper from GEAR team @NVIDIARobotics We released DreamZero, a World Action Model that turns video world models into zero-shot robot policies. Built on a pretrained video diffusion backbone, it jointly predicts future video frames and actions. 🌐”” https://x.com/yukez/status/2019096072690553112

Introducing NVIDIA Cosmos Policy for Advanced Robot Control https://huggingface.co/blog/nvidia/cosmos-policy-for-robot-control

DreamZero: World Action Models are Zero-shot Policies
https://dreamzero0.github.io/

Jim Fan on X: “The Second Pre-training Paradigm” / X
https://x.com/DrJimFan/status/2018754323141054786

Website: https://t.co/2YwjQs3JMC Robot execution demos across various verbs, nouns, and environments: https://t.co/loUZXZODcR The model is open-source! https://x.com/DrJimFan/status/2019112605315637451

Planning is one of the most exciting uses of world models, but existing planners struggle on long horizons. Introducing GRASP: a fast gradient-based planner for world models that outperforms prior methods on long-horizon tasks. Two key ideas: 1.jointly optimize actions and”” https://x.com/_amirbar/status/2019903658792497482

tl;dr New planner for world models! GRASP: gradient-based, stochastic, parallelized. Long range planning for world models has always been an issue. 0th order methods like CEM/MPPI dominate, but have degrading performance at longer contexts or higher-dimensional actions. We”” https://x.com/michaelpsenka/status/2019870377032503595

Robbyant has announced LingBot-VLA: an open-source Vision-Language-Action model – Pretrained on ~20k hours of real-world dual-arm robot data – Strong generalization across 9 embodiments – Improves consistently with more data – Claims outperformance over π₀.₅, GR00T N1.6 &”” https://x.com/TheHumanoidHub/status/2017337216054575513

World Model meets robot policy! Robbyant’s LingBot-VA: unifies video world modeling and robotic policy learning. – A single model generates both future video and the actions to make it real. – Long-term memory enables long-horizon tasks. – Claims significant outperformance over”” https://x.com/TheHumanoidHub/status/2017638555741552672

self-driving <as a 2D robot with a low-dim action space that focused mostly on avoidance rather than interaction> will reach real-world impact faster than anything else. the really cool part is that the world model isn’t just about videos; it’s about modeling continuous,”” https://x.com/sainingxie/status/2019841784990351381

Humanoid whole-body control ASI benchmark”” https://x.com/TheHumanoidHub/status/2017293983115092168

Next Time You Come In, You Come Heavy Or Not At All”” https://x.com/adcock_brett/status/2016723547360809011

Similar to a human, Helix can use its hips to close a drawer and kick up the dishwasher door”” https://x.com/adcock_brett/status/2017654710778663399

Solve this in under 5 minutes and I’ll offer you $500k/year in cash plus several million in equity I’m building a Computer-Use team, goal is to use computers better than humans No experience or PhD needed Instructions: 1. Solve all 30 challenges on this website in under 5″” https://x.com/adcock_brett/status/2018417226895028414

This week we shared a detailed report on Helix 02 It explains how we’re working toward general robotics, for those who want to go deeper At the highest level of abstraction, our goal is to give AI a body”” https://x.com/adcock_brett/status/2016919225643008313

UPDATE: zero people have solved this”” https://x.com/adcock_brett/status/2018919553963880613

The Helix team at Figure spent the last 12 months hitting wall after wall on what seemed like a simple problem: How do we give our AI model, Helix, control of the entire humanoid body (pixels in; motor torques out)? Core to our belief is shipping fully autonomous robots that”” https://x.com/adcock_brett/status/2016743751088263238

🎙️ @stepjamUK is the founder and CEO of @Neuracore_AI, and Assistant Professor of Robot Learning at Imperial College London: In this episode, Stephen shares his path from growing up in Wales to spending a decade at Imperial, a postdoc at Berkeley, and eventually founding”” https://x.com/IlirAliu_/status/2016875729775145087

1.7× faster inverse kinematics in pure Python, GPU-accelerated! Fully open-source and built for scale [📍Bookmark for later] PyRoki, a modular toolkit for robot kinematic optimization, supporting – inverse kinematics – trajectory optimization, and – motion retargeting. Built”” https://x.com/IlirAliu_/status/2017309072165425316

3D printed differential robot arm wrist. [👇 GitHub Link – Open Source ] High demand, so they open sourced it early. A differential mechanism released as a test fixture for future robots. Built around their Spectral micro BLDC driver. Parts, STL files, example code are”” https://x.com/IlirAliu_/status/2018609427117465775

A collection of Python robotics algorithms (localization, SLAM, path planning, motion planning). (📍GitHub ) Python sample codes and textbook for robotics algorithms. GitHub: https://t.co/4NRf4VCkZP —- Weekly robotics and AI insights. Subscribe free: https://x.com/IlirAliu_/status/2018404604556513504

A motorized Almond Coupling, or Bent-Arm Joint. No supports or hardware needed. (📍 Link to Makerworld) It was invented and patented around 1885 as a right angle transmission coupling for steam and gas engines. 📍 https://t.co/IEslL2rhyI — Weekly robotics and AI insights.”” https://x.com/IlirAliu_/status/2018247037649637711

A peek inside Physical Intelligence, the startup building Silicon Valley’s buzziest robot brains | TechCrunch https://techcrunch.com/2026/01/30/physical-intelligence-stripe-veteran-lachy-grooms-latest-bet-is-building-silicon-valleys-buzziest-robot-brains/

AI in robotics gets all the attention right now, but sometimes the most interesting work is very practical. Viet built a small vision system that counts potatoes on a conveyor belt. No giant dataset. No huge model. Just a clear problem and a smart setup. He used Ultralytics’”” https://x.com/IlirAliu_/status/2017522263210312184

Building artificial life forces a new appreciation for biology.”” https://x.com/TheHumanoidHub/status/2017804851422396426

Chris Paxton on X: “New blog post: how do we quantify progress in robotics? Benchmarks have made a huge difference in other areas of AI, but we don’t have an equivalent to imagenet or SWE-Bench for robotics. Why not and what are people doing about it? Simulations, distributed evaluation, and https://t.co/KujXPHN9YG&#8221; / X
https://x.com/chris_j_paxton/status/2016872870001963433

Engineers at XPENG developed an RL pipeline to achieve a natural walking gait for the IRON humanoid, tailoring both the data and the algorithm to adapt to the stiffness of its lattice skin.”” https://x.com/TheHumanoidHub/status/2018375680837443631

How do you teach a robot to handle complex, multi-step tasks, without training it for each one? [Github ⬇️] The team behind ReKep shows that robots can perform bimanual, in-the-wild tasks by reasoning over keypoint constraints: Generated on the fly using vision and language”” https://x.com/IlirAliu_/status/2018971869236310248

HumanX: a scalable framework that converts single monocular human videos into agile, generalizable interaction skills for humanoid robots without task-specific rewards. Core parts: – XGen: retargets human motion + synthesizes diverse physically plausible training data (via”” https://x.com/TheHumanoidHub/status/2018922472616391029

moltbook but real robots 😨”” https://x.com/TheHumanoidHub/status/2017721951595466845

Things didn’t go entirely according to plan, when XPENG’s latest generation IRON robot made a striking public appearance at a shopping mall.”” https://x.com/TheHumanoidHub/status/2017646098136141858

XPENG plans to enter mass production of IRON humanoids this year.”” https://x.com/TheHumanoidHub/status/2017700729876877313

AGI stuck in a box will always rely on humans to act in the physical world In the extreme, they will boss around humans Humanoid robots are the ultimate deployment vector for AGI”” https://x.com/adcock_brett/status/2018770644993978433

Reinforcement Learning for Active Perception in Autonomous Navigation. [📍GitHub & Paper ] Most robots navigate as if their cameras were nailed in place. But perception is not passive. Animals move their heads and eyes constantly to decide where to go next. Robots should do”” https://x.com/IlirAliu_/status/2018762226170016109

Robust humanoid perceptive locomotion is still underexplored. Especially when different cameras see different terrains, paths get narrow, and payloads disturb balance… Introduce RPL, tackling this with one unified policy: • Challenging terrains (slopes, stairs and stepping”” https://x.com/Yuanhang__Zhang/status/2019092752240181641

Tired of teleoperation? One human video → 1,000s of robot demos. (📍GitHub ) Scaling Robot Data Without Dynamics Simulation or Robot Hardware Real2Render2Real (R2R2R) is a new way to scale robot data without physics simulation or hardware. You take a phone scan + a single”” https://x.com/IlirAliu_/status/2017884655869976975

Hard to know which X articles are valuable, but this is a good summary of the significance of world modeling by a distinguished scientist and robot expert NVIDIA”” https://x.com/emollick/status/2018774863734075878

Leave a Reply

Trending

Discover more from Ethan B. Holland

Subscribe now to keep reading and get access to the full archive.

Continue reading