Archive for Sunday, 22nd February 2026

Sunday, 22nd February 2026

From Vision to Torques: How NVIDIA’s GR00T Stack Controls a Humanoid Robot

NVIDIA’s GR00T stack for humanoid robots has three layers: a Vision-Language-Action model that understands what to do, a whole-body controller that figures out how to move, and a physics simulator that validates it all before touching real hardware. Here’s how they connect.

[... 976 words]

5:19 pm / physical-ai

VLA → WBC → MuJoCo: Two Ways to Wire Up NVIDIA’s GR00T Humanoid Stack

There are two ways to wire up NVIDIA’s GR00T stack from vision-language all the way down to physics simulation: the official NVIDIA eval pipeline and a custom pipeline using the SONIC C++ binary. I’ve set up both. Here’s how they work and where they differ.

[... 674 words]

6:58 pm / physical-ai

M	T	W	T	F	S	S
						1
2	3	4	5	6	7	8
9	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28

Akshay Parkhi's Weblog

Sunday, 22nd February 2026

From Vision to Torques: How NVIDIA’s GR00T Stack Controls a Humanoid Robot

VLA → WBC → MuJoCo: Two Ways to Wire Up NVIDIA’s GR00T Humanoid Stack