Akshay Parkhi's Weblog

Thursday, 19th February 2026

PPO vs VLM

Modern humanoid robots combine two fundamentally different kinds of intelligence:

[... 638 words]

GR00T N1.6 Fine-Tuning — Full Internal Deep Dive

GR00T N1.6 is NVIDIA’s Vision-Language-Action (VLA) model for humanoid robot control. After spending time digging through the internals, I’ve written a deep dive into exactly how fine-tuning works, from model architecture to gradient flow to the data pipeline.

[... 1,200 words]

GR00T Architecture: A Systems Engineering Breakdown

GR00T is not just a VLM. It is a Perception → Reasoning → Control generator stack.

[... 762 words]
