Shumik
LLMs and Agents
Reinforcement Learning
GitHub
Welcome
Hi, welcome!
Posts
Batch TD(0) as Dynamic Programming on the Empirical MRP
March 9, 2026
ML Environment Engineering: Building Machines That Build Machines
March 5, 2026
Agents: From Inference-Time Scaffolding to Inference-Time Compute
March 1, 2026