Asynchronous Robot Inference: Decoupling Action Prediction and Execution
Asynchronous inference decouples action prediction from execution in robotic policies, reducing runtime lag and enabling replanning with action chunks. The article describes a two-component architecture (PolicyServer and RobotClient) using gRPC to achieve ~2× speedups and continuous operation, and explains why sequential inference falls short.