GitHub

Repository

FlashRT is a high-performance realtime inference engine for small-batch, latency-sensitive AI workloads. The flagship integration is production VLA control for Pi0, Pi0.5, GR00T N1.6, and Pi0-FAST. It also supports LLMs, e.g. Qwen3.6-27B.
C++
cuda, cuda-kernels, gr00t, gr00t-n1-6-3b, jetson, jetson-thor, motus, pi, pi05, qwen, qwen3-6, qwen3-6-27b, realtime-inference, realtime-vla, thor, vla, wan, wan22-5b