Github

代码库

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
Python
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
C++
disaggregationinferencekvcachellmrdmasglangvllm