Github

代码库

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Python