Github

代码库

APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Oustanding Paper Honorable Mention
Python
efficient-trainingllmllm-trainingmemory-efficientoptimizer