Github

代码库

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
Jupyter Notebook
cluster-managementdeep-learningdistributedllamallama2llmmachine-learningmlopspytorch