代码库
A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind.
Python
torch_remat fine-grained activation checkpointing API
Python
Configuration classes enabling type-safe PyTorch configuration for Hydra apps
Python
This module defines a type system for distributed training code, based off of JAX's sharding in types, but adapted for the PyTorch ecosystem.
Python