DeepSpeed
Microsoft library for extreme-scale model training: ZeRO sharding of optimizer states, gradients, and parameters, plus pipeline parallelism and optimized inference kernels.
Why it is included
Core open-source component behind many large-model training recipes, complementing PyTorch.
Best for
Teams training or fine-tuning large models where per-GPU memory (optimizer states, gradients, parameters) is the bottleneck.
Strengths
- ZeRO stages 1–3: progressively shard optimizer states, gradients, and parameters across data-parallel ranks
- Battle-tested recipes for large-model training and fine-tuning
- Inference optimizations (fused kernels, tensor-parallel serving)
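The ZeRO stages are selected through DeepSpeed's JSON config file. A minimal sketch (the file name `ds_config.json` is conventional; batch sizes, stage choice, and CPU offload here are illustrative, not recommendations):

```json
{
  "train_micro_batch_size_per_gpu": 4,
  "gradient_accumulation_steps": 8,
  "fp16": { "enabled": true },
  "zero_optimization": {
    "stage": 2,
    "offload_optimizer": { "device": "cpu" }
  }
}
```

Stage 1 shards only optimizer states, stage 2 adds gradient sharding, and stage 3 shards the parameters themselves; the config is typically passed to `deepspeed.initialize(...)` or via the `deepspeed` launcher.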
Limitations
- Primarily targets NVIDIA CUDA ecosystems; adds an integration and testing burden to existing training stacks
Good alternatives
FSDP · Megatron-LM · Horovod
Related tools
AI & Machine Learning
- PyTorch: Deep learning framework with strong research-to-production paths.
- Ray: Distributed compute framework for Python: scale data loading, training, hyperparameter search, and online serving (Ray Serve).
- Accelerate: Hugging Face library to run PyTorch training on CPU, single GPU, multi-GPU, or TPU with minimal code changes.
- GPT-NeoX: EleutherAI framework and 20B-class models for training large autoregressive LMs with 3D parallelism; Apache-2.0 training stack.
- Axolotl: YAML-configured fine-tuning for LLMs: LoRA, QLoRA, FSDP, and many architectures on top of Hugging Face trainers.
- Unsloth: Optimized fine-tuning library claiming 2× faster LoRA/QLoRA with less VRAM via custom kernels and Hugging Face compatibility.
