Tags: liyancas/DeepSpeed
Tags
[ZeRO] Default disable elastic ckpt in stage 1+2 and reduce CPU memor… …y overhead during ckpt load (deepspeedai#1525) Co-authored-by: Olatunji Ruwase <[email protected]>
Various small documentation text improvements (deepspeedai#1665) Co-authored-by: Jeff Rasley <[email protected]>
Remove unused import of ssl.OP_ENABLE_MIDDLEBOX_COMPAT (deepspeedai#1601 )
Prevent creation of local temp directory (deepspeedai#1494) Co-authored-by: Olatunji Ruwase <[email protected]> Co-authored-by: Jeff Rasley <[email protected]>
[zero_to_fp32] adapt to 4-bytes alignment in z2 (deepspeedai#1372) Co-authored-by: Olatunji Ruwase <[email protected]>
Reducing the memory-overhead of creating model for multi-GPU run (dee… …pspeedai#1244) Co-authored-by: Jeff Rasley <[email protected]>
PreviousNext