Release v2.0 · awslabs/fast-differential-privacy

Adding support to DeepSpeed and FSDP through DP-ZeRO on multi-GPU
Adding a second approach to compute private gradient. This approach re-writes and extends the torch layers' back-propagation. New approach does not need ghost differentiation, may be slower (but improvable), and is much more generally applicable.
Removing param.summed_clipped_grad and replacing with param.private_grad
Adding ZeRO examples for image classification and GPT
Adding mixed precision training (fp16 and bf16)

Provide feedback