Batch Clipping and Adaptive Layerwise Clipping for Differential Private Stochastic Gradient Descent

Nguyen, Toan N.; Nguyen, Phuong Ha; Nguyen, Lam M.; van Dijk, Marten

doi:10.48550/arXiv.2307.11939

T. N. Nguyen (Toan), P.H. Nguyen (Phuong Ha), L. M. Nguyen (Lam) and M.E. van Dijk (Marten)

2023-07-21

Each round in Differential Private Stochastic Gradient Descent (DPSGD) transmits a sum of clipped gradients, obfuscated with Gaussian noise, to a central server, which uses it to update a global model that often represents a deep neural network. Since the clipped gradients are computed separately, an approach we call Individual Clipping (IC), deep neural networks like ResNet-18 cannot use Batch Normalization Layers (BNL), which are a crucial component for achieving high accuracy. To utilize BNL, we introduce Batch Clipping (BC) where, instead of clipping single gradients as in the original DPSGD, we average and clip batches of gradients. Moreover, the model entries of different layers have different sensitivities to the added Gaussian noise. Therefore, Adaptive Layerwise Clipping (ALC) methods, where each layer has its own adaptively finetuned clipping constant, have been introduced and studied, but so far without rigorous DP proofs. In this paper, we propose a new ALC and provide rigorous DP proofs for both BC and ALC. Experiments show that our modified DPSGD with BC and ALC for CIFAR-10 with ResNet-18 converges while DPSGD with IC and ALC does not.
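The difference between the clipping schemes named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the noise scale `sigma * c`, and the use of fixed per-layer constants are assumptions made for illustration, and the rule by which ALC adapts its constants is not given in the abstract and is not shown here.

```python
import numpy as np

def clip(v, c):
    """Scale v so that its L2 norm is at most c."""
    norm = np.linalg.norm(v)
    return v * min(1.0, c / max(norm, 1e-12))

def individual_clipping(per_sample_grads, c, sigma, rng):
    """IC: clip each per-sample gradient, sum them, then add Gaussian noise."""
    total = sum(clip(g, c) for g in per_sample_grads)
    # Assumed noise calibration: standard deviation sigma * c per coordinate.
    return total + rng.normal(0.0, sigma * c, size=total.shape)

def batch_clipping(per_sample_grads, c, sigma, rng):
    """BC: average the batch first, clip the average once, then add noise."""
    mean_grad = np.mean(per_sample_grads, axis=0)
    clipped = clip(mean_grad, c)
    # Same assumed noise calibration as above.
    return clipped + rng.normal(0.0, sigma * c, size=clipped.shape)

def layerwise_clip(layer_grads, layer_constants):
    """Layerwise clipping: each layer's gradient is clipped with its own constant.

    The constants are plain inputs here; how they are adapted is not modeled.
    """
    return [clip(g, c) for g, c in zip(layer_grads, layer_constants)]

# Hypothetical toy data: a batch of two per-sample gradients in two dimensions.
rng = np.random.default_rng(0)
grads = np.array([[3.0, 4.0], [0.3, 0.4]])
noisy_ic = individual_clipping(grads, c=1.0, sigma=1.0, rng=rng)
noisy_bc = batch_clipping(grads, c=1.0, sigma=1.0, rng=rng)
```

In this sketch IC needs every per-sample gradient separately, whereas BC only needs the batch average, which is the property the abstract ties to compatibility with Batch Normalization Layers.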

Additional Metadata
Keywords	Machine learning
Persistent URL	doi.org/10.48550/arXiv.2307.11939
Organisation	Computer Security
Citation	Nguyen, T., Nguyen, P. H., Nguyen, L., & van Dijk, M. (2023). Batch Clipping and Adaptive Layerwise Clipping for Differential Private Stochastic Gradient Descent. doi:10.48550/arXiv.2307.11939
