We show how multiple data-owning parties can collaboratively train several machine learning algorithms without jeopardizing the privacy of their sensitive data. In particular, we assume that every party knows specific features of an overlapping set of people. Using a secure implementation of an advanced hidden set intersection protocol and a privacy-preserving Gradient Descent algorithm, we are able to train a Ridge, LASSO or SVM model over the intersection of people in their data sets. Both the hidden set intersection protocol and privacy-preserving LASSO implementation are unprecedented in literature.

, , ,
Lecture Notes in Computer Science , Security and Cryptology
Centrum Wiskunde & Informatica, Amsterdam (CWI), The Netherlands

Veugen, P.J.M, Kamphorst, B, van de L’Isle, N, & van Egmond, M.B. (2021). Privacy-preserving coupling of vertically-partitioned databases and subsequent training with gradient descent. In Cyber Security Cryptography and Machine Learning (pp. 38–51). doi:10.1007/978-3-030-78086-9_3