SPINDLE

SPINDLE

Scalable, privacy-preserving, distributed learning on distributed datasets

SPINDLE allows to train and evaluate generalized linear models on datasets that are stored at different data providers who don't want to share the original data.

Homomorphic EncryptionSecure Multi-Party Computation
Key facts
Maturity
Support
C4DT
Retired
Lab
Unknown
  • Presentation
  • Details
  • App
  • Demo
  • C4DT work
  • Research papers

SPINDLE allows for some statistics on distributed data without sharing any data in clear. One kind of statistics it can do is called machine learning, which allows a program to make predictions based on a given set of data. It fits many use cases, such as determining the likeliness that some hospital patient has a disease, or automatically determine what an image represents.
SPINDLE can do all that while at the same time keeping the data private. It does so by using a new kind of cryptography, allowing to apply common mathematical operations on encrypted data and only revealing the result.

Example

Using homomorphic encryption in machine learning allows SPINDLE to both ensure the privacy of data and to make the machine learning computations. In the demonstrator, we present a use-case based around predicting whether a hospital patient is likely to have diabetes, based on the data of three different hospitals.

The training phase in machine learning usually never finishes, it does its computation repeatedly, with each round using the model created during the previous one, and hopefully improving it. In the demonstrator, you can set the number of wanted round. In order to improve the execution speed of the algorithm, each hospital first computes a local model. This local model is then merged with the other models, to obtain a network model, behaving as if it was computed on the aggregated data directly. Then, the querier can ask SPINDLE to predict using this network model.

Laboratory for Data Security

Laboratory for Data Security

Prof. Jean-Pierre Hubaux

Over the last 15 years, the Laboratory for Data Security has pioneered the areas of security and privacy in personalized health and mobile/wireless networks as well as tackled interpersonal privacy issues. On the first topic, they collaborate extensively with hospitals. Their core competencies are in applied cryptography, data protection techniques such as differential privacy, and wireless networking. Their research is funded by the Strategic Focus Area “Personalized Health and Related Technologies” of the ETH Domain, the Swiss Data Science Center and the Swiss National Science Foundation.

This page was last edited on 2022-09-28.