Drynx

Drynx

Decentralized, secure, verifiable system for statistical queries and machine learning on distributed datasets

Drynx allows to create privacy-preserving queries on encrypted datasets that are stored at different data providers who don't want to share the original data. Different types of statistical queries are possible, like average, standard deviation, linear and logistic regressions - all using homomorphic encryption, which means that the data is never shared in cleartext.

Homomorphic EncryptionSecure Multi-Party Computation
Key facts
Maturity
PrototypeIntermediateMature
Support
C4DT
Retired
Lab
Unknown
  • Presentation
  • Demo
  • C4DT work
  • Research papers

Data sharing has become of primary importance in many domains such as big-data analytics, economics and medical research, but remains difficult to achieve when the data are sensitive. In fact, sharing personal information requires individuals’ unconditional consent or is often simply forbidden for privacy and security reasons. In this paper, we propose Drynx, a decentralized system for privacy-conscious statistical analysis on distributed datasets.


Drynx relies on a set of computing nodes to enable the computation of statistics such as standard deviation or extrema, and the training and evaluation of machine-learning models on sensitive and distributed data. To ensure data confidentiality and the privacy of the data providers, Drynx combines interactive protocols, homomorphic encryption, zero-knowledge proofs of correctness, and differential privacy. It enables an efficient and decentralized verification of the input data and of all the system’s computations thus provides auditability in a strong adversarial model in which no entity has to be individually trusted.

Drynx is highly modular, dynamic and parallelizable. Our evaluation shows that it enables the training of a logistic regression model on a dataset (12 features and 600,000 records) distributed among 12 data providers in less than 2 seconds. The computations are distributed among 6 computing nodes, and Drynx enables the verification of the query execution’s correctness in less than 22 seconds.

Laboratory for Data Security

Laboratory for Data Security

Prof. Jean-Pierre Hubaux

Over the last 15 years, the Laboratory for Data Security has pioneered the areas of security and privacy in personalized health and mobile/wireless networks as well as tackled interpersonal privacy issues. On the first topic, they collaborate extensively with hospitals. Their core competencies are in applied cryptography, data protection techniques such as differential privacy, and wireless networking. Their research is funded by the Strategic Focus Area “Personalized Health and Related Technologies” of the ETH Domain, the Swiss Data Science Center and the Swiss National Science Foundation.

This page was last edited on 2022-09-28.