Maxime Agostini, Sarus cofounder and CEO, was invited to the Databricks Data + AI Summit to speak about how to solve data access at scale with privacy technologies.
He presented how the right combination of differential privacy, synthetic data, and remote query execution brings scale and security to data governance. Legacy data masking and pseudonymization approaches are weak, and they require many manual, fallible decisions along the way.
Clean rooms solve many input privacy concerns, but they appear impractical for any data science work that goes beyond a single SQL query simple enough to write without first validating it on data.
Luckily, by combining synthetic data to explore the underlying data, remote execution of entire data science pipelines, and differential privacy to guarantee the privacy of outputs, we can deliver on the promise of both data security and governance efficiency!
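To make the "privacy of outputs" idea concrete, here is a minimal sketch of one textbook differential privacy technique, the Laplace mechanism applied to a counting query. It is an illustration only and not the Sarus implementation: the function names `laplace_noise` and `dp_count`, the sample data, and the choice of `epsilon` are all assumptions made for this example.

```python
import random


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponential draws with rate 1/scale
    # follows a Laplace distribution centered at 0 with the given scale.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def dp_count(rows, predicate, epsilon, rng=None):
    """Return a noisy count of the rows matching `predicate`.

    Hypothetical helper for illustration. Adding or removing one row
    changes a count by at most 1, so the sensitivity is 1 and the
    Laplace noise scale is 1 / epsilon.
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    rng = rng or random.Random()
    true_count = sum(1 for row in rows if predicate(row))
    return true_count + laplace_noise(1.0 / epsilon, rng)


if __name__ == "__main__":
    ages = [23, 35, 41, 58, 62, 70]  # made-up data
    noisy = dp_count(ages, lambda age: age >= 40, epsilon=1.0)
    print(f"Noisy count of people aged 40+: {noisy:.2f}")
```

A smaller `epsilon` means more noise and a stronger privacy guarantee; a larger one gives more accurate answers with a weaker guarantee. In the setup described above, this kind of noise would be applied to results computed remotely, so the analyst only ever sees the protected output.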
If you did not get a chance to attend, here is the video recording: