Google releases differential privacy pipeline for Python

Thanusri swetha J May 13, 2022 |10:00 AM Technology

PipelineDP allows datasets containing personal information to be aggregated in a way that preserves the privacy of individuals[1]. Google is extending differential privacy capabilities to the Python language, with an open source tool for creating pipelines that aggregate data containing personal information in a way that preserves the privacy of individuals.

PipelineDP, developed in partnership with OpenMined and accessible from the project website, is still in an experimental stage. With differential privacy, useful insights and services can be provided without revealing any information about individuals. PipelineDP follows the 2019 launch of an open source version of Google’s foundational differential privacy library, which works with the C++, Go, and Java languages. [2]

Figure 1. Google releases differential privacy pipeline for Python

Figure 1 shows developers, researchers, and companies can use the new Python library to build applications with privacy technology that enables them to gain insights and observe trends from datasets while protecting and respecting individual privacy, Google said. PipelineDP can be used with the Apache Spark and Apache Beam frameworks for data processing. It already has enabled users to begin experimenting with new use cases, such as showing a website’s most-visited pages on a per country basis in an aggregated, anonymized way.

Google also is releasing a differential privacy tool to allow practitioners to visualize and tune parameters used to produce differentially private information. In addition, Google researchers have published a paper that shares techniques for scaling differential privacy to datasets of a petabyte or more. [3]

Google also is releasing a differential privacy tool to allow practitioners to visualise and tune parameters used to produce differentially private information. In addition, Google researchers have published a paper that shares techniques for scaling differential privacy to datasets of a petabyte or more. [4]

Note that PipelineDP is still experimental and is subject to change. At the moment, the project developers do not recommend using it in production systems and it's not thoroughly tested yet. Google and OpenMined teams are looking to add more functionalities and improve reliability in the near future. [5]

References:
  1. https://www.infoworld.com/article/3648415/google-releases-differential-privacy-pipeline-for-python.html
  2. https://www.north-face-outlet.com.co/2022/01/31/google-releases-differential-privacy-pipeline-for-python/
  3. https://www.infoworld.com/article/3648415/google-releases-differential-privacy-pipeline-for-python.html
  4. https://www.arnnet.com.au/article/694832/google-releases-differential-privacy-pipeline-python/
  5. https://www.infoq.com/news/2022/02/differential-privacy-python/
Cite this article:

Thanusri swetha J (2022), Google releases differential privacy pipeline for Python, Anatechmaz, pp. 117

Recent Post

Blog Archive