Introduction to Local Differential Privacy
Almost all data statistics and analysis tasks fundamentally depend on a basic understanding of the distribution of the data. Local differential privacy (LDP) is proposed as a distributed variant of Differential Privacy which achieves privacy guarantees for each user locally and is independent of any assumptions on the third party servers. It has been used for tasks like frequency and mean value estimation, heavy hitters discovery,
Definition: -Differential Privacy
A randomized mechanism
where
Theorem: Sequential Composition
Let
Randomized Response
Randomized response is a mechanism for local differential privacy which was first proposed in a 1965 paper by S. L. Warner. At the time, the technique was intended to improve bias in survey responses about sensitive issues, and it was not originally proposed as a mechanism for differential privacy (which wouldn’t be invented for another 40 years). After differential privacy was developed, statisticians realized that this existing technique already satisfied the definition and it has become the de facto standard for LDP.
Users who possess a private answer
- Flip a coin
- If the coin is heads, answer the question truthfully
- If the coin is tails, flip another coin
- If the second coin is heads, answer “yes”; if it is tails, answer “no”
The randomization in this algorithm comes from the two coin flips. As in all other differentially private algorithms, this randomization creates uncertainty about the true answer, which is the source of privacy.