Apply individual variance tolerances to protect the identity of the originating data
Objective
The objective of this page is to explain how date and number values in a StreamEvent can be obfuscated using variance tolerances to protect sensitive information, such as date of birth, salary and age.
This method, similar to blurring, adjusts values within a specified range while preserving the original distribution and accuracy, ensuring privacy without compromising data utility.
This is more akin to blurring rather than obfuscation
Date variance
Each date value for a specified field will be varied by a random number of days, whilst maintaining the original variance, range and distribution.
This can useful where it would otherwise be possible to identify individuals by an exact match, such as date of birth.
Example
This code defines an obfuscation strategy named dateBucketing applied to the dateOfBirth field.
It uses date bucketing with a variance of 30, meaning that the actual date of birth will be obscured by randomly shifting the date within a 30-day range.
This protects the exact date while maintaining some level of accuracy.