Bucketing

StreamEvent date and number values can be obfuscated using individual variance tolerances to protect the identity of the originating data structure. This is more akin to blurring rather than obfuscation.


Date Variance

Each date value for a specified field will be varied by some random number of days, whilst maintaining the original variance, range and distribution. This can useful where it would otherwise be possible to identify individuals by an exact match, such as date of birth. It will for example, give an approximate value for age information.

Example

obfuscation:
  name: dateBucketing
  fields:
    dateOfBirth:
      date bucketing:
        variance: 30

Attributes

AttributeDescriptionData TypeRequired

variance

Maximum number of days to vary the source date

Integer

Default: 120


Number Variance

Each number can be varied by some random percentage, whilst maintaining the original variance, range and distribution. This can useful where it would otherwise be possible to identify individuals salary by an exact match. It will, for example, give an approximate value for salary information.

Example

obfuscation:
  name: numberBucketing
  fields:
    salary:
      number bucketing:
        variance: 0.25
    age: 
      number bucketing:
        variance: 0.10

Attributes

AttributeDescriptionData TypeRequired

variance

Variance multiplier to be applied to random masking process

Double

Default: 0.15

Last updated