Data Obfuscation
Data obfuscation protects sensitive information in datasets. This guide shows how to obfuscate data with DATAMIMIC in two scenarios:
- updating specific columns in a database and
- obfuscating records from a CSV file.

Steps
- Clone the demo Anonymization from the Demo Store.
  - This project includes predefined models and configurations to help you get started quickly.
- Set up the environments.
  - The DATAMIMIC base model references two predefined environments:
    - sourceDB: The database from which the data is read.
    - targetDB: The database to which the obfuscated records are written.
  - For more details on setting up and managing environments, refer to the Environments documentation.
- Switch to the file 2_generate.
  - This file demonstrates the three data obfuscation variants described below.

Variant 1: Database Record Obfuscation
This variant retrieves records from the CUSTOMER table of sourceDB and obfuscates the name column by appending the string _mask. All other columns remain unchanged. The obfuscated data is written to targetDB.
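
The effect of this variant can be illustrated with a minimal Python sketch. This is not DATAMIMIC model syntax; it only mirrors the described behavior using in-memory SQLite databases as stand-ins for sourceDB and targetDB. The columns id and city, the sample rows, and the helper names are assumptions made for illustration.

```python
import sqlite3


def make_db(rows):
    """Create an in-memory database with a CUSTOMER table (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE CUSTOMER (id INTEGER PRIMARY KEY, name TEXT, city TEXT)"
    )
    conn.executemany("INSERT INTO CUSTOMER VALUES (?, ?, ?)", rows)
    conn.commit()
    return conn


def mask_names(source, target):
    """Copy CUSTOMER rows from source to target, appending '_mask' to name only."""
    rows = source.execute("SELECT id, name, city FROM CUSTOMER").fetchall()
    target.executemany(
        "INSERT INTO CUSTOMER (id, name, city) VALUES (?, ?, ?)",
        [(row_id, f"{name}_mask", city) for row_id, name, city in rows],
    )
    target.commit()


# Stand-ins for the sourceDB and targetDB environments.
source_db = make_db([(1, "Alice", "Berlin"), (2, "Bob", "Hanoi")])
target_db = make_db([])

mask_names(source_db, target_db)
print(target_db.execute("SELECT * FROM CUSTOMER ORDER BY id").fetchall())
```

The source table is only read, so the original records stay intact while the target receives the masked copies.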
Variant 2: Database Record Obfuscation with Converter
Variant 2 uses built-in converters to anonymize the existing values of the keys full_name, email, and tc_creation_src, i.e., the cell values of those columns.
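
As a rough illustration of the converter idea, the following Python sketch maps each key to a function that transforms the existing value. It does not reproduce DATAMIMIC's built-in converters or their names; the three converter functions, the choice of converter per key, and the sample record are all assumptions.

```python
import hashlib
from typing import Any, Callable, Dict

Converter = Callable[[str], str]


def hash_converter(value: str) -> str:
    """Replace the value with its SHA-256 hex digest."""
    return hashlib.sha256(value.encode("utf-8")).hexdigest()


def mask_converter(value: str) -> str:
    """Replace every character with '*', keeping the original length."""
    return "*" * len(value)


def constant_converter(value: str) -> str:
    """Replace the value with a fixed placeholder."""
    return "ANONYMIZED"


def apply_converters(
    record: Dict[str, Any], converters: Dict[str, Converter]
) -> Dict[str, Any]:
    """Convert only the keys that have a converter; keep all others unchanged."""
    return {
        key: converters[key](value) if key in converters else value
        for key, value in record.items()
    }


# Hypothetical record; only the three key names come from the text above.
record = {
    "id": 7,
    "full_name": "Alice Example",
    "email": "alice@example.com",
    "tc_creation_src": "import_job",
}
converters = {
    "full_name": mask_converter,
    "email": hash_converter,
    "tc_creation_src": constant_converter,
}

anonymized = apply_converters(record, converters)
print(anonymized)
```

Hashing is deterministic, so equal inputs produce equal outputs, whereas masking and constants discard the original value entirely.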
Variant 3: CSV File Obfuscation with Multiple Approaches
This variant shows how to obfuscate person records from a CSV file. Several options are used to update or overwrite the original values, and the result is written to a new file named ObfuscateCSV.csv.
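
The same flow can be sketched in plain Python with the standard csv module. Again, this is not DATAMIMIC model syntax. The input file name persons.csv, the column names, and the three obfuscation options chosen here (overwrite with a constant, update by appending a suffix, replace with a truncated hash) are assumptions; only the output file name ObfuscateCSV.csv comes from the text above.

```python
import csv
import hashlib
import tempfile
from pathlib import Path


def obfuscate_row(row):
    """Return a copy of the row with three hypothetical obfuscation options applied."""
    masked = dict(row)
    masked["given_name"] = "XXXX"  # overwrite with a constant
    masked["family_name"] = row["family_name"] + "_mask"  # update the existing value
    digest = hashlib.sha256(row["email"].encode("utf-8")).hexdigest()
    masked["email"] = digest[:16]  # replace with a truncated hash
    return masked


def obfuscate_csv(source: Path, target: Path) -> int:
    """Read person records from source, write obfuscated rows to target."""
    count = 0
    with source.open(newline="", encoding="utf-8") as src, \
            target.open("w", newline="", encoding="utf-8") as dst:
        reader = csv.DictReader(src)
        writer = csv.DictWriter(dst, fieldnames=reader.fieldnames)
        writer.writeheader()
        for row in reader:
            writer.writerow(obfuscate_row(row))
            count += 1
    return count


# Build a small sample input in a temporary directory.
work_dir = Path(tempfile.mkdtemp())
source_file = work_dir / "persons.csv"
source_file.write_text(
    "given_name,family_name,email,age\n"
    "Alice,Example,alice@example.com,34\n",
    encoding="utf-8",
)
target_file = work_dir / "ObfuscateCSV.csv"

written = obfuscate_csv(source_file, target_file)
print(written, "record(s) written to", target_file.name)
```

Columns without an obfuscation rule, such as age in this sample, pass through unchanged, and the original file is never modified.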
Recap
- Review the demo Anonymization from the Demo Store.
- In the 2_generate file, review the data obfuscation variants to update specific database columns or to obfuscate records from a CSV file.
- Review the DATAMIMIC models 3-1-anon-person-constant and 3-2-anon-person-hash.xml for additional scenarios and obfuscation options.