Anonymize dataset

Anonymizing your dataset is a crucial step to ensure the privacy of the information it contains. Follow these steps to effectively anonymize your dataset using the VEIL.AI Anonymization Engine

Step-by-Step Guide to Anonymization

Select the Dataset: Begin by selecting the dataset you wish to anonymize from the drop down list of available datasets. This dataset will already have default parameters associated with it.

Set Parameters:

Review and Adjust Default Parameters: Adjust the epsilon (ε) and k values:

Epsilon (ε): Controls the trade-off between data privacy and accuracy. A lower epsilon value increases privacy by adding more noise, while a higher epsilon value maintains greater accuracy but offers less privacy.
k: Ensures that each record is indistinguishable from at least k−1 others only when quasi-identifiers have been defined under Dataset Variables. k-Anonymity operates by grouping records on those quasi-identifiers so that no single record can be singled out—if no quasi-identifiers are set, this parameter will have no effect.

Provide Additional Parameters:

Result ID: Enter a unique result ID for the synthetic dataset for tracking and reference purposes.

Anonymize the Dataset: Once you are satisfied with the settings, click the 'Anonymize' button to start the anonymization process. Depending on the size of the dataset, this might take some time as it also initiates the risk analysis and exploratory quality analysis.

The anonymization, risk analysis, and quality analysis are all run as background jobs. This allows you to continue working while the processes complete. You can monitor the progress of these tasks under the 'Tasks' section of the application.

Anonymize dataset Step-by-Step Guide to Anonymization