Anonymize dataset

Anonymize dataset

Anonymizing your dataset is a crucial step to ensure the privacy of the information it contains. Follow these steps to effectively anonymize your dataset using the VEIL.AI Anonymization Engine

Step-by-Step Guide to Anonymization

  1. Select the Dataset: Begin by selecting the dataset you wish to anonymize from the drop down list of available datasets. This dataset will already have default parameters associated with it.
  1. Set Parameters:
  • Review and Adjust Default Parameters: Adjust the epsilon (ε) and k values:
    • Epsilon (ε): Controls the trade-off between data privacy and accuracy. A lower epsilon value increases privacy by adding more noise, while a higher epsilon value maintains greater accuracy but offers less privacy.
    • k: Ensures that each individual in a dataset cannot be distinguished from at least k−1 others with similar attributes.
  • Provide Additional Parameters:
    • Result ID: Enter a unique result ID for the synthetic dataset for tracking and reference purposes.
  1. Anonymize the Dataset: Once you are satisfied with the settings, click the 'Anonymize' button to start the anonymization process. Depending on the size of the dataset, this might take some time as it also initiates the risk analysis and exploratory quality analysis.
The anonymization, risk analysis, and quality analysis are all run as background jobs. This allows you to continue working while the processes complete. You can monitor the progress of these tasks under the 'Tasks' section of the application.