Simulating data with identifiable information

doi: 10.53962/yp05-5fwc

Originally published on 2022-07-22 under a CC0 Public Domain Dedication

Authors

Summary

This NodeJS script allows for the generation of fake datasets with identifiable information. This attempt to generate synthetic datasets serves the purpose of estimating precision measures of procedures to detect identifiable information in datasets. Please note that the script has the NodeJS dependencies "@faker-js/faker" and "chance" - these can be installed using the supporting file in a folder and running "npm install".

Main file

index.js

Supporting files