Synthetic identifying information for 100,000 individuals: A pseudo-population

doi: 10.53962/wz7c-s5wc

Originally published on 2022-08-12 under a CC0 Public Domain Dedication

Authors

Summary

This is a principal dataset of synthetic identifying information. We created it to test the precision of later-stage mechanisms that retrieve identifying information. The dataset contains 100,000 fake individuals and can serve as a pseudo-population to sample from. For details on how the data was generated, see the supporting documentation.
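
As a rough illustration of using the dataset as a pseudo-population, the sketch below draws a simple random sample of rows without replacement. It is not part of the published materials: the function name sample_pseudo_population is hypothetical, and the code assumes only that the main file is a UTF-8 CSV whose first line is a header. The column names are not listed on this page, so the code does not rely on any of them.

```python
import csv
import random


def sample_pseudo_population(csv_path, k, seed=None):
    """Return k distinct rows drawn uniformly at random from a CSV file.

    Hypothetical helper for illustration. Assumes the first line of the
    file is a header row; each returned row is a dict keyed by column name.
    """
    with open(csv_path, newline="", encoding="utf-8") as f:
        rows = list(csv.DictReader(f))
    if k > len(rows):
        raise ValueError(f"requested {k} rows but the file has only {len(rows)}")
    # A dedicated Random instance keeps the draw reproducible for a given
    # seed without touching the global random state.
    return random.Random(seed).sample(rows, k)


# Example call (assumes the main file has been downloaded to the working directory):
# sample = sample_pseudo_population("synthetic_principal.csv", k=1000, seed=42)
```

Fixing the seed makes the draw repeatable, which helps when comparing retrieval precision across runs on the same sample.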

Main file

synthetic_principal.csv

Supporting files