Once a request has been approved and data sharing agreements have been completed, agency Data Stewards assemble the requested datasets.
Each dataset is split into two files. The first is used only for matching and contains identifying fields such as name and date of birth. The second fulfills the request and contains content such as demographic data and the requested records. The two files are sent separately to the Data Integration Hub, which matches the records, removes the identifiers, and creates a fake ID matrix.
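The split described above can be sketched in a few lines of Python. This is a minimal illustration, not the Data Stewards' actual tooling: the field names, the sample records, and the `row_key` column (an assumed per-agency row number that lets the Hub reattach the two files) are all hypothetical, since the process description does not specify a schema.

```python
# Hypothetical schema: the real list of identifying fields is set by each agency.
IDENTIFYING_FIELDS = {"name", "date_of_birth"}
LINK_KEY = "row_key"  # assumed row number shared by the two files


def split_dataset(records):
    """Split records into an identifier file and a content file."""
    identifier_file, content_file = [], []
    for row_key, record in enumerate(records, start=1):
        ident = {LINK_KEY: row_key}
        content = {LINK_KEY: row_key}
        for field, value in record.items():
            # Identifying fields go to the matching file, everything else
            # goes to the file that fulfills the request.
            target = ident if field in IDENTIFYING_FIELDS else content
            target[field] = value
        identifier_file.append(ident)
        content_file.append(content)
    return identifier_file, content_file


# Made-up sample records for illustration only.
records = [
    {"name": "Ana Diaz", "date_of_birth": "1990-04-12", "program": "A", "county": "North"},
    {"name": "Lee Wong", "date_of_birth": "1985-11-03", "program": "B", "county": "South"},
]
identifier_file, content_file = split_dataset(records)
```

In this sketch the only thing the two files have in common is the row number, so neither file on its own ties a name to a program record.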
The files used for matching are destroyed once the linkage is complete and the generic group numbers and fake ID numbers are in place. The process protects individual privacy in three ways: it keeps identifying data separate from the requested data at every stage, it uses meaningless fake record numbers, and it destroys the data used to create the links.
Data recipients receive the matched, de-identified records and the fake ID matrix for analysis.
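As a rough illustration of the Hub's side of the process, the sketch below links identifier files from two agencies, assigns random fake IDs, and builds a fake ID matrix. Several things here are assumptions rather than the Hub's documented method: matching is done by exact comparison of a normalized name and date of birth (real record linkage is often probabilistic), the matrix is modeled as one row per matched individual with one column per agency, the `row_key` column and the function name `link_and_deidentify` are hypothetical, and each person is assumed to appear at most once per agency. Generic group numbers are not modeled.

```python
import secrets


def link_and_deidentify(identifier_files, content_files):
    """Both arguments map an agency name to a list of row dicts."""
    people = {}    # match key -> {agency: fake_id}; used only during linkage
    released = {}  # agency -> de-identified content rows
    for agency, id_rows in identifier_files.items():
        fake_by_row = {}
        for row in id_rows:
            # Assumed matching rule: exact match on normalized name and DOB.
            match_key = (row["name"].strip().lower(), row["date_of_birth"])
            fake_id = secrets.token_hex(8)  # random, carries no meaning
            people.setdefault(match_key, {})[agency] = fake_id
            fake_by_row[row["row_key"]] = fake_id
        # Swap the row number for the fake ID in the content rows.
        released[agency] = [
            {"fake_id": fake_by_row[row["row_key"]],
             **{k: v for k, v in row.items() if k != "row_key"}}
            for row in content_files[agency]
        ]
    agencies = list(identifier_files)
    # One matrix row per individual, one column per agency (None if absent).
    matrix = [[ids.get(agency) for agency in agencies] for ids in people.values()]
    people.clear()  # stand-in for destroying the data used to create the links
    return released, agencies, matrix


# Made-up sample input for illustration only.
identifier_files = {
    "health": [{"row_key": 1, "name": "Ana Diaz", "date_of_birth": "1990-04-12"}],
    "education": [
        {"row_key": 1, "name": "ana diaz ", "date_of_birth": "1990-04-12"},
        {"row_key": 2, "name": "Lee Wong", "date_of_birth": "1985-11-03"},
    ],
}
content_files = {
    "health": [{"row_key": 1, "visits": 3}],
    "education": [{"row_key": 1, "grade": 11}, {"row_key": 2, "grade": 9}],
}
released, agencies, matrix = link_and_deidentify(identifier_files, content_files)
```

Under these assumptions, a recipient could use the matrix to see that a health record and an education record belong to the same individual without ever seeing a name or date of birth.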