How are Data Matched?

Once a request has been approved and data sharing agreements have been completed, agency Data Stewards begin to pull the requested datasets.

These datasets are then separated into two files. One file is used for matching and includes identifying fields such as name and date of birth. The other file fulfills the request and contains demographic data, the requested records, and similar content. The two files are sent separately to the Data Integration Hub, which matches the records and creates a fake ID matrix.
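The split described above can be illustrated with a minimal Python sketch. It is not the Data Stewards' actual tooling. The field names, the `split_dataset` function, and the random `record_key` that ties the two files together are all assumptions made for illustration; the article does not say how the two files are kept in step.

```python
import secrets

# Hypothetical identifying fields; the article only says "name, date of birth, etc."
MATCH_FIELDS = ("name", "date_of_birth")


def split_dataset(records):
    """Split each record into a matching row and a content row.

    The two rows share only a random record key (an assumption, not
    something the article specifies).
    """
    match_file, content_file = [], []
    for record in records:
        key = secrets.token_hex(8)  # random, carries no meaning
        match_row = {"record_key": key}
        content_row = {"record_key": key}
        for field, value in record.items():
            # Identifying fields go to the matching file, everything else
            # goes to the file that fulfills the request.
            target = match_row if field in MATCH_FIELDS else content_row
            target[field] = value
        match_file.append(match_row)
        content_file.append(content_row)
    return match_file, content_file
```

In this sketch the content file never holds a name or date of birth, so the two files can travel separately without either one being both identifying and informative.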

The files used to match data are destroyed once the linkage has occurred and generic group numbers and fake ID numbers are in place. The process protects individual privacy by separating the data at all stages of the process, by using meaningless fake record numbers, and by destroying the data used to create the links.
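As a rough sketch of the linkage step, the Python below assigns one random fake ID per matched person, swaps it into the content records, and then discards the matching data. Everything specific here is an assumption for illustration: the exact-match rule on name and date of birth, the `record_key` field joining each matching row to its content row, the function names, and the lookup table standing in for the fake ID matrix. The Data Integration Hub's real matching method is not described in the article.

```python
import secrets


def build_fake_id_lookup(match_files):
    """Map (agency, record_key) to a random fake ID shared by matching people."""
    fake_id_by_person = {}
    lookup = {}
    for agency, rows in match_files.items():
        for row in rows:
            # Assumed rule: exact match on normalized name plus date of birth.
            person = (row["name"].strip().lower(), row["date_of_birth"])
            if person not in fake_id_by_person:
                # Random, so the ID says nothing about the person.
                fake_id_by_person[person] = secrets.token_hex(8)
            lookup[(agency, row["record_key"])] = fake_id_by_person[person]
    return lookup


def link_and_deidentify(match_files, content_files):
    """Replace record keys with fake IDs, then discard the matching data."""
    lookup = build_fake_id_lookup(match_files)
    # Stand-in for destroying the matching files once linkage is done.
    match_files.clear()
    deidentified = {}
    for agency, rows in content_files.items():
        deidentified[agency] = [
            {
                "fake_id": lookup[(agency, row["record_key"])],
                **{k: v for k, v in row.items() if k != "record_key"},
            }
            for row in rows
        ]
    return deidentified
```

Records from different agencies that describe the same person end up with the same fake ID, which is what lets an analyst join them without ever seeing a name or date of birth.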

Data recipients receive the matched, deidentified records and the fake ID matrix for analysis.

P20 WIN