Evaluating Data Linkage: Creating longitudinal synthetic data to provide a gold-standard linked dataset

1 hour 2 mins,  898.97 MB,  MPEG-4 Video  640x360,  29.97 fps,  44100 Hz,  1.93 Mbits/sec

About this item
Description: Dalton, T (University of St Andrews)
Thursday 20th October 2016 - 15:30 to 16:30
 
Created: 2016-11-02 15:59
Collection: Data Linkage and Anonymisation
Publisher: Isaac Newton Institute
Copyright: Dalton, T
Language: eng (English)
Distribution: World     (downloadable)
Explicit content: No
Aspect Ratio: 16:9
Screencast: No
 
Abstract: When we perform probabilistic data linkage on real-world data, we do not know the true linkage; if we did, we would not need to link the data. The success of a linkage approach is therefore difficult to evaluate. Small hand-linked datasets are often used as a ‘gold standard’ against which the linkage approach is evaluated. However, errors in the hand linkage, and the limited size and number of these datasets, do not allow for robust evaluation. This research focuses on the creation of longitudinal synthetic datasets for the domain of population reconstruction. In this talk I will cover the previous and current models we have created to achieve this and detail how we: define the desired behaviour in the model so as to avoid clashes between input distributions; verify the statistical correctness of the population; and initialise the model such that the starting population meets the temporal requirements of the desired behaviour. To conclude, I will outline the model’s intended use for linkage evaluation and its other potential uses, and take questions.
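As an illustration of what evaluating a linkage approach against a gold standard can look like, here is a minimal sketch. It assumes links are represented as pairs of record identifiers and scores a set of predicted links with precision, recall and F-measure. The function name `evaluate_linkage`, the record identifiers and the choice of metrics are assumptions made for this example and are not taken from the talk.

```python
def evaluate_linkage(predicted_links, true_links):
    """Score predicted record links against a gold-standard set of true links.

    Both arguments are iterables of (record_id_a, record_id_b) pairs.
    Returns a (precision, recall, f_measure) tuple.
    """
    predicted = set(predicted_links)
    truth = set(true_links)

    # Links the approach found that are also in the gold standard.
    true_positives = len(predicted & truth)

    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(truth) if truth else 0.0
    if precision + recall == 0:
        f_measure = 0.0
    else:
        f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


# Hypothetical gold standard, e.g. links known from a synthetic population.
true_links = {
    ("birth_1", "marriage_7"),
    ("birth_2", "marriage_9"),
    ("birth_3", "death_4"),
}

# Hypothetical output of a linkage approach: one of the three links is wrong.
predicted_links = {
    ("birth_1", "marriage_7"),
    ("birth_2", "marriage_8"),
    ("birth_3", "death_4"),
}

precision, recall, f_measure = evaluate_linkage(predicted_links, true_links)
```

With a synthetic population the full set of true links is known by construction, so scores of this kind would not depend on a small or error-prone hand-linked sample.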
Available Formats
Format          Quality   Bitrate            Size
MPEG-4 Video *  640x360   1.93 Mbits/sec     898.97 MB
WebM            640x360   602.74 kbits/sec   273.71 MB
iPod Video      480x270   498.26 kbits/sec   226.26 MB
MP3             44100 Hz  252.55 kbits/sec   114.69 MB