Strategies to facilitate access to detailed geocoding information based on synthetic data

1 hour 5 mins,  943.32 MB,  MPEG-4 Video  640x360,  29.97 fps,  44100 Hz,  1.93 Mbits/sec
Share this media item:
Embed this media item:


About this item
Image inherited from collection
Description: Drechsler, J (Institut für Arbeitsmarkt-und Berufsforschung)
Thursday 1st December 2016 - 15:30 to 16:30
 
Created: 2016-12-19 16:07
Collection: Data Linkage and Anonymisation
Publisher: Isaac Newton Institute
Copyright: Drechsler, J
Language: eng (English)
Distribution: World     (downloadable)
Explicit content: No
Aspect Ratio: 16:9
Screencast: No
Bumper: UCS Default
Trailer: UCS Default
 
Abstract: In this seminar we investigate if generating synthetic data can be a viable strategy to provide access to detailed geocoding information for external researchers without compromising the confidentiality of the units included in the database. This research was motivated by a recent project at the Institute for Employment Research (IAB) that linked exact geocodes to the Integrated Employment Biographies, a large administrative database containing several million records. Based on these data we evaluate the performance of several synthesizers in terms of addressing the trade-off between preserving analytical validity and limiting the risk of disclosure. We propose strategies for making the synthesizers scalable for such large files, introduce analytical validity measures for the generated data and provide general recommendations for statistical agencies considering the synthetic data approach for disseminating detailed geographical information.
Available Formats
Format Quality Bitrate Size
MPEG-4 Video * 640x360    1.93 Mbits/sec 943.32 MB View Download
WebM 640x360    848.67 kbits/sec 404.03 MB View Download
iPod Video 480x270    493.76 kbits/sec 235.07 MB View Download
MP3 44100 Hz 253.47 kbits/sec 120.67 MB Listen Download
Auto (Allows browser to choose a format it supports)