Applying data synthesis for longitudinal business data across three countries

2Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.

Abstract

Data on businesses collected by statistical agencies are challenging to protect. Many businesses have unique characteristics, and distributions of employment, sales, and profits are highly skewed. Attackers wishing to conduct identification attacks often have access to much more information than for any individual. As a consequence, most disclosure avoidance mechanisms fail to strike an acceptable balance between usefulness and confidentiality protection. Detailed aggregate statistics by geography or detailed industry classes are rare, public-use microdata on businesses are virtually inexistant, and access to confidential microdata can be burdensome. Synthetic microdata have been proposed as a secure mechanism to publish microdata, as part of a broader discussion of how to provide broader access to such data sets to researchers. In this article, we document an experiment to create analytically valid synthetic data, using the exact same model and methods previously employed for the United States, for data from two different countries: Canada (Longitudinal Employment Analysis Program (LEAP)) and Germany (Establishment History Panel (BHP)). We assess utility and protection, and provide an assessment of the feasibility of extending such an approach in a cost-effective way to other data.

References Powered by Scopus

Some tests of specification for panel data:monte carlo evidence and an application to employment equations

19057Citations
N/AReaders
Get full text

Initial conditions and moment restrictions in dynamic panel data models

15550Citations
N/AReaders
Get full text

Another look at the instrumental variable estimation of error-components models

11817Citations
N/AReaders
Get full text

Cited by Powered by Scopus

30 Years of Synthetic Data

9Citations
N/AReaders
Get full text

Systematic Review of Generative Modelling Tools and Utility Metrics for Fully Synthetic Tabular Data

1Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Alam, M. J., Dostie, B., Drechsler, J., & Vilhuber, L. (2020). Applying data synthesis for longitudinal business data across three countries. Statistics in Transition New Series, 21(4), 212–236. https://doi.org/10.21307/STATTRANS-2020-039

Readers over time

‘20‘21‘22‘23‘2400.511.52

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 2

50%

Researcher 2

50%

Readers' Discipline

Tooltip

Computer Science 2

50%

Medicine and Dentistry 1

25%

Economics, Econometrics and Finance 1

25%

Article Metrics

Tooltip
Social Media
Shares, Likes & Comments: 1336

Save time finding and organizing research with Mendeley

Sign up for free
0