Saved in:
Bibliographic Details
Main Author: Francis, Paul
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2403.08463
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916158516494336
author Francis, Paul
author_facet Francis, Paul
contents SynDiffix is a new open-source tool for structured data synthesis. It has anonymization features that allow it to generate multiple synthetic tables while maintaining strong anonymity. Compared to the more common single-table approach, multi-table leads to more accurate data, since only the features of interest for a given analysis need be synthesized. This paper compares SynDiffix with 15 other commercial and academic synthetic data techniques using the SDNIST analysis framework, modified by us to accommodate multi-table synthetic data. The results show that SynDiffix is many times more accurate than other approaches for low-dimension tables, but somewhat worse than the best single-table techniques for high-dimension tables.
format Preprint
id arxiv_https___arxiv_org_abs_2403_08463
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle A Comparison of SynDiffix Multi-table versus Single-table Synthetic Data
Francis, Paul
Cryptography and Security
SynDiffix is a new open-source tool for structured data synthesis. It has anonymization features that allow it to generate multiple synthetic tables while maintaining strong anonymity. Compared to the more common single-table approach, multi-table leads to more accurate data, since only the features of interest for a given analysis need be synthesized. This paper compares SynDiffix with 15 other commercial and academic synthetic data techniques using the SDNIST analysis framework, modified by us to accommodate multi-table synthetic data. The results show that SynDiffix is many times more accurate than other approaches for low-dimension tables, but somewhat worse than the best single-table techniques for high-dimension tables.
title A Comparison of SynDiffix Multi-table versus Single-table Synthetic Data
topic Cryptography and Security
url https://arxiv.org/abs/2403.08463