Admin · Nov 1, 2023 · < 1 min read

Evaluating the Utility and Privacy of Synthetic Breast Cancer Clinical Trial Data Sets


This paper, published in the Journal of Clinical Oncology: Clinical Cancer Informatics, describes a study evaluating synthetic data generation on diverse breast cancer clinical trial datasets. We present a quantitative methodology for evaluating how well analyses can be replicated using synthetic data. We evaluate two common and defensible privacy metrics: attribution disclosure and membership disclosure. We also compare the performance of three types of generative models. The results from replicating eight clinical trial analyses show that generative models can produce datasets with both high utility and high privacy. The study was performed with colleagues at the Ottawa Hospital and collaborators across Canada and the US.

Following the acquisition of Replica Analytics by Aetion, the generative AI technology previously known as Replica Synthesis is now Aetion® Generate and continues to create privacy-enhancing synthetic data.