☆ 4.6 Article

Privacy-preserving analysis of time-to-event data under nested case-control sampling

STATISTICAL METHODS IN MEDICAL RESEARCH (2023)

期刊

STATISTICAL METHODS IN MEDICAL RESEARCH

卷 -, 期 -, 页码 -

出版社

SAGE PUBLICATIONS LTD

DOI: 10.1177/09622802231215804

关键词

Survival analysis; data disclosure; privacy-preserving analysis; specimen pooling

类别

Health Care Sciences & Services Mathematical & Computational Biology Medical Informatics Statistics & Probability

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Analyses of distributed data networks of rare diseases are challenging due to privacy and ethical concerns. We propose a privacy-preserving data analysis technique by pooling individual records of covariates at recruiting sites. Our method shows good performance in simulations and analysis of real data.

Analyses of distributed data networks of rare diseases are constrained by legitimate privacy and ethical concerns. Analytical centers (e.g. research institutions) are thus confronted with the challenging task of obtaining data from recruiting sites that are often unable or unwilling to share personal records of participants. For time-to-event data, recently popularized disclosure techniques with privacy guarantees (e.g. , etc.) are generally computationally expensive or inaccessible to applied researchers. To perform the widely used Cox proportional hazards regression, we propose an easy-to-implement privacy-preserving data analysis technique by pooling (i.e. aggregating) individual records of covariates at recruiting sites under the nested case-control sampling framework before sharing the pooled nested case-control subcohort. We show that the pooled hazard ratio estimators, under the pooled nested case-control subsamples from the contributing sites, are maximum likelihood estimators and provide consistent estimates of the individual level full cohort HRs. Furthermore, a sampling technique for generating pseudo-event times for individual subjects that constitute the pooled nested case-control subsamples is proposed. Our method is demonstrated using extensive simulations and analysis of the National Lung Screening Trial data. The utility of our proposed approach is compared to the gold standard (full cohort) and synthetic data generated using classification and regression trees. The proposed pooling technique performs to near-optimal levels comparable to full cohort analysis or synthetic data; the efficiency improves in rare event settings when more controls are matched on during nested case-control subcohort sampling.

Privacy-preserving analysis of time-to-event data under nested case-control sampling

期刊

STATISTICAL METHODS IN MEDICAL RESEARCH

出版社

SAGE PUBLICATIONS LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Privacy-preserving analysis of time-to-event data under nested case-control sampling

期刊

STATISTICAL METHODS IN MEDICAL RESEARCH

出版社

SAGE PUBLICATIONS LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文