Choosing blindly but wisely: differentially private solicitation of DNA datasets for disease marker discovery.

TitleChoosing blindly but wisely: differentially private solicitation of DNA datasets for disease marker discovery.
Publication TypeJournal Article
Year of Publication2015
AuthorsZhao, Y, Wang, X, Jiang, X, Ohno-Machado, L, Tang, H
JournalJ Am Med Inform Assoc
Volume22
Issue1
Pagination100-8
Date Published2015 Jan
ISSN1527-974X
iDASH CategoryPrivacy Technology
Abstract<p><b>OBJECTIVE: </b>To propose a new approach to privacy preserving data selection, which helps the data users access human genomic datasets efficiently without undermining patients' privacy.</p><p><b>METHODS: </b>Our idea is to let each data owner publish a set of differentially-private pilot data, on which a data user can test-run arbitrary association-test algorithms, including those not known to the data owner a priori. We developed a suite of new techniques, including a pilot-data generation approach that leverages the linkage disequilibrium in the human genome to preserve both the utility of the data and the privacy of the patients, and a utility evaluation method that helps the user assess the value of the real data from its pilot version with high confidence.</p><p><b>RESULTS: </b>We evaluated our approach on real human genomic data using four popular association tests. Our study shows that the proposed approach can help data users make the right choices in most cases.</p><p><b>CONCLUSIONS: </b>Even though the pilot data cannot be directly used for scientific discovery, it provides a useful indication of which datasets are more likely to be useful to data users, who can therefore approach the appropriate data owners to gain access to the data.</p>
DOI10.1136/amiajnl-2014-003043
Alternate JournalJ Am Med Inform Assoc
PubMed ID25352565
PubMed Central IDPMC4433380
Grant List1R01HG007078-01 / HG / NHGRI NIH HHS / United States
R00 LM011392 / LM / NLM NIH HHS / United States
R00LM011392 / LM / NLM NIH HHS / United States
R01 HG007078 / HG / NHGRI NIH HHS / United States
R21 LM012060 / LM / NLM NIH HHS / United States
R21LM012060 / LM / NLM NIH HHS / United States
U54HL108460 / HL / NHLBI NIH HHS / United States

iDASH Category: