Article

Private Empirical Risk Minimization With Analytic Gaussian Mechanism for Healthcare System

Journal

IEEE TRANSACTIONS ON BIG DATA
Volume 8, Issue 4, Pages 1107-1117

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TBDATA.2020.2997732

Keywords

Medical services; Machine learning; Privacy; Perturbation methods; Training; Machine learning algorithms; Differential privacy; analytic Gaussian mechanism; empirical risk minimization; machine learning; healthcare

Funding

  1. U.S. National Science Foundation [US CNS-1350230, CNS-1702850, CNS-1801925, CNS-2029569]
  2. National Science Foundation [CNS2029685]
  3. National Natural Science Foundation of China (NSFC) [61860206005]

Machine learning is widely applied in healthcare to help humans make crucial decisions, which makes data privacy an unavoidable concern because sensitive data, such as patients' records and a company's registers, are used. Constructing a privacy-preserving machine learning model that still maintains high accuracy is therefore a challenging problem. In this article, we propose two differentially private algorithms, Output Perturbation with aGM (OPERA) and Gradient Perturbation with aGM (GRPUA), for empirical risk minimization, a useful method for obtaining a globally optimal classifier. Both leverage the analytic Gaussian mechanism (aGM) to preserve the privacy of sensitive medical data in a healthcare system. We theoretically analyze and prove utility upper bounds for the proposed algorithms and compare them with prior algorithms in the literature. The analyses show that, in the high privacy regime, the proposed algorithms achieve a tighter utility bound in both settings: strongly convex and non-strongly convex loss functions. We also evaluate the proposed private algorithms on five benchmark datasets. The simulation results demonstrate that our approaches achieve higher accuracy and lower objective values than existing ones in all three datasets while providing differential privacy guarantees.
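To make the idea of output perturbation with the analytic Gaussian mechanism concrete, the following is a minimal sketch and not the authors' OPERA algorithm. It assumes the standard aGM condition, under which Gaussian noise with standard deviation σ at L2 sensitivity Δ is (ε, δ)-differentially private when Φ(Δ/(2σ) − εσ/Δ) − e^ε Φ(−Δ/(2σ) − εσ/Δ) ≤ δ. It also assumes the common sensitivity bound 2L/(nλ) for an L2-regularized ERM minimizer with an L-Lipschitz convex loss. The function names, the bisection search, and all parameter values are illustrative assumptions, not taken from the article.

```python
import math
import random


def std_normal_cdf(t):
    """CDF of the standard normal distribution."""
    return 0.5 * (1.0 + math.erf(t / math.sqrt(2.0)))


def agm_delta(sigma, eps, sensitivity):
    """Delta achieved by N(0, sigma^2) noise at a given eps and L2 sensitivity,
    using the analytic Gaussian mechanism condition (assumed, see lead-in)."""
    a = sensitivity / (2.0 * sigma)
    b = eps * sigma / sensitivity
    return std_normal_cdf(a - b) - math.exp(eps) * std_normal_cdf(-a - b)


def calibrate_sigma(eps, delta, sensitivity, iters=200):
    """Find a near-minimal sigma with agm_delta(sigma) <= delta by bisection.
    Relies on agm_delta decreasing in sigma; assumes 0 < delta < 1."""
    lo = 1e-6 * sensitivity  # tiny noise: achieved delta is close to 1
    hi = sensitivity
    while agm_delta(hi, eps, sensitivity) > delta:
        hi *= 2.0  # grow until the privacy condition holds
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if agm_delta(mid, eps, sensitivity) > delta:
            lo = mid
        else:
            hi = mid
    return hi  # hi always satisfies the condition


def output_perturbation(weights, n, lam, lipschitz, eps, delta, rng=random):
    """Add calibrated Gaussian noise to an already-trained ERM minimizer.
    Sensitivity 2L/(n*lam) is an assumption for L2-regularized convex ERM."""
    sensitivity = 2.0 * lipschitz / (n * lam)
    sigma = calibrate_sigma(eps, delta, sensitivity)
    return [w + rng.gauss(0.0, sigma) for w in weights], sigma


# Hypothetical usage: perturb a 3-dimensional weight vector.
private_w, sigma = output_perturbation(
    [0.3, -1.2, 0.8], n=1000, lam=0.01, lipschitz=1.0,
    eps=0.5, delta=1e-5, rng=random.Random(0),
)
```

For ε < 1, the calibrated σ is smaller than the classical Gaussian-mechanism value Δ·sqrt(2 ln(1.25/δ))/ε, which is why aGM-based noise can improve utility at the same privacy level.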
