A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big Data

This study addresses the challenge of extracting valuable information and selecting key variables from large datasets, essential across statistics, computational science, and data science. In the age of big data, where safeguarding personal privacy is paramount, this study presents an online learnin...

Full description

Saved in:
Bibliographic Details
Main Authors: Jinxia Li, Liwei Lu
Format: Article
Language:English
Published: Wiley 2024-01-01
Series:IET Information Security
Online Access:http://dx.doi.org/10.1049/2024/5553292
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832557982855987200
author Jinxia Li
Liwei Lu
author_facet Jinxia Li
Liwei Lu
author_sort Jinxia Li
collection DOAJ
description This study addresses the challenge of extracting valuable information and selecting key variables from large datasets, essential across statistics, computational science, and data science. In the age of big data, where safeguarding personal privacy is paramount, this study presents an online learning algorithm that leverages differential privacy to handle large-scale data effectively. The focus is on enhancing the online group lasso approach within the differential privacy realm. The study begins by comparing online and offline learning approaches and classifying common online learning techniques. It proceeds to elucidate the concept of differential privacy and its importance. By enhancing the group-follow-the-proximally-regularized-leader (GFTPRL) algorithm, we have created a new method for the online group lasso model that integrates differential privacy for binary classification in logistic regression. The research offers a solid validation of the algorithm’s effectiveness based on differential privacy and online learning principles. The algorithm’s performance was thoroughly evaluated through simulations with both synthetic and actual data. The comparison is made between the proposed privacy-preserving algorithm and traditional non-privacy-preserving counterparts, with a focus on regret bounds, a measure of performance. The findings underscore the practical benefits of the differential privacy-preserving algorithm in tackling large-scale data analysis while upholding privacy standards. This research marks a significant step forward in the fusion of big data analytics and the safeguarding of individual privacy.
format Article
id doaj-art-a73707c7acbd43ae8b66f20f780ad0c9
institution Kabale University
issn 1751-8717
language English
publishDate 2024-01-01
publisher Wiley
record_format Article
series IET Information Security
spelling doaj-art-a73707c7acbd43ae8b66f20f780ad0c92025-02-03T01:45:20ZengWileyIET Information Security1751-87172024-01-01202410.1049/2024/5553292A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big DataJinxia Li0Liwei Lu1Informatization OfficeInformatization OfficeThis study addresses the challenge of extracting valuable information and selecting key variables from large datasets, essential across statistics, computational science, and data science. In the age of big data, where safeguarding personal privacy is paramount, this study presents an online learning algorithm that leverages differential privacy to handle large-scale data effectively. The focus is on enhancing the online group lasso approach within the differential privacy realm. The study begins by comparing online and offline learning approaches and classifying common online learning techniques. It proceeds to elucidate the concept of differential privacy and its importance. By enhancing the group-follow-the-proximally-regularized-leader (GFTPRL) algorithm, we have created a new method for the online group lasso model that integrates differential privacy for binary classification in logistic regression. The research offers a solid validation of the algorithm’s effectiveness based on differential privacy and online learning principles. The algorithm’s performance was thoroughly evaluated through simulations with both synthetic and actual data. The comparison is made between the proposed privacy-preserving algorithm and traditional non-privacy-preserving counterparts, with a focus on regret bounds, a measure of performance. The findings underscore the practical benefits of the differential privacy-preserving algorithm in tackling large-scale data analysis while upholding privacy standards. This research marks a significant step forward in the fusion of big data analytics and the safeguarding of individual privacy.http://dx.doi.org/10.1049/2024/5553292
spellingShingle Jinxia Li
Liwei Lu
A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big Data
IET Information Security
title A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big Data
title_full A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big Data
title_fullStr A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big Data
title_full_unstemmed A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big Data
title_short A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big Data
title_sort novel differentially private online learning algorithm for group lasso in big data
url http://dx.doi.org/10.1049/2024/5553292
work_keys_str_mv AT jinxiali anoveldifferentiallyprivateonlinelearningalgorithmforgrouplassoinbigdata
AT liweilu anoveldifferentiallyprivateonlinelearningalgorithmforgrouplassoinbigdata
AT jinxiali noveldifferentiallyprivateonlinelearningalgorithmforgrouplassoinbigdata
AT liweilu noveldifferentiallyprivateonlinelearningalgorithmforgrouplassoinbigdata