A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big Data

This study addresses the challenge of extracting valuable information and selecting key variables from large datasets, essential across statistics, computational science, and data science. In the age of big data, where safeguarding personal privacy is paramount, this study presents an online learnin...

Full description

Saved in:

Bibliographic Details
Main Authors:	Jinxia Li, Liwei Lu
Format:	Article
Language:	English
Published:	Wiley 2024-01-01
Series:	IET Information Security
Online Access:	http://dx.doi.org/10.1049/2024/5553292
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832557982855987200
author	Jinxia Li Liwei Lu
author_facet	Jinxia Li Liwei Lu
author_sort	Jinxia Li
collection	DOAJ
description	This study addresses the challenge of extracting valuable information and selecting key variables from large datasets, essential across statistics, computational science, and data science. In the age of big data, where safeguarding personal privacy is paramount, this study presents an online learning algorithm that leverages differential privacy to handle large-scale data effectively. The focus is on enhancing the online group lasso approach within the differential privacy realm. The study begins by comparing online and offline learning approaches and classifying common online learning techniques. It proceeds to elucidate the concept of differential privacy and its importance. By enhancing the group-follow-the-proximally-regularized-leader (GFTPRL) algorithm, we have created a new method for the online group lasso model that integrates differential privacy for binary classification in logistic regression. The research offers a solid validation of the algorithm’s effectiveness based on differential privacy and online learning principles. The algorithm’s performance was thoroughly evaluated through simulations with both synthetic and actual data. The comparison is made between the proposed privacy-preserving algorithm and traditional non-privacy-preserving counterparts, with a focus on regret bounds, a measure of performance. The findings underscore the practical benefits of the differential privacy-preserving algorithm in tackling large-scale data analysis while upholding privacy standards. This research marks a significant step forward in the fusion of big data analytics and the safeguarding of individual privacy.
format	Article
id	doaj-art-a73707c7acbd43ae8b66f20f780ad0c9
institution	Kabale University
issn	1751-8717
language	English
publishDate	2024-01-01
publisher	Wiley
record_format	Article
series	IET Information Security
spelling	doaj-art-a73707c7acbd43ae8b66f20f780ad0c92025-02-03T01:45:20ZengWileyIET Information Security1751-87172024-01-01202410.1049/2024/5553292A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big DataJinxia Li0Liwei Lu1Informatization OfficeInformatization OfficeThis study addresses the challenge of extracting valuable information and selecting key variables from large datasets, essential across statistics, computational science, and data science. In the age of big data, where safeguarding personal privacy is paramount, this study presents an online learning algorithm that leverages differential privacy to handle large-scale data effectively. The focus is on enhancing the online group lasso approach within the differential privacy realm. The study begins by comparing online and offline learning approaches and classifying common online learning techniques. It proceeds to elucidate the concept of differential privacy and its importance. By enhancing the group-follow-the-proximally-regularized-leader (GFTPRL) algorithm, we have created a new method for the online group lasso model that integrates differential privacy for binary classification in logistic regression. The research offers a solid validation of the algorithm’s effectiveness based on differential privacy and online learning principles. The algorithm’s performance was thoroughly evaluated through simulations with both synthetic and actual data. The comparison is made between the proposed privacy-preserving algorithm and traditional non-privacy-preserving counterparts, with a focus on regret bounds, a measure of performance. The findings underscore the practical benefits of the differential privacy-preserving algorithm in tackling large-scale data analysis while upholding privacy standards. This research marks a significant step forward in the fusion of big data analytics and the safeguarding of individual privacy.http://dx.doi.org/10.1049/2024/5553292
spellingShingle	Jinxia Li Liwei Lu A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big Data IET Information Security
title	A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big Data
title_full	A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big Data
title_fullStr	A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big Data
title_full_unstemmed	A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big Data
title_short	A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big Data
title_sort	novel differentially private online learning algorithm for group lasso in big data
url	http://dx.doi.org/10.1049/2024/5553292
work_keys_str_mv	AT jinxiali anoveldifferentiallyprivateonlinelearningalgorithmforgrouplassoinbigdata AT liweilu anoveldifferentiallyprivateonlinelearningalgorithmforgrouplassoinbigdata AT jinxiali noveldifferentiallyprivateonlinelearningalgorithmforgrouplassoinbigdata AT liweilu noveldifferentiallyprivateonlinelearningalgorithmforgrouplassoinbigdata

A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big Data

Similar Items