Identifying the Risky SNP of Osteoporosis with ID3-PEP Decision Tree Algorithm

In the past 20 years, much progress has been made on the genetic analysis of osteoporosis. A number of genes and SNPs associated with osteoporosis have been found through GWAS method. In this paper, we intend to identify the suspected risky SNPs of osteoporosis with computational methods based on th...

Full description

Saved in:
Bibliographic Details
Main Authors: Jincai Yang, Huichao Gu, Xingpeng Jiang, Qingyang Huang, Xiaohua Hu, Xianjun Shen
Format: Article
Language:English
Published: Wiley 2017-01-01
Series:Complexity
Online Access:http://dx.doi.org/10.1155/2017/9194801
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832562732752175104
author Jincai Yang
Huichao Gu
Xingpeng Jiang
Qingyang Huang
Xiaohua Hu
Xianjun Shen
author_facet Jincai Yang
Huichao Gu
Xingpeng Jiang
Qingyang Huang
Xiaohua Hu
Xianjun Shen
author_sort Jincai Yang
collection DOAJ
description In the past 20 years, much progress has been made on the genetic analysis of osteoporosis. A number of genes and SNPs associated with osteoporosis have been found through GWAS method. In this paper, we intend to identify the suspected risky SNPs of osteoporosis with computational methods based on the known osteoporosis GWAS-associated SNPs. The process includes two steps. Firstly, we decided whether the genes associated with the suspected risky SNPs are associated with osteoporosis by using random walk algorithm on the PPI network of osteoporosis GWAS-associated genes and the genes associated with the suspected risky SNPs. In order to solve the overfitting problem in ID3 decision tree algorithm, we then classified the SNPs with positive results based on their features of position and function through a simplified classification decision tree which was constructed by ID3 decision tree algorithm with PEP (Pessimistic-Error Pruning). We verified the accuracy of the identification framework with the data set of GWAS-associated SNPs, and the result shows that this method is feasible. It provides a more convenient way to identify the suspected risky SNPs associated with osteoporosis.
format Article
id doaj-art-4b85d0da22f648f194c6d3e3d46a08ad
institution Kabale University
issn 1076-2787
1099-0526
language English
publishDate 2017-01-01
publisher Wiley
record_format Article
series Complexity
spelling doaj-art-4b85d0da22f648f194c6d3e3d46a08ad2025-02-03T01:21:50ZengWileyComplexity1076-27871099-05262017-01-01201710.1155/2017/91948019194801Identifying the Risky SNP of Osteoporosis with ID3-PEP Decision Tree AlgorithmJincai Yang0Huichao Gu1Xingpeng Jiang2Qingyang Huang3Xiaohua Hu4Xianjun Shen5School of Computer Science, Central China Normal University, Wuhan 430079, ChinaSchool of Computer Science, Central China Normal University, Wuhan 430079, ChinaSchool of Computer Science, Central China Normal University, Wuhan 430079, ChinaSchool of Life Science, Central China Normal University, Wuhan 430079, ChinaSchool of Computer Science, Central China Normal University, Wuhan 430079, ChinaSchool of Computer Science, Central China Normal University, Wuhan 430079, ChinaIn the past 20 years, much progress has been made on the genetic analysis of osteoporosis. A number of genes and SNPs associated with osteoporosis have been found through GWAS method. In this paper, we intend to identify the suspected risky SNPs of osteoporosis with computational methods based on the known osteoporosis GWAS-associated SNPs. The process includes two steps. Firstly, we decided whether the genes associated with the suspected risky SNPs are associated with osteoporosis by using random walk algorithm on the PPI network of osteoporosis GWAS-associated genes and the genes associated with the suspected risky SNPs. In order to solve the overfitting problem in ID3 decision tree algorithm, we then classified the SNPs with positive results based on their features of position and function through a simplified classification decision tree which was constructed by ID3 decision tree algorithm with PEP (Pessimistic-Error Pruning). We verified the accuracy of the identification framework with the data set of GWAS-associated SNPs, and the result shows that this method is feasible. It provides a more convenient way to identify the suspected risky SNPs associated with osteoporosis.http://dx.doi.org/10.1155/2017/9194801
spellingShingle Jincai Yang
Huichao Gu
Xingpeng Jiang
Qingyang Huang
Xiaohua Hu
Xianjun Shen
Identifying the Risky SNP of Osteoporosis with ID3-PEP Decision Tree Algorithm
Complexity
title Identifying the Risky SNP of Osteoporosis with ID3-PEP Decision Tree Algorithm
title_full Identifying the Risky SNP of Osteoporosis with ID3-PEP Decision Tree Algorithm
title_fullStr Identifying the Risky SNP of Osteoporosis with ID3-PEP Decision Tree Algorithm
title_full_unstemmed Identifying the Risky SNP of Osteoporosis with ID3-PEP Decision Tree Algorithm
title_short Identifying the Risky SNP of Osteoporosis with ID3-PEP Decision Tree Algorithm
title_sort identifying the risky snp of osteoporosis with id3 pep decision tree algorithm
url http://dx.doi.org/10.1155/2017/9194801
work_keys_str_mv AT jincaiyang identifyingtheriskysnpofosteoporosiswithid3pepdecisiontreealgorithm
AT huichaogu identifyingtheriskysnpofosteoporosiswithid3pepdecisiontreealgorithm
AT xingpengjiang identifyingtheriskysnpofosteoporosiswithid3pepdecisiontreealgorithm
AT qingyanghuang identifyingtheriskysnpofosteoporosiswithid3pepdecisiontreealgorithm
AT xiaohuahu identifyingtheriskysnpofosteoporosiswithid3pepdecisiontreealgorithm
AT xianjunshen identifyingtheriskysnpofosteoporosiswithid3pepdecisiontreealgorithm