Identifying the Risky SNP of Osteoporosis with ID3-PEP Decision Tree Algorithm
In the past 20 years, much progress has been made on the genetic analysis of osteoporosis. A number of genes and SNPs associated with osteoporosis have been found through GWAS method. In this paper, we intend to identify the suspected risky SNPs of osteoporosis with computational methods based on th...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2017-01-01
|
Series: | Complexity |
Online Access: | http://dx.doi.org/10.1155/2017/9194801 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832562732752175104 |
---|---|
author | Jincai Yang Huichao Gu Xingpeng Jiang Qingyang Huang Xiaohua Hu Xianjun Shen |
author_facet | Jincai Yang Huichao Gu Xingpeng Jiang Qingyang Huang Xiaohua Hu Xianjun Shen |
author_sort | Jincai Yang |
collection | DOAJ |
description | In the past 20 years, much progress has been made on the genetic analysis of osteoporosis. A number of genes and SNPs associated with osteoporosis have been found through GWAS method. In this paper, we intend to identify the suspected risky SNPs of osteoporosis with computational methods based on the known osteoporosis GWAS-associated SNPs. The process includes two steps. Firstly, we decided whether the genes associated with the suspected risky SNPs are associated with osteoporosis by using random walk algorithm on the PPI network of osteoporosis GWAS-associated genes and the genes associated with the suspected risky SNPs. In order to solve the overfitting problem in ID3 decision tree algorithm, we then classified the SNPs with positive results based on their features of position and function through a simplified classification decision tree which was constructed by ID3 decision tree algorithm with PEP (Pessimistic-Error Pruning). We verified the accuracy of the identification framework with the data set of GWAS-associated SNPs, and the result shows that this method is feasible. It provides a more convenient way to identify the suspected risky SNPs associated with osteoporosis. |
format | Article |
id | doaj-art-4b85d0da22f648f194c6d3e3d46a08ad |
institution | Kabale University |
issn | 1076-2787 1099-0526 |
language | English |
publishDate | 2017-01-01 |
publisher | Wiley |
record_format | Article |
series | Complexity |
spelling | doaj-art-4b85d0da22f648f194c6d3e3d46a08ad2025-02-03T01:21:50ZengWileyComplexity1076-27871099-05262017-01-01201710.1155/2017/91948019194801Identifying the Risky SNP of Osteoporosis with ID3-PEP Decision Tree AlgorithmJincai Yang0Huichao Gu1Xingpeng Jiang2Qingyang Huang3Xiaohua Hu4Xianjun Shen5School of Computer Science, Central China Normal University, Wuhan 430079, ChinaSchool of Computer Science, Central China Normal University, Wuhan 430079, ChinaSchool of Computer Science, Central China Normal University, Wuhan 430079, ChinaSchool of Life Science, Central China Normal University, Wuhan 430079, ChinaSchool of Computer Science, Central China Normal University, Wuhan 430079, ChinaSchool of Computer Science, Central China Normal University, Wuhan 430079, ChinaIn the past 20 years, much progress has been made on the genetic analysis of osteoporosis. A number of genes and SNPs associated with osteoporosis have been found through GWAS method. In this paper, we intend to identify the suspected risky SNPs of osteoporosis with computational methods based on the known osteoporosis GWAS-associated SNPs. The process includes two steps. Firstly, we decided whether the genes associated with the suspected risky SNPs are associated with osteoporosis by using random walk algorithm on the PPI network of osteoporosis GWAS-associated genes and the genes associated with the suspected risky SNPs. In order to solve the overfitting problem in ID3 decision tree algorithm, we then classified the SNPs with positive results based on their features of position and function through a simplified classification decision tree which was constructed by ID3 decision tree algorithm with PEP (Pessimistic-Error Pruning). We verified the accuracy of the identification framework with the data set of GWAS-associated SNPs, and the result shows that this method is feasible. It provides a more convenient way to identify the suspected risky SNPs associated with osteoporosis.http://dx.doi.org/10.1155/2017/9194801 |
spellingShingle | Jincai Yang Huichao Gu Xingpeng Jiang Qingyang Huang Xiaohua Hu Xianjun Shen Identifying the Risky SNP of Osteoporosis with ID3-PEP Decision Tree Algorithm Complexity |
title | Identifying the Risky SNP of Osteoporosis with ID3-PEP Decision Tree Algorithm |
title_full | Identifying the Risky SNP of Osteoporosis with ID3-PEP Decision Tree Algorithm |
title_fullStr | Identifying the Risky SNP of Osteoporosis with ID3-PEP Decision Tree Algorithm |
title_full_unstemmed | Identifying the Risky SNP of Osteoporosis with ID3-PEP Decision Tree Algorithm |
title_short | Identifying the Risky SNP of Osteoporosis with ID3-PEP Decision Tree Algorithm |
title_sort | identifying the risky snp of osteoporosis with id3 pep decision tree algorithm |
url | http://dx.doi.org/10.1155/2017/9194801 |
work_keys_str_mv | AT jincaiyang identifyingtheriskysnpofosteoporosiswithid3pepdecisiontreealgorithm AT huichaogu identifyingtheriskysnpofosteoporosiswithid3pepdecisiontreealgorithm AT xingpengjiang identifyingtheriskysnpofosteoporosiswithid3pepdecisiontreealgorithm AT qingyanghuang identifyingtheriskysnpofosteoporosiswithid3pepdecisiontreealgorithm AT xiaohuahu identifyingtheriskysnpofosteoporosiswithid3pepdecisiontreealgorithm AT xianjunshen identifyingtheriskysnpofosteoporosiswithid3pepdecisiontreealgorithm |