Home    中文  
 
  • Search
  • lucene Search
  • Citation
  • Fig/Tab
  • Adv Search
Just Accepted  |  Current Issue  |  Archive  |  Featured Articles  |  Most Read  |  Most Download  |  Most Cited

Chinese Journal of Cell and Stem Cell(Electronic Edition) ›› 2023, Vol. 13 ›› Issue (01): 19-26. doi: 10.3877/cma.j.issn.2095-1221.2023.01.003

• Original Research • Previous Articles     Next Articles

Construction of diagnosis model for coronary atherosclerosis heart disease using random forest and artificial neural network based on susceptibility genes in peripheral blood cells

Enrui Xie1, Yixuan Duan1, Chang Liu1, Jie Deng1,()   

  1. 1. Department of Cardiovascular Medicine, the Second Affiliated Hospital of Xi'an Jiaotng University, Xi'an 710000, China
  • Received:2022-08-20 Online:2023-02-01 Published:2023-05-16
  • Contact: Jie Deng

Abstract:

Objective

Our study aims to find susceptibility genes from peripheral blood cells as potential molecular biomarkers of coronary heart disease (CHD) and to create a diagnosis model using bioinformatics combined with random forest (RF) and artificial neural network (ANN) .

Methods

We downloaded three gene expression profiles (GSE20680, GSE20681, GSE12288) from Gene Expression Omnibus (GEO) database. Then we performed analyses of differential expression, gene ontology terms, and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways based on GSE20680. Next, the RF was used further to obtain the key genes from the differentially expressed genes. Finally, we set up a training set to construct the diagnostic model using ANN and two test sets to verify the diagnostic efficacy of the model by comprehensively merging the three datasets.

Results

Using gene expression profiles in the GEO database, we identified 21 key genes from 284 differentially expressed genes by RF, and a new diagnostic model of CHD was also successfully constructed by using ANN to calculate the weight of key genes. Finally, two test sets were used to verify the diagnostic model's performance, and the AUC values were high (0.9024 and 0.8153 respectively) .

Conclusion

We identified 21 potential gene biomarkers of CHD and established a novel diagnostic model which shows a good result in the classification of CHD, and it may be helpful to CHD screening and early clinical diagnosis.

Key words: Coronary atherosclerotic heart disease, Bioinformatics, Neural networks, Gene analysis, Machine learning

京ICP 备07035254号-3
Copyright © Chinese Journal of Cell and Stem Cell(Electronic Edition), All Rights Reserved.
Tel: 0086-591-87982783 E-mail: ccsct@vip.163.com
Powered by Beijing Magtech Co. Ltd