Bi-stream CNN Down Syndrome screening model based on genotyping array

Abstract Background Human Down syndrome (DS) is usually caused by genomic micro-duplications and dosage imbalances of human chromosome 21. It is associated with many genomic and phenotype abnormalities. Even though human DS occurs about 1 per 1,000 births worldwide, which is a very high rate, resear...

Full description

Saved in:
Bibliographic Details
Main Authors: Bing Feng (Author), William Hoskins (Author), Yan Zhang (Author), Zibo Meng (Author), David C. Samuels (Author), Jiandong Wang (Author), Ruofan Xia (Author), Chao Liu (Author), Jijun Tang (Author), Yan Guo (Author)
Format: Book
Published: BMC, 2018-11-01T00:00:00Z.
Subjects:
Online Access:Connect to this object online.
Tags: Add Tag
No Tags, Be the first to tag this record!

MARC

LEADER 00000 am a22000003u 4500
001 doaj_d43044b5a7c940dbaa1e1bbb3ea5aefc
042 |a dc 
100 1 0 |a Bing Feng  |e author 
700 1 0 |a William Hoskins  |e author 
700 1 0 |a Yan Zhang  |e author 
700 1 0 |a Zibo Meng  |e author 
700 1 0 |a David C. Samuels  |e author 
700 1 0 |a Jiandong Wang  |e author 
700 1 0 |a Ruofan Xia  |e author 
700 1 0 |a Chao Liu  |e author 
700 1 0 |a Jijun Tang  |e author 
700 1 0 |a Yan Guo  |e author 
245 0 0 |a Bi-stream CNN Down Syndrome screening model based on genotyping array 
260 |b BMC,   |c 2018-11-01T00:00:00Z. 
500 |a 10.1186/s12920-018-0416-0 
500 |a 1755-8794 
520 |a Abstract Background Human Down syndrome (DS) is usually caused by genomic micro-duplications and dosage imbalances of human chromosome 21. It is associated with many genomic and phenotype abnormalities. Even though human DS occurs about 1 per 1,000 births worldwide, which is a very high rate, researchers haven't found any effective method to cure DS. Currently, the most efficient ways of human DS prevention are screening and early detection. Methods In this study, we used deep learning techniques and analyzed a set of Illumina genotyping array data. We built a bi-stream convolutional neural networks model to screen/predict the occurrence of DS. Firstly, we built image input data by converting the intensities of each SNP site into chromosome SNP maps. Next, we proposed a bi-stream convolutional neural network (CNN) architecture with nine layers and two branch models. We further merged two CNN branch models into one model in the fourth convolutional layer, and output the prediction in the last layer. Results Our bi-stream CNN model achieved 99.3% average accuracies, and very low false-positive and false-negative rates, which was necessary for further applications in disease prediction and medical practice. We further visualized the feature maps and learned filters from intermediate convolutional layers, which showed the genomic patterns and correlated SNPs variations in human DS genomes. We also compared our methods with other CNN and traditional machine learning models. We further analyzed and discussed the characteristics and strengths of our bi-stream CNN model. Conclusions Our bi-stream model used two branch CNN models to learn the local genome features and regional patterns among adjacent genes and SNP sites from two chromosomes simultaneously. It achieved the best performance in all evaluating metrics when compared with two single-stream CNN models and three traditional machine-learning algorithms. The visualized feature maps also provided opportunities to study the genomic markers and pathway components associated with Human DS, which provided insights for gene therapy and genomic medicine developments. 
546 |a EN 
690 |a Deep learning 
690 |a Convolutional neural networks 
690 |a Human down syndrome 
690 |a Genotyping 
690 |a Internal medicine 
690 |a RC31-1245 
690 |a Genetics 
690 |a QH426-470 
655 7 |a article  |2 local 
786 0 |n BMC Medical Genomics, Vol 11, Iss S5, Pp 25-33 (2018) 
787 0 |n http://link.springer.com/article/10.1186/s12920-018-0416-0 
787 0 |n https://doaj.org/toc/1755-8794 
856 4 1 |u https://doaj.org/article/d43044b5a7c940dbaa1e1bbb3ea5aefc  |z Connect to this object online.