A Deep Convolutional Neural Network Approach For Predicting Phenotypes From Genotypes

W. Ma, Zhixu Qiu, J. Song, J. Li, Q. Cheng, J. Zhai, Chuang Ma
Published 2018 · Biology, Medicine

Main conclusion

Deep learning is a promising technology to accurately select individuals with high phenotypic values based on genotypic data.

Abstract

Genomic selection (GS) is a promising breeding strategy by which the phenotypes of plant individuals are usually predicted based on genome-wide markers of genotypes. In this study, we present a deep learning method, named DeepGS, to predict phenotypes from genotypes. Using a deep convolutional neural network, DeepGS uses hidden variables that jointly represent features in genotypes when making predictions; it also employs convolution, sampling and dropout strategies to reduce the complexity of high-dimensional genotypic data. We used a large GS dataset to train DeepGS and compared its performance with other methods. The experimental results indicate that DeepGS can be used as a complement to the commonly used RR-BLUP in the prediction of phenotypes from genotypes. The complementarity between DeepGS and RR-BLUP can be utilized using an ensemble learning approach for more accurately selecting individuals with high phenotypic values, even for the absence of outlier individuals and subsets of genotypic markers. The source codes of DeepGS and the ensemble learning approach have been packaged into Docker images for facilitating their applications in different GS programs.
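As a rough illustration of the ensemble idea described in the abstract, the Python sketch below blends two sets of phenotype predictions with a single weight and picks that weight by grid search on held-out individuals. It is not the DeepGS implementation. The function names (`pearson`, `blend`, `fit_weight`), the grid search, and the use of Pearson correlation as the selection criterion are all assumptions made for illustration; the abstract does not say how the ensemble weights are actually chosen.

```python
from math import sqrt
from typing import List, Sequence, Tuple


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation between two equal-length sequences.

    Raises ZeroDivisionError if either sequence is constant.
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def blend(pred_a: Sequence[float], pred_b: Sequence[float], w: float) -> List[float]:
    """Weighted average of two prediction vectors: w * a + (1 - w) * b."""
    return [w * a + (1.0 - w) * b for a, b in zip(pred_a, pred_b)]


def fit_weight(
    pred_a: Sequence[float],
    pred_b: Sequence[float],
    observed: Sequence[float],
    steps: int = 20,
) -> Tuple[float, float]:
    """Grid-search the blend weight in [0, 1] that maximizes correlation
    between blended predictions and observed phenotypes (hypothetical
    selection rule, chosen here only for simplicity)."""
    best_w = 0.0
    best_r = pearson(blend(pred_a, pred_b, best_w), observed)
    for i in range(1, steps + 1):
        w = i / steps
        r = pearson(blend(pred_a, pred_b, w), observed)
        if r > best_r:
            best_w, best_r = w, r
    return best_w, best_r
```

In use, `pred_a` and `pred_b` would be validation-set predictions from two already-trained models (for example a neural network and a ridge-regression model), and the fitted weight would then be applied to predictions for new individuals. Because the grid includes both endpoints, the blended correlation on the validation set is never lower than that of either model alone; whether that gain carries over to unseen data has to be checked separately.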