The effect of prior distributions of different statistical models on the accuracy of genomic prediction: simulation study

Document Type : Research Paper

Authors

1 Assistant professor, Department of Animal Science, Faculty of Agriculture, Ilam University, Ilam, Iran

2 Animal Science Research Department, Kermanshah Agriculture and Natural Resources Research and Education Center, Kermanshah, Iran.

3 Animal Science Research Institute of Iran, Agricultural Research, Education and Extension Organization (AREEO), Karaj, Iran

Abstract

In the genomic selection, SNP markers across whole genome are used to estimate the marker effects. Genomic breeding value of animals can be predicted by different statistical methods. Genomic breeding value of animals can be predicted by different statistical methods. In the present study, accuracies of the predicted direct genomic breeding values were compared under several statistical models including Bayes A, B-LASSO gamma, B-LASSO beta and BGLR by considering two heritabilities of 0.3 and 0.05. Three values of π (0.1, 0.3, and 0.5) and four values of s2 (0.01, 0.1, 10, and 100) were simulated and correspondingly evaluated in terms of accuracy. The obtained results showed the highest accuracy of direct genomic breeding value was obtained under Bayes-A method, which were 0.88 and 0.69 for heritabilities of 0.3 and 0.05, respectively. Regression coefficients of methods for estimating the marker effects were more unbiased under heritability of 0.3 than 0.05. By considering the heritability of 0.3, the lowest and highest error were obtained under Bayes-A (121.2) and B-LASSO beta (165.2) methods, respectively. Under BGLR method, the highest and lowest accuracy of Bayes-A were obtained for π, 0.5 (0.81) and π, 0.1 (0.45). By increasing s2 parameter a decrease in the accuracy of genomic predictions was obtained. The obtained results suggested that to maximize the accuracy of the genetic prediction for traits with moderate to high heritability (0.3) optimal point of parameter π may ranged from 0.41 to 0.56 and while for traits with low heritability from 0.56 to 0.73.

Keywords


محمدی، ی. و ستایی مختاری، م. (1396). روش‌های پارامتری و ناپارامتری برای بررسی صحت انتخاب ژنومی با معماری‌های ژنتیکی افزایشی و  غالبیت . پژوهش­های تولیدات دامی. شماره 18. ص ص. 167-161.
Chang, L.Y., Toghian, S., Aggrey, S.E. and Rekaya R. (2019). Increasing accuracy of genomic selection in presence of high density marker panels through the prioritization of relevant polymorphisms. BMC Genetics. 20: 21.
De los Campos, G., Hickey, J.M., Pong-Wong, R., Daetwyler, H.D. and Calus M.P.L. (2013). Whole-genome regression and prediction methods applied to plant and animal breeding. Genetics. 193: 327–345.
De los Campos, G., Naya, H., Gianola, D., Crossa, J., Legarra, A., Manfredi, E., Weigel, K., and Cotes, J.M. (2009). Predicting quantitative traits with regression models for dense molecular markers and pedigree. Genetics. 182:375-385.
Harris, B.L., Johnson, D.L., and Spelman, R.J. (2008). Genomic selection in New Zealand and the implications for national genetic evaluation.  Proc. 36th ICAR Biennial Session, Niagara Falls, Canada, 38:325-330.
Howard, R., A. L. Carriquiry and W. D. Beavis. (2014). Parametric and nonparametric statistical methods for genomic selection of traits with additive and epistatic genetic architectures. G3. 4: 1027-1046.
Gaspa, G., Pintus, M.A., Nicolazzi, E.L., Vicario, D., Valentini, C., and Macciotta, N.P.P. (2013). Use of principal component approach to predict direct genomic breeding values for beef traits in Italian Simmental cattle. Journal Animal Science. 91:29-37.
Guarini, A.R., Lourenco, D.A.L., Brito, L.F., Sargolzaei, M., Baes, C., Miglior, F., Misztal, I. and Schenkel, F.S. (2018). Comparison of genomic predictions for lowly heritable traits using multi-step and single-step genomic best linear unbiased predictor in Holstein cattle. Journal of Dairy Science. 101:1–11.
Hayes, B., Bowman, P., Chamberlain, A. and Goddard, M. (2009). Genomic selection in dairy cattle: Progress and challenges. Journal of Dairy Science. 92:433–443.
Meuwissen, T.H., Hayes, B.J., and Goddard, M.E. (2001). Prediction of total genetic value using genome-wide dense marker maps. Genetics. 157: 1819–1829.
Neves, H.H.R., Carvalheiro, R., O’Brien, A. M., Utsunomiya, Y.T., do Carmo, A.S., Schenkel, F.S., Sölkner, J., McEwan, J.C., Van Tassell, C.P., Cole, J.B.,  da Silva, M.V., and Queiroz, S. A. (2014). Accuracy of genomic predictions in Bos indicus (Nellore) cattle. Genetics Selection Evolution. 46:17.
Nicolazzi, E., Negrini, R., Amanda, J., Chamberlain, Goddard, M.E., Marsan, P.A., and Hayes B.J. (2013). Effect of Prior Distributions on Accuracy of Genomic Breeding Values for Two Dairy Traits. Italian Journal of Animal Science. 12: 551-561.
Park, T., and Casella, G. (2008). The Bayesian Lasso. Journal of the American Statistical Association. 103: 681-686.
Pintus, M.A., Nicolazzi, E.L., Van Kaam, J.B., Biffani, S., Stella, A., Gaspa, G., Dimauro, C., and Macciotta, N.P. (2013). Use of different statistical models to predict direct genomic values for productive and functional traits in Italian Holstein. Journal Animal Breeding and Genetics. 130:32-40.
 ter Braak, C.J.F., Boer, M.P.,  and Bink, M.C.A.M. (2005). Extending Xu’s Bayesian model for estimating polygenic effects using markers of the entire genome. Genetics. 170:1435-1438.
Sargolzaei, M., and Schenkel, F.S. (2009). QMSim: a large-scale genome simulator for livestock. Bioinformatics. 25: 680–1.
Schefers, M.J. and Weigel, K.A. (2020). Genomic selection in dairy cattle: Integration of DNA testing into breeding programs. Animal Frontiers. 2(1): 24-37.
VanRaden, P. M., C. P. Van Tassell, G. R. Wiggans, T. S. Sonstegard, R. D. Schnabel, J. F. Taylor, and F. S. Schenkel. (2009). Invited review: Reliability of genomic predictions for North American Holstein bulls. Journal Dairy Science. 92:16–24.
Verbyla, K., Bowman, P.J., Hayes, B.J., and Goddard, M.E. (2010). Sensitivity of genomic selection to using different prior distributions. BMC Proceedings 4(Suppl.1):S5.
Yang, w., Chen, C., and Tempelman, R.J. (2015). Improving the computational efficiency of fully Bayes inference and assessing the effect of misspecification of hyperparameters in whole-genome prediction models. Genetics Selection Evolution. 47: 13.
Yi, N., and Xu, S. (2008). Bayesian LASSO for quantitative trait loci mapping. Genetics. 179:1045-1055.

Zhang, H., Yin, L., Wang, M., Yuan, X.,  and Liu X. (2019). Factors Affecting the Accuracy of Genomic Selection for Agricultural Economic Traits in Maize, Cattle, and Pig Populations. Frontiers in Genetics. 10: 180-189.