Title

Replication Data for: A comparison between three machine learning methods for multivariate genomic prediction using the Sparse Kernels Methods (SKM) library

Author

Osval Antonio Montesinos-Lopez

Pedro César Santana Mancilla

Jose Crossa

Access level

Open Access

Description

Abstract - Genomic selection (GS) provides a new way for plant breeders select the best genotype. It draws upon historical phenotypic and genotypic information for training a statistical machine learning model which is used for predicting phenotypic (or breeding) values of new lines for which only genotypic information is available. Many statistical machine learning methods have been proposed for this task, but multi-trait (MT) genomic prediction models are preferred because they take advantage of correlated traits to improve the prediction accuracy. This study contains six datasets that were used to compare the prediction performance of three MT methods: the MT genomic best linear unbiased predictor (GBLUP), the MT partial least square (PLS) and the multi-trait Random Forest (RF). The data come from groundnuts, rice, and wheat. The accompanying article describes the results of the analysis.

Publisher

International Maize and Wheat Improvement Center

Publish date

2022

Resource Type

Dataset

Source repository

Repositorio Institucional de Datos y Software de Investigación del CIMMYT

Downloads

0

Comments



You need to sign in or sign up to comment.