Combining Partially Overlapping Multi-Omics Data in Databases Using Relationship Matrices

Akdemir, Deniz ORCID: https://orcid.org/0000-0003-0553-6798, Knox, Ron and Isidro Sánchez, Julio ORCID: https://orcid.org/0000-0002-9044-3221 (2020). Combining Partially Overlapping Multi-Omics Data in Databases Using Relationship Matrices. "Frontiers in Plant Science", v. 11 (n. 947); p. 947. ISSN 1664462X. https://doi.org/10.3389/fpls.2020.00947.

Descripción

Título: Combining Partially Overlapping Multi-Omics Data in Databases Using Relationship Matrices
Autor/es:
Tipo de Documento: Artículo
Título de Revista/Publicación: Frontiers in Plant Science
Fecha: 14 Julio 2020
ISSN: 1664462X
Volumen: 11
Número: 947
Materias:
Palabras Clave Informales: Accuracy; Association; COVARIANCE ESTIMATION; expectation-maximization; Genomic Selection; GENOTYPE IMPUTATION; HAPLOTYPE-PHASE INFERENCE; Interoperability; Mixed Models; Multi-omics; Multiple kernel learning; Pedigree; phenomics; Plant; Prediction; Regression
Escuela: E.T.S. de Ingeniería Agronómica, Alimentaria y de Biosistemas (UPM)
Departamento: Biotecnología - Biología Vegetal
Licencias Creative Commons: Reconocimiento - Sin obra derivada - No comercial

Texto completo

[thumbnail of 7238074.pdf] PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (5MB)

Resumen

© Copyright © 2020 Akdemir, Knox and Isidro y Sánchez. Private and public breeding programs, as well as companies and universities, have developed different genomics technologies that have resulted in the generation of unprecedented amounts of sequence data, which bring new challenges in terms of data management, query, and analysis. The magnitude and complexity of these datasets bring new challenges but also an opportunity to use the data available as a whole. Detailed phenotype data, combined with increasing amounts of genomic data, have an enormous potential to accelerate the identification of key traits to improve our understanding of quantitative genetics. Data harmonization enables cross-national and international comparative research, facilitating the extraction of new scientific knowledge. In this paper, we address the complex issue of combining high dimensional and unbalanced omics data. More specifically, we propose a covariance-based method for combining partial datasets in the genotype to phenotype spectrum. This method can be used to combine partially overlapping relationship/covariance matrices. Here, we show with applications that our approach might be advantageous to feature imputation based approaches; we demonstrate how this method can be used in genomic prediction using heterogeneous marker data and also how to combine the data from multiple phenotypic experiments to make inferences about previously unobserved trait relationships. Our results demonstrate that it is possible to harmonize datasets to improve available information across gene-banks, data repositories, or other data resources.

Más información

ID de Registro: 86505
Identificador DC: https://oa.upm.es/86505/
Identificador OAI: oai:oa.upm.es:86505
URL Portal Científico: https://portalcientifico.upm.es/es/ipublic/item/7238074
Identificador DOI: 10.3389/fpls.2020.00947
URL Oficial: https://www.frontiersin.org/journals/plant-science...
Depositado por: iMarina Portal Científico
Depositado el: 21 Ene 2025 12:24
Ultima Modificación: 21 Ene 2025 12:24