DisMaNET: A network-based tool to cross map disease vocabularies

García del Valle, Eduardo Pantaleón, Lagunes García, Gerardo, Prieto Santamaría, Lucía ORCID: https://orcid.org/0000-0003-1545-3515, Zanin, Massimiliano, Menasalvas Ruiz, Ernestina ORCID: https://orcid.org/0000-0002-5615-6798 and Rodríguez González, Alejandro ORCID: https://orcid.org/0000-0001-8801-4762 (2021). DisMaNET: A network-based tool to cross map disease vocabularies. "Computer Methods And Programs in Biomedicine", v. 207 ; pp. 1-22. ISSN 0169-2607. https://doi.org/10.1016/j.cmpb.2021.106233.

Descripción

Título: DisMaNET: A network-based tool to cross map disease vocabularies
Autor/es:
Tipo de Documento: Artículo
Título de Revista/Publicación: Computer Methods And Programs in Biomedicine
Fecha: 2021
ISSN: 0169-2607
Volumen: 207
Materias:
ODS:
Palabras Clave Informales: Disease vocabularies, Data interoperability, Disease cross-mapping, Graph database, Network analysis
Escuela: E.T.S. de Ingenieros Informáticos (UPM)
Departamento: Otro
Licencias Creative Commons: Reconocimiento - Sin obra derivada - No comercial

Texto completo

[thumbnail of INVE_MEM_2021_338247.pdf]
Vista Previa
PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (908kB) | Vista Previa

Resumen

Background and Objectives The growing integration of healthcare sources is improving our understanding of diseases. Crossmapping resources such as UMLS play a very important role in this area, but their coverage is still incomplete. With the aim to facilitate the integration and interoperability of biological, clinical and literary sources in studies of diseases, we built DisMaNET, a system to cross-map terms from disease vocabularies by leveraging the power and intuitiveness of network analysis. Methods First, we collected and normalized data from 8 disease vocabularies and mapping sources to generate our datasets. Next, we built DisMaNET by integrating the generated datasets into a Neo4j graph database. Then we exploited the query mechanisms of Neo4j to cross-map disease terms of different vocabularies with a relevance score metric and contrasted the results with some state-of-the-art solutions. Finally, we made our system publicly available for its exploitation and evaluation both through a graphical user interface and REST APIs. Results DisMaNET contains almost half a million nodes and near nine hundred thousand edges, including hierarchical and mapping relationships. Its query capabilities enabled the detection of connections between disease vocabularies that are not present in major mapping sources such as UMLS and the Disease Ontology, even for rare diseases. Furthermore, DisMaNET was capable of obtaining more than 80% of the mappings with UMLS reported in MonDO and DisGeNET. Our tool was used successfully to complete the missing mappings in DISNET, a web-based system designed to extract knowledge from signs and symptoms retrieved from medical databases. Conclusions DisMaNET is a powerful, intuitive and publicly available system to cross-map terms from different disease vocabularies. Its completeness and the potential of network analysis make it a competitive alternative to existing mapping systems. Expansion with new sources, versioning and the improvement of the search and scoring algorithms are envisioned as future lines of work.

Proyectos asociados

Tipo
Código
Acrónimo
Responsable
Título
Comunidad de Madrid
IND2019/TIC-17159
Sin especificar
Universidad Politécnica de Madrid
Programa de fomento de la investigación y la innovación (Doctorados Industriales)
Gobierno de España
RTI2018-094576-A-I00
DISNET
Sin especificar
Creation and analysis of disease networks for drug repurposing from heterogeneous data sources applied to rare diseases
Horizonte 2020
851255
ARCTIC
Agencia Estatal Consejo Superior de Investigaciones Científicas
Air Transport as Information and Computation
Gobierno de España
MDM-2017-0711
Sin especificar
Sin especificar
Sin especificar

Más información

ID de Registro: 67857
Identificador DC: https://oa.upm.es/67857/
Identificador OAI: oai:oa.upm.es:67857
URL Portal Científico: https://portalcientifico.upm.es/es/ipublic/item/9342960
Identificador DOI: 10.1016/j.cmpb.2021.106233
URL Oficial: https://doi.org/10.1016/j.cmpb.2021.106233
Depositado por: Memoria Investigacion
Depositado el: 22 Sep 2021 10:27
Ultima Modificación: 12 Nov 2025 00:00