Extracción, cálculo de similitudes y análisis de características biológicas para la creación de redes humanas complejas

Prieto Santamaría, Lucía ORCID: https://orcid.org/0000-0003-1545-3515 (2018). Extracción, cálculo de similitudes y análisis de características biológicas para la creación de redes humanas complejas. Proyecto Fin de Carrera / Trabajo Fin de Grado, E.T.S. de Ingeniería Agronómica, Alimentaria y de Biosistemas (UPM), Madrid.

Description

Title: Extracción, cálculo de similitudes y análisis de características biológicas para la creación de redes humanas complejas
Author/s:
Contributor/s:
Item Type: Final Project
Degree: Grado en Biotecnología
Date: 27 June 2018
Subjects:
Faculty: E.T.S. de Ingeniería Agronómica, Alimentaria y de Biosistemas (UPM)
Department: Lenguajes y Sistemas Informáticos e Ingeniería del Software
UPM's Research Group: Minería de Datos y Simulación (MIDAS) MIDAS
Creative Commons Licenses: None

Full text

[thumbnail of TFG_LUCIA_PRIETO_SANTAMARÍA.pdf]
Preview
PDF - Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (2MB) | Preview

Abstract

The increasing availability of biological data is improving the understanding of diseases and providing new insight into their underlying relationships. A recent hypothesis claims that two diseases that share genes, symptoms or other molecular mechanisms, could have unknown relationships. The connection between pairs of diseases can be used for drug repositioning by checking whether they have biological relationships, and if the drug used to treat any of the diseases could affect some of the biological traits in the relationship. To evaluate these connections and their potential use for drug repositioning, complex human disease networks based on molecular information have been created. Relationships between diseases in complex human disease networks can be formed using different biological traits, such as symptoms, genes, or protein-protein interactions (PPIs). Thanks to these complex human disease networks, a greater understanding of the functioning of the diseases will be accomplished, and the analysis of these networks with computational approaches will allow this understanding to reach a larger scale. The DISNET project, in which is involved the current project, aims to build a comprehensive complex human disease network in three levels: the phenotypical layer, the genetical or biological layer and the drugs layer. The current project pretends to construct the DISNET biological layer programmatically in the most possible automatized way. Therefore, the state of the art of the current biological public databases is analyzed, in order to access to them and extract the necessary information to build the network. Genes, proteins, metabolic pathways and PPIs are selected to be the biological traits to create it. The process to access the resources and obtain the data is developed by Python scripts. Genes and proteins are queried to DisGeNET, metabolic pathways are queried to WikiPathways and PPIs are queried to IntAct. This data is structured in a relational way and stored in the same database of the DISNET phenotypical layer. A method to compute the similarities between diseases and the analysis of these similarities is also included. Disease similarity is represented by three metrics: cosine similarity, Jaccard index and Dice coefficient, which are computed for each combination of two diseases without repetition and based in each trait. The obtained values of these metrics are compared to the ones obtained in the phenotypical layer. In future lines, machine learning techniques will be applied to these results to obtain new information from the data.

More information

Item ID: 52568
DC Identifier: https://oa.upm.es/52568/
OAI Identifier: oai:oa.upm.es:52568
Deposited by: Lucía Prieto Santamaría
Deposited on: 10 Oct 2018 05:22
Last Modified: 10 Oct 2018 05:22
  • Logo InvestigaM (UPM)
  • Logo GEOUP4
  • Logo Open Access
  • Open Access
  • Logo Sherpa/Romeo
    Check whether the anglo-saxon journal in which you have published an article allows you to also publish it under open access.
  • Logo Dulcinea
    Check whether the spanish journal in which you have published an article allows you to also publish it under open access.
  • Logo de Recolecta
  • Logo del Observatorio I+D+i UPM
  • Logo de OpenCourseWare UPM