Improving grid fault tolerance by means of global behavior modeling.

Montes, Jesús; Sánchez, Alberto and Pérez Hernández, María de los Santos (2010). Improving grid fault tolerance by means of global behavior modeling. In: "Ninth International Symposium on Parallel and Distributed Computing, 2010", 07/07/2010 - 09/07/2010, Istanbul, Turkey. ISBN 978-0-7695-4120-4.

Description

Title: Improving grid fault tolerance by means of global behavior modeling.
Author(s):
  • Montes, Jesús
  • Sánchez, Alberto
  • Pérez Hernández, María de los Santos
Document Type: Conference or Workshop Paper (Article)
Event Title: Ninth International Symposium on Parallel and Distributed Computing, 2010
Event Dates: 07/07/2010 - 09/07/2010
Event Location: Istanbul, Turkey
Book Title: Proceedings of the Ninth International Symposium on Parallel and Distributed Computing, 2010
Date: 2010
ISBN: 978-0-7695-4120-4
Subjects:
School: Facultad de Informática (UPM) [former name]
Department: Arquitectura y Tecnología de Sistemas Informáticos
Creative Commons Licenses: Attribution - NoDerivatives - NonCommercial

Abstract

Grid systems have proved to be one of the most important new alternatives for tackling challenging problems but, to exploit their benefits, dependability and fault tolerance are key aspects. However, the vast complexity of these systems limits the efficiency of traditional fault tolerance techniques. It seems necessary to distinguish between resource-level fault tolerance (focused on each individual machine) and service-level fault tolerance (focused on global behavior). Techniques based on these concepts can handle system complexity and increase dependability. We present an autonomous, self-adaptive fault tolerance framework for grid systems, based on a new approach to modeling distributed environments. The grid is considered as a single entity, instead of a set of independent resources. This point of view focuses on service-level fault tolerance, allowing us to see the big picture and understand the system's global behavior. The simplicity of the resulting model is the key to providing system-wide fault tolerance.

More information

Record ID: 6852
DC Identifier: http://oa.upm.es/6852/
OAI Identifier: oai:oa.upm.es:6852
Official URL: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5532500&tag=1
Deposited by: Memoria Investigacion
Deposited on: 04 May 2011 10:32
Last Modified: 20 Apr 2016 15:59