StreamCloud: An elastic and scalable data streaming system

Gulisano, Vincenzo Massimiliano, Jiménez Peris, Ricardo, Patiño Martínez, Marta ORCID: https://orcid.org/0000-0003-2997-3722, Soriente, Claudio and Valduriez, Patrick (2012). StreamCloud: An elastic and scalable data streaming system. "IEEE Transactions on Parallel And Distributed Systems", v. 23 (n. 12); pp. 2351-2365. ISSN 1045-9219. https://doi.org/10.1109/TPDS.2012.24.

Descripción

Título: StreamCloud: An elastic and scalable data streaming system
Autor/es:
  • Gulisano, Vincenzo Massimiliano
  • Jiménez Peris, Ricardo
  • Patiño Martínez, Marta https://orcid.org/0000-0003-2997-3722
  • Soriente, Claudio
  • Valduriez, Patrick
Tipo de Documento: Artículo
Título de Revista/Publicación: IEEE Transactions on Parallel And Distributed Systems
Fecha: Diciembre 2012
ISSN: 1045-9219
Volumen: 23
Número: 12
Materias:
ODS:
Palabras Clave Informales: Data streaming, scalability, elasticity, flujo de datos, escalabilidad, elasticidad.
Escuela: Facultad de Informática (UPM) [antigua denominación]
Departamento: Lenguajes y Sistemas Informáticos e Ingeniería del Software
Licencias Creative Commons: Reconocimiento - Sin obra derivada - No comercial

Texto completo

[thumbnail of INVE_MEM_2012_137816.pdf]
Vista Previa
PDF (Portable Document Format) - Se necesita un visor de ficheros PDF, como GSview, Xpdf o Adobe Acrobat Reader
Descargar (827kB) | Vista Previa

Resumen

Many applications in several domains such as telecommunications, network security, large scale sensor networks, require online processing of continuous data lows. They produce very high loads that requires aggregating the processing capacity of many nodes. Current Stream Processing Engines do not scale with the input load due to single-node bottlenecks. Additionally, they are based on static con?gurations that lead to either under or over-provisioning. In this paper, we present StreamCloud, a scalable and elastic stream processing engine for processing large data stream volumes. StreamCloud uses a novel parallelization technique that splits queries into subqueries that are allocated to independent sets of nodes in a way that minimizes the distribution overhead. Its elastic protocols exhibit low intrusiveness, enabling effective adjustment of resources to the incoming load. Elasticity is combined with dynamic load balancing to minimize the computational resources used. The paper presents the system design, implementation and a thorough evaluation of the scalability and elasticity of the fully implemented system.

Proyectos asociados

Tipo
Código
Acrónimo
Responsable
Título
FP7
257475
MASSIF
Sin especificar
MAnagement of Security information and events in Service InFrastructures

Más información

ID de Registro: 16848
Identificador DC: https://oa.upm.es/16848/
Identificador OAI: oai:oa.upm.es:16848
URL Portal Científico: https://portalcientifico.upm.es/es/ipublic/item/5487865
Identificador DOI: 10.1109/TPDS.2012.24
URL Oficial: http://www.computer.org/csdl/trans/td/2012/12/ttd2...
Depositado por: Memoria Investigacion
Depositado el: 05 Sep 2013 17:18
Ultima Modificación: 12 Nov 2025 00:00