StreamCloud: An elastic and scalable data streaming system

Gulisano, Vincenzo Massimiliano and Jiménez-Peris, Ricardo and Patiño-Martínez, M. and Soriente, Claudio and Valduriez, Patrick (2012). StreamCloud: An elastic and scalable data streaming system. "IEEE Transactions on Parallel And Distributed Systems", v. 23 (n. 12); pp. 2351-2365. ISSN 1045-9219. https://doi.org/10.1109/TPDS.2012.24.

Description

Title: StreamCloud: An elastic and scalable data streaming system
Author/s:
  • Gulisano, Vincenzo Massimiliano
  • Jiménez-Peris, Ricardo
  • Patiño-Martínez, M.
  • Soriente, Claudio
  • Valduriez, Patrick
Item Type: Article
Journal/Publication Title: IEEE Transactions on Parallel And Distributed Systems
Date: December 2012
ISSN: 1045-9219
Volume: 23
Subjects:
Freetext Keywords: Data streaming, scalability, elasticity.
Faculty: Facultad de Informática (UPM)
Department: Lenguajes y Sistemas Informáticos e Ingeniería del Software
Creative Commons Licenses: Attribution - NoDerivatives - NonCommercial

Abstract

Many applications in domains such as telecommunications, network security, and large-scale sensor networks require online processing of continuous data flows. They produce very high loads that require aggregating the processing capacity of many nodes. Current Stream Processing Engines do not scale with the input load due to single-node bottlenecks. Additionally, they are based on static configurations that lead to either under- or over-provisioning. In this paper, we present StreamCloud, a scalable and elastic stream processing engine for processing large data stream volumes. StreamCloud uses a novel parallelization technique that splits queries into subqueries that are allocated to independent sets of nodes in a way that minimizes the distribution overhead. Its elastic protocols exhibit low intrusiveness, enabling effective adjustment of resources to the incoming load. Elasticity is combined with dynamic load balancing to minimize the computational resources used. The paper presents the system design, implementation, and a thorough evaluation of the scalability and elasticity of the fully implemented system.

Funding Projects

Type: FP7
Code: 257475
Acronym: MASSIF
Leader: Unspecified
Title: MAnagement of Security information and events in Service InFrastructures

More information

Item ID: 16848
DC Identifier: https://oa.upm.es/16848/
OAI Identifier: oai:oa.upm.es:16848
DOI: 10.1109/TPDS.2012.24
Official URL: http://www.computer.org/csdl/trans/td/2012/12/ttd2...
Deposited by: Memoria Investigacion
Deposited on: 05 Sep 2013 17:18
Last Modified: 04 Nov 2014 11:35