Full text
Preview |
PDF
- Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (1MB) | Preview |
Córdoba Cabeza, María Luisa ORCID: https://orcid.org/0000-0003-0988-233X, García Dopico, Antonio
ORCID: https://orcid.org/0000-0001-7373-7853, García Clemente, María Isabel
ORCID: https://orcid.org/0000-0002-6408-2506, Rosales García, Francisco Javier, Arnaiz García, Jesús, Bermejo Bermejo, Rodolfo
ORCID: https://orcid.org/0000-0003-2326-2073 and Galán del Sastre, Pedro
ORCID: https://orcid.org/0000-0002-4600-9954
(2014).
Efficient parallelization of a regional ocean model for the western Mediterranean Sea.
"International Journal of High Performance Computing Applications", v. 28
(n. 3);
pp. 368-383.
ISSN 1094-3420.
https://doi.org/10.1177/1094342013512344.
Title: | Efficient parallelization of a regional ocean model for the western Mediterranean Sea |
---|---|
Author/s: |
|
Item Type: | Article |
Título de Revista/Publicación: | International Journal of High Performance Computing Applications |
Date: | 2014 |
ISSN: | 1094-3420 |
Volume: | 28 |
Subjects: | |
Freetext Keywords: | Message passing; Shared memory; Hybrid programming; Multicore; Clusters; Ocean model; Finite element method |
Faculty: | E.T.S. de Ingenieros Informáticos (UPM) |
Department: | Arquitectura y Tecnología de Sistemas Informáticos |
Creative Commons Licenses: | Recognition - No derivative works - Non commercial |
Preview |
PDF
- Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (1MB) | Preview |
This paper focuses on the parallelization of an ocean model applying current multicore processor-based cluster architectures
to an irregular computational mesh. The aim is to maximize the efficiency of the computational resources used.
To make the best use of the resources offered by these architectures, this parallelization has been addressed at all the
hardware levels of modern supercomputers: firstly, exploiting the internal parallelism of the CPU through vectorization;
secondly, taking advantage of the multiple cores of each node using OpenMP; and finally, using the cluster nodes to
distribute the computational mesh, using MPI for communication within the nodes. The speedup obtained with each
parallelization technique as well as the combined overall speedup have been measured for the western Mediterranean
Sea for different cluster configurations, achieving a speedup factor of 73.3 using 256 processors. The results also show
the efficiency achieved in the different cluster nodes and the advantages obtained by combining OpenMP and MPI versus
using only OpenMP or MPI. Finally, the scalability of the model has been analysed by examining computation and communication
times as well as the communication and synchronization overhead due to parallelization.
Item ID: | 40121 |
---|---|
DC Identifier: | https://oa.upm.es/40121/ |
OAI Identifier: | oai:oa.upm.es:40121 |
DOI: | 10.1177/1094342013512344 |
Official URL: | https://uk.sagepub.com/en-gb/eur/journal/internati... |
Deposited by: | Memoria Investigacion |
Deposited on: | 10 May 2016 12:36 |
Last Modified: | 13 Nov 2017 11:38 |