High Performance Algorithms and Implementations Using Sparse and Parallelization Techniques on MBS

Hidalgo, Andrés Francisco and García de Jalón de la Fuente, Francisco Javier and Tapia Fernández, Santiago (2011). High Performance Algorithms and Implementations Using Sparse and Parallelization Techniques on MBS. In: "Multibody Dynamics 2011, ECCOMAS", 04/07/2011 - 07/07/2011, Bruselas, Bélgica. pp. 19-38.

Description

Title: High Performance Algorithms and Implementations Using Sparse and Parallelization Techniques on MBS
Author/s:
  • Hidalgo, Andrés Francisco
  • García de Jalón de la Fuente, Francisco Javier
  • Tapia Fernández, Santiago
Item Type: Presentation at Congress or Conference (Article)
Event Title: Multibody Dynamics 2011, ECCOMAS
Event Dates: 04/07/2011 - 07/07/2011
Event Location: Bruselas, Bélgica
Title of Book: Proceedings of Multibody Dynamics 2011, ECCOMAS
Date: 2011
Subjects:
Faculty: E.T.S.I. Industriales (UPM)
Department: Automática, Ingeniería Electrónica e Informática Industrial [hasta 2014]
Creative Commons Licenses: Recognition - No derivative works - Non commercial

Full text

[img]
Preview
PDF - Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (1MB) | Preview

Abstract

In this paper we will see how the efficiency of the MBS simulations can be improved in two different ways, by considering both an explicit and implicit semi-recursive formulation. The explicit method is based on a double velocity transformation that involves the solution of a redundant but compatible system of equations. The high computational cost of this operation has been drastically reduced by taking into account the sparsity pattern of the system. Regarding this, the goal of this method is the introduction of MA48, a high performance mathematical library provided by Harwell Subroutine Library. The second method proposed in this paper has the particularity that, depending on the case, between 70 and 85% of the computation time is devoted to the evaluation of forces derivatives with respect to the relative position and velocity vectors. Keeping in mind that evaluating these derivatives can be decomposed into concurrent tasks, the main goal of this paper lies on a successful and straightforward parallel implementation that have led to a substantial improvement with a speedup of 3.2 by keeping all the cores busy in a quad-core processor and distributing the workload between them, achieving on this way a huge time reduction by doing an ideal CPU usage

More information

Item ID: 13244
DC Identifier: http://oa.upm.es/13244/
OAI Identifier: oai:oa.upm.es:13244
Official URL: http://sites-test.uclouvain.be/multibody2011/
Deposited by: Memoria Investigacion
Deposited on: 29 Nov 2012 08:22
Last Modified: 26 Sep 2017 15:13
  • Logo InvestigaM (UPM)
  • Logo GEOUP4
  • Logo Open Access
  • Open Access
  • Logo Sherpa/Romeo
    Check whether the anglo-saxon journal in which you have published an article allows you to also publish it under open access.
  • Logo Dulcinea
    Check whether the spanish journal in which you have published an article allows you to also publish it under open access.
  • Logo de Recolecta
  • Logo del Observatorio I+D+i UPM
  • Logo de OpenCourseWare UPM