Object

Title: The parallel tiled WZ factorization algorithm for multicore architectures

Creator:

Bylina, Beata ; Bylina, Jarosław

Date:

2019

Resource Type:

artykuł

Contributor:

Kobusińska, Anna - ed. ; Hsu, Ching-Hsien - ed. ; Lin, Kwei-Jay - ed.

Subtitle:

.

Group publication title:

AMCS, volume 29 (2019)

Abstract:

The aim of this paper is to investigate dense linear algebra algorithms on shared memory multicore architectures. The design and implementation of a parallel tiled WZ factorization algorithm which can fully exploit such architectures are presented. Three parallel implementations of the algorithm are studied. The first one relies only on exploiting multithreaded BLAS (basic linear algebra subprograms) operations. The second implementation, except for BLAS operations, employs the OpenMP standard to use the loop-level parallelism. The third implementation, except for BLAS operations, employs the OpenMP task directive with the depend clause. ; We report the computational performance and the speedup of the parallel tiled WZ factorization algorithm on shared memory multicore architectures for dense square diagonally dominant matrices. Then we compare our parallel implementations with the respective LU factorization from a vendor implemented LAPACK library. We also analyze the numerical accuracy. Two of our implementations can be achieved with near maximal theoretical speedup implied by Amdahl`s law.

Publisher:

Zielona Góra: Uniwersytet Zielonogórski

Resource Identifier:

oai:zbc.uz.zgora.pl:85983

DOI:

10.2478/amcs-2019-0030

Pages:

407-419

Source:

AMCS, volume 29, number 2 (2019) ; click here to follow the link

Language:

eng

License CC BY 4.0:

click here to follow the link

Rights:

Biblioteka Uniwersytetu Zielonogórskiego

×

Citation

Citation style:

This page uses 'cookies'. More information