Broadcast-based parallel LU factorization

Bibliographic Details
Main Author: Tinetti, Fernando Gustavo
Other Authors or Contributors: De Giusti, Armando Eduardo
Format: Book chapter
Language: Spanish
Subjects: BROADCAST
Online access: http://www.springerlink.com/content/5wtvqnqjkvaa7tuh/
Abstract: This paper presents a parallel LU factorization algorithm designed to take advantage of physical broadcast communication facilities as well as the overlapping of communication with computing. Physical broadcast is directly available on Ethernet network hardware, one of the most widely used interconnection networks in current clusters installed for parallel computing. Overlapped communication is a well-known strategy for hiding communication latency, which is one of the most common sources of parallel performance penalties. Performance analysis and experimentation for the proposed parallel LU factorization algorithm are presented. The performance of the proposed algorithm is also compared with that of the algorithm used in ScaLAPACK (Scalable LAPACK), which is commonly accepted as having optimized performance.
Notes: File format: PDF. -- This document is an intellectual production of the Facultad de Informática-UNLP (BIPA Collection / Library). -- Also available online (accessed 09/05/2011)

MARC

LEADER 00000naa a2200000 a 4500
003 AR-LpUFIB
005 20250423183023.0
008 230201s2005 ag o 000 0 spa d
024 8 |a DIF-M3134  |b 3237  |z DIF003044 
040 |a AR-LpUFIB  |b spa  |c AR-LpUFIB 
100 1 |a Tinetti, Fernando Gustavo  |9 44771 
245 1 0 |a Broadcast-based parallel LU factorization 
500 |a File format: PDF. -- This document is an intellectual production of the Facultad de Informática-UNLP (BIPA Collection / Library). -- Also available online (accessed 09/05/2011) 
520 |a This paper presents a parallel LU factorization algorithm designed to take advantage of physical broadcast communication facilities as well as the overlapping of communication with computing. Physical broadcast is directly available on Ethernet network hardware, one of the most widely used interconnection networks in current clusters installed for parallel computing. Overlapped communication is a well-known strategy for hiding communication latency, which is one of the most common sources of parallel performance penalties. Performance analysis and experimentation for the proposed parallel LU factorization algorithm are presented. The performance of the proposed algorithm is also compared with that of the algorithm used in ScaLAPACK (Scalable LAPACK), which is commonly accepted as having optimized performance. 
534 |a Euro-Par 2005 Parallel Processing (11th : 2005 : Lisbon) 
650 4 |a BROADCAST  |9 45695 
700 1 |a De Giusti, Armando Eduardo  |9 43366 
856 4 0 |u http://www.springerlink.com/content/5wtvqnqjkvaa7tuh/ 
942 |c CP 
952 |0 0  |1 0  |4 0  |6 A0245  |7 3  |8 BD  |9 77729  |a DIF  |b DIF  |d 2025-03-11  |l 0  |o A0245  |r 2025-03-11 17:03:03  |u http://catalogo.info.unlp.edu.ar/meran/getDocument.pl?id=202  |w 2025-03-11  |y CP 
999 |c 52898  |d 52898