TY - GEN
T1 - Data redistribution using MPI user-defined types
AU - Yang, Chu Sing
AU - Bai, Sheng Wen
N1 - Publisher Copyright:
© 2002 IEEE.
PY - 2002
Y1 - 2002
N2 - In many parallel programs, run-time data redistribution is usually required to enhance data locality and reduce remote memory access on the distributed memory multicomputers. Recently researches in data redistribution algorithm have become very mature. The time required to generate data sets and processor sets is much lesser then before. That means packing/unpacking becomes a relatively heavy cost in the redistribution. In this paper we present methods to perform BLOCK-CYCLIC(s) to BLOCK-CYCLIC(t) redistribution using MPI user-defined types. In this approach, we can reduce the requirement of memory buffers and avoid unnecessary data-movement. The theoretical models are presented to determine the best method for redistribution. To evaluate the performance of the proposed methods, we have implemented our methods on an IBM SP2 parallel machine. The experimental results show that this approach can obviously improve the performance of redistribution in most cases.
AB - In many parallel programs, run-time data redistribution is usually required to enhance data locality and reduce remote memory access on the distributed memory multicomputers. Recently researches in data redistribution algorithm have become very mature. The time required to generate data sets and processor sets is much lesser then before. That means packing/unpacking becomes a relatively heavy cost in the redistribution. In this paper we present methods to perform BLOCK-CYCLIC(s) to BLOCK-CYCLIC(t) redistribution using MPI user-defined types. In this approach, we can reduce the requirement of memory buffers and avoid unnecessary data-movement. The theoretical models are presented to determine the best method for redistribution. To evaluate the performance of the proposed methods, we have implemented our methods on an IBM SP2 parallel machine. The experimental results show that this approach can obviously improve the performance of redistribution in most cases.
UR - https://www.scopus.com/pages/publications/84961832080
UR - https://www.scopus.com/pages/publications/84961832080#tab=citedBy
U2 - 10.1109/CW.2002.1180859
DO - 10.1109/CW.2002.1180859
M3 - Conference contribution
AN - SCOPUS:84961832080
T3 - Proceedings - 1st International Symposium on Cyber Worlds, CW 2002
SP - 47
EP - 53
BT - Proceedings - 1st International Symposium on Cyber Worlds, CW 2002
A2 - Peng, Shietung
A2 - Yukita, Shuichi
A2 - Savchenko, Vladimir V.
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 1st International Symposium on Cyber Worlds, CW 2002
Y2 - 6 November 2002 through 8 November 2002
ER -