TY - GEN
T1 - Does multicast communication make sense in write invalidation traffic?
AU - Hsiao, Hung Chang
AU - King, Chung Ta
N1 - Publisher Copyright:
© 2000 IEEE
PY - 2000
Y1 - 2000
N2 - In distributed shared-memory (DSM) multiprocessors, a write operation requires multiple messages to invalidate the nodes which share and cache the memory block to be written. The resultants write stall time is a performance hurdle to such systems. One approach to efficient invalidation is to use multicast messages to reach the sharing nodes. In this paper, we use application-driven simulation to evaluate two multicast-based invalidation schemes: dual-path [10] and pruning [11 ]. Based on our experimental settings, we found that multicasts improve invalidation traffic for four of the six evaluated real applications. The remaining two programs are computation intensive, and multicast-based invalidation is less effective. But since they induce bursty communication, we found that multicasts help to relieve the network congestion during those periods of time. Dual-path performs a little better than pruning, because it is less sensitive to routing delay in the routers. We also found that cache size is an important design parameter for multicast-based invalidation. It is more effective for DSM multiprocessors with larger caches.
AB - In distributed shared-memory (DSM) multiprocessors, a write operation requires multiple messages to invalidate the nodes which share and cache the memory block to be written. The resultants write stall time is a performance hurdle to such systems. One approach to efficient invalidation is to use multicast messages to reach the sharing nodes. In this paper, we use application-driven simulation to evaluate two multicast-based invalidation schemes: dual-path [10] and pruning [11 ]. Based on our experimental settings, we found that multicasts improve invalidation traffic for four of the six evaluated real applications. The remaining two programs are computation intensive, and multicast-based invalidation is less effective. But since they induce bursty communication, we found that multicasts help to relieve the network congestion during those periods of time. Dual-path performs a little better than pruning, because it is less sensitive to routing delay in the routers. We also found that cache size is an important design parameter for multicast-based invalidation. It is more effective for DSM multiprocessors with larger caches.
UR - https://www.scopus.com/pages/publications/85114019324
UR - https://www.scopus.com/pages/publications/85114019324#tab=citedBy
U2 - 10.1109/ICPADS.2000.857702
DO - 10.1109/ICPADS.2000.857702
M3 - Conference contribution
AN - SCOPUS:85114019324
T3 - Proceedings - 7th International Conference on Parallel and Distributed Systems: Workshops
SP - 221
EP - 228
BT - Proceedings - 7th International Conference on Parallel and Distributed Systems
A2 - Shibata, Yoshitaka
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th International Conference on Parallel and Distributed Systems, ICPADS 2000
Y2 - 4 July 2000 through 7 July 2000
ER -