Dynamic Load Balancing in Multicomputer Database Systems Using Partition Tuning

Kien A. Hua, Chiang Lee, Chau M. Hua

Research output: Contribution to journalArticlepeer-review

34 Citations (Scopus)

Abstract

Shared nothing multiprocessor architecture is known to be more scalable to support very large databases. Compared to other join strategies, a hash-based join algorithm is particularly efficient and easily parallelized for this computation model. However, this hardware structure is very sensitive to the skew in tuple distribution. Unless the parallel hash join algorithm includes some dynamic load balancing mechanism, the skew effect can severely deteriorate the system performance. In this paper, we investigate this issue. In particular, three parallel hash join algorithms are presented. We implement a simulator to study the effectiveness of these schemes. The simulation model is validated by comparing the simulation results to those produced by the actual implementation of the algorithms running on a multiprocessor system. Our performance study indicates that a naive approach is not able to provide tangible savings. However, the carefully designed strategies can offer substantial improvement over conventional techniques for a wide range of skew conditions.

Original languageEnglish
Pages (from-to)968-983
Number of pages16
JournalIEEE Transactions on Knowledge and Data Engineering
Volume7
Issue number6
DOIs
Publication statusPublished - 1995 Dec

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Computer Science Applications
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'Dynamic Load Balancing in Multicomputer Database Systems Using Partition Tuning'. Together they form a unique fingerprint.

Cite this