Leveraging Non-Uniform Resources for Parallel Query Processing

Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review

Modular clusters are now composed of non-uniform nodes with different CPUs, disks, or network cards, so that customers can adapt the cluster configuration to changing technologies and to their changing needs. This challenges dataflow parallelism as the primary load-balancing technique of existing parallel database systems. We show in this paper that dataflow parallelism alone is ill-suited for modular clusters because running the same operation on different subsets of the data cannot fully utilize non-uniform hardware resources. We propose and evaluate new load-balancing techniques that blend pipeline parallelism with data parallelism. We consider relational operators as pipelines of fine-grained operations that can be located on different cluster nodes and executed in parallel on different data subsets to best exploit non-uniform resources. We present an experimental study that confirms the feasibility and effectiveness of the new techniques in a parallel execution engine prototype based on the open-source DBMS Predator.
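The abstract's central argument can be illustrated with a toy cost model. The sketch below is not taken from the paper: the `Node` type, the two-stage scan-then-process operator, the rate figures, and the decision to ignore network transfer costs are all assumptions made for illustration. Under pure data parallelism each node runs the whole operator on its own partition, so it is limited by its weakest resource. If the fine-grained stages may instead be placed on different nodes, the cluster is limited only by its aggregate capacity per resource.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    """Hypothetical cluster node; rates are illustrative, in tuples per second."""
    name: str
    disk_rate: float  # how fast the node can scan tuples from disk
    cpu_rate: float   # how fast the node can process scanned tuples


def data_parallel_throughput(nodes):
    """Each node runs scan and process on its own data partition.

    A node's throughput is capped by its slower resource, and the
    cluster throughput is the sum over nodes (assumes the data is
    partitioned in proportion to each node's capability).
    """
    return sum(min(n.disk_rate, n.cpu_rate) for n in nodes)


def blended_throughput(nodes):
    """Scan and process stages may run on different nodes.

    Assumption: network transfer is free, so the only limits are the
    aggregate disk capacity and the aggregate CPU capacity.
    """
    total_disk = sum(n.disk_rate for n in nodes)
    total_cpu = sum(n.cpu_rate for n in nodes)
    return min(total_disk, total_cpu)


# A deliberately non-uniform two-node cluster (made-up numbers).
CLUSTER = [
    Node("disk_heavy", disk_rate=100.0, cpu_rate=20.0),
    Node("cpu_heavy", disk_rate=20.0, cpu_rate=100.0),
]

if __name__ == "__main__":
    print("data parallelism only:", data_parallel_throughput(CLUSTER))
    print("pipeline + data parallelism:", blended_throughput(CLUSTER))
```

With these made-up rates the data-parallel plan reaches 40 tuples per second while the blended plan reaches 120, because the disk-heavy node can feed tuples to the CPU-heavy node. A real system would also have to account for network bandwidth, which this sketch deliberately leaves out.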
Original language: English
Title of host publication: Third IEEE International Symposium on Cluster Computing and the Grid
Publication date: 2003
Publication status: Published - 2003
Event: CCGrid 2003
Duration: 29 Nov 2010 → …

Conference

Conference: CCGrid 2003
Period: 29/11/2010 → …

ID: 3185413