Mark L. McAuliffe, Michael J. Carey, et al.
SIGPLAN Notices (ACM Special Interest Group on Programming Languages)
In this paper, we re-examine the results of prior work on methods for computing ad hoc joins. We develop a detailed cost model for predicting join algorithm performance, and we use the model to develop cost formulas for the major ad hoc join methods found in the relational database literature. We show that various pieces of "common wisdom" about join algorithm performance fail to hold up when analyzed carefully, and we use our detailed cost model to derive optimal buffer allocation schemes for each of the join methods examined here. We show that optimizing their buffer allocations can lead to large performance improvements, e.g., as much as a 400% improvement in some cases. We also validate our cost model's predictions by measuring an actual implementation of each join algorithm considered. The results of this work should be directly useful to implementors of relational query optimizers and query processing systems.
Mark L. McAuliffe, Michael J. Carey, et al.
SIGPLAN Notices (ACM Special Interest Group on Programming Languages)
Laura M. Haas, J.C. Freytag, et al.
SIGMOD 1989
Boris Glavic, Gustavo Alonso, et al.
VLDB
Laura M. Haas, Patricia G. Selinger
SIGMOD Record