Efficiently Handling Skew in Outer Joins on Distributed Systems
(2014)
Presentation / Conference Contribution
Cheng, L., Kotoulas, S., Ward, T., & Theodoropoulos, T. (2014, May). Efficiently Handling Skew in Outer Joins on Distributed Systems. Presented at 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, Chicago, IL, USA
Outer joins are ubiquitous in databases and big data systems. The question of how best to execute outer joins in large parallel systems is particularly challenging as real world datasets are characterized by data skew leading to performance issues. A... Read More about Efficiently Handling Skew in Outer Joins on Distributed Systems.