Welcome to the ADMT Publication Server

Processing of Aggregate Continuous Queries in a Distributed Environment

DocUID: 2015-008 Full Text: PDF

Author: Anatoli U. Shein, Panos K. Chrysanthis, Alexandros Labrinidis

Abstract: Data Stream Management Systems (DSMSs) performing on-line analytics rely on the efficient execution of large numbers of Aggregate Continuous Queries (ACQs). In this paper, we study the problem of generating high quality execution plans of ACQs in DSMSs deployed on multi-node (multi-core and multi-processor) distributed environments. Towards this goal, we classify optimizers based on how they partition the workload among computing nodes and on their usage of the concept of Weavability, which is utilized by the state-of-the-art WeaveShare optimizer to selectively combine ACQs and produce low cost execution plans for single-node environments. For each category, we propose an optimizer, which either adopts an existing strategy or develops a new one for assigning and grouping ACQs to computing nodes. We implement and experimentally compare all of our proposed optimizers in terms of (1) keeping the total cost of the ACQs execution plan low and (2) balancing the load among the computing nodes. Our extensive experimental evaluation shows that our newly developed Weave-Group to Nodes (WGTN) and Weave-Group Inserted (WGI) optimizers produce plans of significantly higher quality than the rest of the optimizers. WGTN minimizes the total cost, making it more suitable from a client perspective, and WGI achieves load balancing, making it more suitable from a system perspective.

Published In: Proceedings of the 9th International Workshop on Business Intelligence for the Real Time Enterprise

Place Published: Hawaii, USA

Year Published: 2015

Note: Web Proceedings: http://db.cs.pitt.edu/birte2015

Project: AQSIOS Subject Area: Data Streams

Publication Type: Workshop Paper

Sponsor: NSF CBET-1250171

Citation:Text Latex BibTex XML Anatoli U. Shein, Panos K. Chrysanthis, and Alexandros Labrinidis. Processing of Aggregate Continuous Queries in a Distributed Environment. Proceedings of the 9th International Workshop on Business Intelligence for the Real Time Enterprise. 2015. Hawaii, USA. (Note: Web Proceedings: http://db.cs.pitt.edu/birte2015).

Similar Publications: