[News] BIRTE Presentation, SoCC Poster, and VLDB Participation

Anatoli Shein presented a poster at the ACM Symposium on Cloud Computing 2015 (SoCC'15) and a paper at the 9th International Workshop on Business Intelligence for Real Time Enterprise (BIRTE'15). Both his poster and paper were entitled Processing of Aggregate Continuous Queries in a Distributed Environment on the Big Island in Hawaii.
Link

Abstract:
Data Stream Management Systems (DSMSs) performing on-line analytics rely on the efficient execution of large numbers of Aggregate Continuous Queries (ACQs). In this paper, we study the problem of generating high quality execution plans of ACQs in DSMSs deployed on multi-node (multi-core and multi-processor) distributed environments. Towards this goal, we classify optimizers based on how they partition the workload among computing nodes and on their usage of the concept of Weavability, which is utilized by the state-of-the-art WeaveShare optimizer to selectively combine ACQs and produce low cost execution plans for single-node environments. For each category, we propose an optimizer, which either adopts an existing strategy or develops a new one for assigning and grouping ACQs to computing nodes. We implement and experimentally compare all of our proposed optimizers in terms of (1) keeping the total cost of the ACQs execution plan low and (2) balancing the load among the computing nodes. Our extensive experimental evaluation shows that our newly developed Weave-Group to Nodes (WGTN) and Weave-Group Inserted (WGI) optimizers produce plans of significantly higher quality than the rest of the op- timizers. WGTN minimizes the total cost, making it more suitable from a client perspective, and WGI achieves load balancing, making it more suitable from a system perspective.

This paper was also presented as a poster at the SoCC conference colocated with VLDB.