4.4 Article

Anser: Adaptive Information Sharing Framework of AnalyticDB

Journal

PROCEEDINGS OF THE VLDB ENDOWMENT
Volume 16, Issue 12, Pages 3636-3648

Publisher

ASSOC COMPUTING MACHINERY
DOI: 10.14778/3611540.3611553

Keywords

-

Ask authors/readers for more resources

The surge in data analytics has driven the demand for AnalyticDB on Alibaba Cloud, which excels in handling diverse workloads including batch processing, real-time data analytics, and unstructured data analytics. To improve overall performance, the challenge lies in optimizing long-running complex queries without compromising the efficiency of short-running interactive queries. To address this challenge, a new framework called Anser is proposed to enhance the design of traditional distributed data warehouses by incorporating an information sharing mechanism. This framework enables efficient management of dynamic information across the system, leading to novel scheduling policies that optimize both data and information exchanges within the physical plan.
The surge in data analytics has fostered burgeoning demand for AnalyticDB on Alibaba Cloud, which has well served thousands of customers from various business sectors. The most notable feature is the diversity of the workloads it handles, including batch processing, real-time data analytics, and unstructured data analytics. To improve the overall performance for such diverse workloads, one of the major challenges is to optimize long-running complex queries without sacri.cing the processing e.ciency of short-running interactive queries. While existing methods attempt to utilize runtime dynamic statistics for adaptive query processing, they often focus on speci.c scenarios instead of providing a holistic solution. To address this challenge, we propose a new framework called Anser, which enhances the design of traditional distributed data warehouses by embedding a new information sharing mechanism. This allows for the e.cient management of the production and consumption of various dynamic information across the system. Building on top of Anser, we introduce a novel scheduling policy that optimizes both data and information exchanges within the physical plan, enabling the acceleration of complex analytical queries without sacri.cing the performance of short-running interactive queries. We conduct comprehensive experiments over public and in-house workloads to demonstrate the e.ectiveness and e.ciency of our proposed information sharing framework.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available