And synopses for all: A synopses data engine for extreme scale analytics-as-a-service

Publication
Inf. Syst.

When analyzing big data, it is often necessary to work with data synopses, approximate summaries of the data that come with guarantees. We propose a novel synopsis-as-a-service paradigm and design a Synopses Data Engine as a Service (SDEaaS) system, built on top of Apache Flink, that combines the virtues of parallel processing and stream summarization towards delivering interactive analytics at extreme scale.

image
Architecture of the Synopses Data Engine as a Service.