Propagation of Densities of Streaming Data within Query Graphs

Beitrag bei einer Tagung
(Originalarbeit)


Details zur Publikation

Autor(en): Daum M, Lauterwald F, Baumgärtel P, Meyer-Wegener K
Auflage: 1st
Titel Sammelwerk: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Verlag: Springer-verlag
Verlagsort: Heidelberg
Jahr der Veröffentlichung: 2010
Tagungsband: Scientific and Statistical Database Management: 22nd International Conference
Seitenbereich: 584-601
ISBN: 978-3-642-13817-1
ISSN: 0302-9743
Sprache: Englisch


Abstract


Data Stream SystemsDSS use cost models to determine if a DSS can cope with a given workload and to optimize query graphs. However, certain relevant input parameters of these models are often unknown or highly imprecise. Especially selectivities are stream-dependent and application-specific parameters. In this paper, we describe a method that supports selectivity estimation considering input streams' attribute value distribution. The novelty of our approach is the propagation of the probability distributions through the query graph in order to give estimates for the inner nodes of the graph. For most common stream operators, we establish formulas that describe their output distribution as a function of their input distributions. For unknown operators like User-Defined OperatorsUDO, we introduce a method to measure the influence of these operators on arbitrary probability distributions. This method is able to do most of the computational work before the query is deployed and introduces minimal overhead at runtime. Our evaluation framework facilitates the appropriate combination of both methods and allows to model almost arbitrary query graphs. © 2010 Springer-Verlag Berlin Heidelberg.



FAU-Autoren / FAU-Herausgeber

Baumgärtel, Philipp
Lehrstuhl für Informatik 6 (Datenmanagement)
Daum, Michael Dr.-Ing.
Lehrstuhl für Informatik 6 (Datenmanagement)
Lauterwald, Frank
Lehrstuhl für Informatik 6 (Datenmanagement)
Meyer-Wegener, Klaus Prof. Dr.-Ing.
Lehrstuhl für Informatik 6 (Datenmanagement)


Zitierweisen

APA:
Daum, M., Lauterwald, F., Baumgärtel, P., & Meyer-Wegener, K. (2010). Propagation of Densities of Streaming Data within Query Graphs. In Scientific and Statistical Database Management: 22nd International Conference (pp. 584-601). Heidelberg, DE: Heidelberg: Springer-verlag.

MLA:
Daum, Michael, et al. "Propagation of Densities of Streaming Data within Query Graphs." Proceedings of the SSDBM, Heidelberg Heidelberg: Springer-verlag, 2010. 584-601.

BibTeX: 

Zuletzt aktualisiert 2018-09-08 um 22:39