On the Expressiveness of Languages for Complex Event Recognition

Authors Alejandro Grez, Cristian Riveros, Martín Ugarte, Stijn Vansummeren



PDF
Thumbnail PDF

File

LIPIcs.ICDT.2020.15.pdf
  • Filesize: 0.5 MB
  • 17 pages

Document Identifiers

Author Details

Alejandro Grez
  • Pontificia Universidad Católica de Chile, Santiago, Chile
  • Millennium Institute for Foundational Research on Data, Santiago, Chile
Cristian Riveros
  • Pontificia Universidad Católica de Chile, Santiago, Chile
  • Millennium Institute for Foundational Research on Data, Santiago, Chile
Martín Ugarte
  • Millennium Institute for Foundational Research on Data, Santiago, Chile
Stijn Vansummeren
  • Université Libre de Bruxelles, Brussels, Belgium

Cite AsGet BibTex

Alejandro Grez, Cristian Riveros, Martín Ugarte, and Stijn Vansummeren. On the Expressiveness of Languages for Complex Event Recognition. In 23rd International Conference on Database Theory (ICDT 2020). Leibniz International Proceedings in Informatics (LIPIcs), Volume 155, pp. 15:1-15:17, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2020)
https://doi.org/10.4230/LIPIcs.ICDT.2020.15

Abstract

Complex Event Recognition (CER for short) has recently gained attention as a mechanism for detecting patterns in streams of continuously arriving event data. Numerous CER systems and languages have been proposed in the literature, commonly based on combining operations from regular expressions (sequencing, iteration, and disjunction) and relational algebra (e.g., joins and filters). While these languages are naturally first-order, meaning that variables can only bind single elements, they also provide capabilities for filtering sets of events that occur inside iterative patterns; for example requiring sequences of numbers to be increasing. Unfortunately, these type of filters usually present ad-hoc syntax and under-defined semantics, precisely because variables cannot bind sets of events. As a result, CER languages that provide filtering of sequences commonly lack rigorous semantics and their expressive power is not understood. In this paper we embark on two tasks: First, to define a denotational semantics for CER that naturally allows to bind and filter sets of events; and second, to compare the expressive power of this semantics with that of CER languages that only allow for binding single events. Concretely, we introduce Set-Oriented Complex Event Logic (SO-CEL for short), a variation of the CER language introduced in [Grez et al., 2019] in which all variables bind to sets of matched events. We then compare SO-CEL with CEL, the CER language of [Grez et al., 2019] where variables bind single events. We show that they are equivalent in expressive power when restricted to unary predicates but, surprisingly, incomparable in general. Nevertheless, we show that if we restrict to sets of binary predicates, then SO-CEL is strictly more expressive than CEL. To get a better understanding of the expressive power, computational capabilities, and limitations of SO-CEL, we also investigate the relationship between SO-CEL and Complex Event Automata (CEA), a natural computational model for CER languages. We define a property on CEA called the *-property and show that, under unary predicates, SO-CEL captures precisely the subclass of CEA that satisfy this property. Finally, we identify the operations that SO-CEL is lacking to characterize CEA and introduce a natural extension of the language that captures the complete class of CEA under unary predicates.

Subject Classification

ACM Subject Classification
  • Information systems → Data streams
  • Information systems → Query operators
  • Theory of computation → Formal languages and automata theory
Keywords
  • Query languages
  • Complex Event Recognition
  • Logics
  • Automata theory

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads

References

  1. Jagrati Agrawal, Yanlei Diao, Daniel Gyllstrom, and Neil Immerman. Efficient Pattern Matching over Event Streams. In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, SIGMOD '08, pages 147-160, 2008. Google Scholar
  2. Alfred V. Aho. Algorithms for Finding Patterns in Strings. In Handbook of Theoretical Computer Science, Volume A: Algorithms and Complexity (A), pages 255-300. North Holland, 1990. Google Scholar
  3. Alexander Artikis, Alessandro Margara, Martín Ugarte, Stijn Vansummeren, and Matthias Weidlich. Complex Event Recognition Languages: Tutorial. In International Conference on Distributed and Event-based Systems, DEBS 2017,, pages 7-10, 2017. Google Scholar
  4. Alexander Artikis, Marek J. Sergot, and Georgios Paliouras. An Event Calculus for Event Recognition. IEEE Trans. Knowl. Data Eng., 27(4):895-908, 2015. Google Scholar
  5. Cezar Câmpeanu, Kai Salomaa, and Sheng Yu. A Formal Study Of Practical Regular Expressions. Int. J. Found. Comput. Sci., 14(6):1007-1018, 2003. URL: https://doi.org/10.1142/S012905410300214X.
  6. Paris Carbone, Asterios Katsifodimos, Stephan Ewen, Volker Markl, Seif Haridi, and Kostas Tzoumas. Apache Flinktrademark: Stream and Batch Processing in a Single Engine. IEEE Data Eng. Bull., 38(4):28-38, 2015. URL: http://sites.computer.org/debull/A15dec/p28.pdf.
  7. Benjamin Carle and Paliath Narendran. On Extended Regular Expressions. In LATA 2009, volume 5457 of Lecture Notes in Computer Science, pages 279-289, 2009. URL: https://doi.org/10.1007/978-3-642-00982-2_24.
  8. Badrish Chandramouli, Jonathan Goldstein, Mike Barnett, Robert DeLine, John C. Platt, James F. Terwilliger, and John Wernsing. Trill: A High-Performance Incremental Query Processor for Diverse Analytics. PVLDB, 8(4):401-412, 2014. URL: https://doi.org/10.14778/2735496.2735503.
  9. Charles D. Cranor, Yuan Gao, Theodore Johnson, Vladislav Shkapenyuk, and Oliver Spatscheck. Gigascope: high performance network monitoring with an SQL interface. In SIGMOD, page 623, 2002. Google Scholar
  10. Charles D. Cranor, Theodore Johnson, Oliver Spatscheck, and Vladislav Shkapenyuk. Gigascope: A Stream Database for Network Applications. In SIGMOD, pages 647-651, 2003. Google Scholar
  11. Gianpaolo Cugola and Alessandro Margara. Complex Event Processing with T-REX. The Journal of Systems and Software, 2012. Google Scholar
  12. Gianpaolo Cugola and Alessandro Margara. Processing flows of information: From data stream to complex event processing. ACM Computing Surveys (CSUR), 2012. Google Scholar
  13. Alan Demers, Johannes Gehrke, Mingsheng Hong, Mirek Riedewald, and Walker White. A general algebra and implementation for monitoring event streams. Technical report, Cornell University, 2005. Google Scholar
  14. Alan Demers, Johannes Gehrke, Mingsheng Hong, Mirek Riedewald, and Walker White. Towards expressive publish/subscribe systems. In EDBT, 2006. Google Scholar
  15. Alan J. Demers, Johannes Gehrke, Biswanath Panda, Mirek Riedewald, Varun Sharma, and Walker M. White. Cayuga: A General Purpose Event Monitoring System. In CIDR 2007, pages 412-422, 2007. Google Scholar
  16. Ronald Fagin, Benny Kimelfeld, Frederick Reiss, and Stijn Vansummeren. Document Spanners: A Formal Approach to Information Extraction. J. ACM, 62(2):12:1-12:51, 2015. URL: https://doi.org/10.1145/2699442.
  17. Alejandro Grez, Cristian Riveros, and Martin Ugarte. A formal framework for Complex Event Processing. In ICDT, 2019. Google Scholar
  18. Mahmudul Hasan, Jonghyun Choi, Jan Neumann, Amit K. Roy-Chowdhury, and Larry S. Davis. Learning Temporal Regularity in Video Sequences. In 2016 Conference on Computer Vision and Pattern Recognition, pages 733-742, 2016. Google Scholar
  19. Martin Hirzel, Guillaume Baudart, Angela Bonifati, Emanuele Della Valle, Sherif Sakr, and Akrivi Vlachou. Stream Processing Languages in the Big Data Era. SIGMOD Record, 47(2):29-40, 2018. URL: https://doi.org/10.1145/3299887.3299892.
  20. Ilya Kolchinsky, Izchak Sharfman, and Assaf Schuster. Lazy evaluation methods for detecting complex events. In International Conference on Distributed Event-Based Systems, DEBS '15, pages 34-45, 2015. Google Scholar
  21. Leonid Libkin. Elements of finite model theory. Springer Science & Business Media, 2013. Google Scholar
  22. Walker M. White, Mirek Riedewald, Johannes Gehrke, and Alan J. Demers. What is "next" in event processing? In PODS, pages 263-272, 2007. Google Scholar
  23. Eugene Wu, Yanlei Diao, and Shariq Rizvi. High-performance complex event processing over streams. In SIGMOD, 2006. Google Scholar
  24. Haopeng Zhang, Yanlei Diao, and Neil Immerman. On complexity and optimization of expensive queries in complex event processing. In SIGMOD, 2014. Google Scholar
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail