Conditional Independence on Semiring Relations

Author Miika Hannula






Author Details

Miika Hannula
  • University of Helsinki, Finland

Cite As

Miika Hannula. Conditional Independence on Semiring Relations. In 27th International Conference on Database Theory (ICDT 2024). Leibniz International Proceedings in Informatics (LIPIcs), Volume 290, pp. 20:1-20:20, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2024)
https://doi.org/10.4230/LIPIcs.ICDT.2024.20

Abstract

Conditional independence plays a foundational role in database theory, probability theory, information theory, and graphical models. In databases, a notion similar to conditional independence, known as the (embedded) multivalued dependency, appears in database normalization. Many properties of conditional independence are shared across various domains, and to some extent these commonalities can be studied through a measure-theoretic approach. The present paper proposes an alternative approach via semiring relations, defined by extending database relations with tuple annotations from some commutative semiring. Integrating various interpretations of conditional independence in this context, we investigate how the choice of the underlying semiring impacts the corresponding axiomatic and decomposition properties. We specifically identify positivity and multiplicative cancellativity as the key semiring properties that enable extending results from the relational context to the broader semiring framework. Additionally, we explore the relationships between different conditional independence notions through model theory.
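As a rough illustration of the semiring-relation idea, the following Python sketch annotates tuples with values from a commutative semiring and tests one possible reading of conditional independence. The abstract does not state the formal definition, so the condition used here, R(xy) · R(xz) = R(x) · R(xyz) with marginals computed by semiring addition, is an assumption chosen for illustration. The names `BOOLEAN`, `NATURAL`, `marginal`, and `satisfies_ci` are hypothetical.

```python
from itertools import product

# A commutative semiring is modelled as (zero, one, add, mul).
# Both encodings below are illustrative assumptions, not the paper's notation.
BOOLEAN = (False, True, lambda a, b: a or b, lambda a, b: a and b)
NATURAL = (0, 1, lambda a, b: a + b, lambda a, b: a * b)


def marginal(relation, schema, attrs, semiring):
    """Sum (in the semiring) the annotations of all tuples agreeing on attrs."""
    zero, _, add, _ = semiring
    idx = [schema.index(a) for a in attrs]
    out = {}
    for tup, weight in relation.items():
        key = tuple(tup[i] for i in idx)
        out[key] = add(out.get(key, zero), weight)
    return out


def satisfies_ci(relation, schema, x, y, z, semiring):
    """Assumed condition for 'Y independent of Z given X':
    R(xy) * R(xz) == R(x) * R(xyz) for every combination of values."""
    zero, _, _, mul = semiring
    m_x = marginal(relation, schema, x, semiring)
    m_xy = marginal(relation, schema, x + y, semiring)
    m_xz = marginal(relation, schema, x + z, semiring)
    m_xyz = marginal(relation, schema, x + y + z, semiring)
    y_values = {key[len(x):] for key in m_xy}
    z_values = {key[len(x):] for key in m_xz}
    for vx, vy, vz in product(m_x, y_values, z_values):
        lhs = mul(m_xy.get(vx + vy, zero), m_xz.get(vx + vz, zero))
        rhs = mul(m_x.get(vx, zero), m_xyz.get(vx + vy + vz, zero))
        if lhs != rhs:
            return False
    return True


schema = ["A", "B", "C"]

# A bag (natural-number annotations) whose multiplicities do not factorize.
bag = {(0, 0, 0): 1, (0, 0, 1): 1, (0, 1, 0): 1, (0, 1, 1): 2}

# The same tuples viewed as an ordinary relation (Boolean annotations).
support = {tup: True for tup in bag}

print(satisfies_ci(support, schema, ["A"], ["B"], ["C"], BOOLEAN))
print(satisfies_ci(bag, schema, ["A"], ["B"], ["C"], NATURAL))
```

Under this assumed condition, the Boolean case behaves like a multivalued dependency check on the set of tuples, while the natural-number case also depends on the multiplicities, which is one way the choice of semiring can change whether an independence statement holds.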

Subject Classification

ACM Subject Classification
  • Theory of computation → Database theory
Keywords
  • semiring
  • conditional independence
  • functional dependency
  • decomposition
  • axiom
