Learning and Reasoning with Graph Data: Neural and Statistical-Relational Approaches (Invited Paper)

<firstName>Manfred</firstName>

<lastName>Jaeger</lastName>

<name>Manfred Jaeger</name>

<affiliation>Aalborg University, Denmark</affiliation>

<orcid>https://orcid.org/0000-0002-5641-8153</orcid>

</author>

<text>Ralph Abboud, Ismail Ilkan Ceylan, Martin Grohe, and Thomas Lukasiewicz. The surprising power of graph neural networks with random node initialization. In Proceedings of IJCAI 2021, 2021.</text>

<url/>

</reference>

<text>Francis Bach. Breaking the curse of dimensionality with convex neural networks. The Journal of Machine Learning Research, 18(1):629-681, 2017.</text>

<url/>

</reference>

<text>Albert-László Barabási and Réka Albert. Emergence of scaling in random networks. science, 286(5439):509-512, 1999.</text>

<url/>

</reference>

<text>Pablo Barceló, Egor Kostylev, Mikael Monet, Jorge Pérez, Juan Reutter, and Juan-Pablo Silva. The logical expressiveness of graph neural networks. In 8th International Conference on Learning Representations (ICLR 2020), 2020.</text>

<url/>

</reference>

<text>Elena Bellodi and Fabrizio Riguzzi. Structure learning of probabilistic logic programs by searching the clause space. Theory and Practice of Logic Programming, 15(2):169-212, 2015.</text>

<url/>

</reference>

<text>Yoshua Bengio, Nicolas Le Roux, Pascal Vincent, Olivier Delalleau, and Patrice Marcotte. Convex neural networks. Advances in neural information processing systems, 18:123, 2006.</text>

<url/>

</reference>

<text>J. S. Breese, R. P. Goldman, and M. P. Wellman. Introduction to the special section on knowledge-based construction of probabilistic decision models. IEEE Transactions on Systems, Man, and Cybernetics, 24(11), 1994.</text>

<url/>

</reference>

<text>Luitzen EJ Brouwer. Beweis der Invarianz des n-dimensionalen Gebiets. Mathematische Annalen, 71(3):305-313, 1911.</text>

<url/>

</reference>

<text>Alon Brutzkus and Amir Globerson. Why do larger models generalize better? a theoretical perspective via the xor problem. In International Conference on Machine Learning, pages 822-830. PMLR, 2019.</text>

<url/>

</reference>

<text>Gabriele Corso, Luca Cavalleri, Dominique Beaini, Pietro Liò, and Petar Veličković. Principal neighbourhood aggregation for graph nets. Advances in Neural Information Processing Systems, 33, 2020.</text>

<url/>

</reference>

<text>Hanjun Dai, Azade Nazi, Yujia Li, Bo Dai, and Dale Schuurmans. Scalable deep generative modeling for sparse graphs. In International Conference on Machine Learning, pages 2302-2312. PMLR, 2020.</text>

<url/>

</reference>

<text>L. De Raedt. Logical and Relational Learning. Springer, 2008.</text>

<url/>

</reference>

<text>R. de Salvo Braz, E. Amir, and D. Roth. Lifted first-order probabilistic inference. In Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI-05), pages 1319-1325, 2005.</text>

<url/>

</reference>

<text>Thomas Elsken, Jan Hendrik Metzen, and Frank Hutter. Neural architecture search: A survey. The Journal of Machine Learning Research, 20(1):1997-2017, 2019.</text>

<url/>

</reference>

<text>Varun Embar, Sriram Srinivasan, and Lise Getoor. A comparison of statistical relational learning and graph neural networks for aggregate graph queries. Machine Learning, pages 1-20, 2021.</text>

<url/>

</reference>

<text>Herbert B Enderton. A mathematical introduction to logic. Elsevier, 2001.</text>

<url/>

</reference>

<text>Paul Erdos, Alfréd Rényi, et al. On the evolution of random graphs. Publ. Math. Inst. Hung. Acad. Sci, 5(1):17-60, 1960.</text>

<url/>

</reference>

<text>N. Friedman, Lise Getoor, D. Koller, and A. Pfeffer. Learning probabilistic relational models. In Proceedings of the 16th International Joint Conference on Artificial Intelligence (IJCAI-99), 1999.</text>

<url/>

</reference>

<text>Vikas Garg, Stefanie Jegelka, and Tommi Jaakkola. Generalization and representational limits of graph neural networks. In International Conference on Machine Learning, pages 3419-3430. PMLR, 2020.</text>

<url/>

</reference>

<text>L. Getoor and B. Taskar, editors. Introduction to Statistical Relational Learning. MIT Press, 2007.</text>

<url/>

</reference>

<text>Justin Gilmer, Samuel S Schoenholz, Patrick F Riley, Oriol Vinyals, and George E Dahl. Neural message passing for quantum chemistry. In International conference on machine learning, pages 1263-1272. PMLR, 2017.</text>

<url/>

</reference>

<text>Ian Goodfellow, Yoshua Bengio, and Aaron Courville. Deep Learning. MIT Press, 2016. URL: http://www.deeplearningbook.org.</text>

<url>http://www.deeplearningbook.org</url>

</reference>

<text>William L Hamilton. Graph representation learning. Synthesis Lectures on Artifical Intelligence and Machine Learning, 14(3):1-159, 2020.</text>

<url/>

</reference>

<text>William L. Hamilton, Zhitao Ying, and Jure Leskovec. Inductive representation learning on large graphs. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 4-9 December 2017, Long Beach, CA, USA, pages 1024-1034, 2017. URL: http://papers.nips.cc/paper/6703-inductive-representation-learning-on-large-graphs.</text>

<url>http://papers.nips.cc/paper/6703-inductive-representation-learning-on-large-graphs</url>

</reference>

<text>D. Heckerman, C. Meek, and D. Koller. Probabilistic entity-relationship models, PRMs, and plate models. In L. Getoor and B. Taskar, editors, Introduction to Statistical Relational Learning. MIT Press, 2007.</text>

<url/>

</reference>

<text>Peter D Hoff, Adrian E Raftery, and Mark S Handcock. Latent space approaches to social network analysis. Journal of the American Statistical Association, 97(460):1090-1098, 2002.</text>

<url/>

</reference>

<text>Paul W Holland, Kathryn Blackmond Laskey, and Samuel Leinhardt. Stochastic blockmodels: First steps. Social networks, 5(2):109-137, 1983.</text>

<url/>

</reference>

<text>Kurt Hornik. Approximation capabilities of multilayer feedforward networks. Neural networks, 4(2):251-257, 1991.</text>

<url/>

</reference>

<text>Manfred Jaeger. Relational Bayesian networks. In Dan Geiger and Prakash Pundalik Shenoy, editors, Proceedings of the 13th Conference of Uncertainty in Artificial Intelligence (UAI-13), pages 266-273, Providence, USA, 1997. Morgan Kaufmann.</text>

<url/>

</reference>

<text>Manfred Jaeger. On the complexity of inference about probabilistic relational models. Artificial Intelligence, 117:297-308, 2000.</text>

<url/>

</reference>

<text>Manfred Jaeger. Model-theoretic expressivity analysis. In L. De Raedt, K. Frasconi, P.and Kersting, and S.H. Muggleton, editors, Probabilistic Inductive Logic Programming, volume 4911 of LNCS, pages 325-339. Springer, 2008.</text>

<url/>

</reference>

<text>Manfred Jaeger. Probabilistic logic and relational models. In Reda Alhajj and Jon Rokne, editors, Encyclopedia of Social Network Analysis and Mining, pages 1-15. Springer New York, New York, NY, 2017. URL: https://doi.org/10.1007/978-1-4614-7163-9_157-1.</text>

<url>https://doi.org/10.1007/978-1-4614-7163-9_157-1</url>

</reference>

<text>Manfred Jaeger, Marco Lippi, Andrea Passerini, and Paolo Frasconi. Type extension trees for feature construction and learning in relational domains. Artificial Intelligence, 204:30-55, 2013. URL: https://doi.org/10.1016/j.artint.2013.08.002.</text>

<url>https://doi.org/10.1016/j.artint.2013.08.002</url>

</reference>

<text>Manfred * Jaeger. Complex probabilistic modeling with recursive relational Bayesian networks. Annals of Mathematics and Artificial Intelligence, 32:179-220, 2001.</text>

<url/>

</reference>

<text>Manfred Jaeger*. Parameter learning for relational Bayesian networks. In Proceedings of the 24th International Conference on Machine Learning (ICML), 2007.</text>

<url/>

</reference>

<text>Jiuchuan Jiang and Manfred Jaeger. Numeric input relations for relational learning with applications to community structure analysis. CoRR, abs/1506.05055, 2015. URL: http://arxiv.org/abs/1506.05055.</text>

<url>http://arxiv.org/abs/1506.05055</url>

</reference>

<text>K. Kersting and L. De Raedt. Towards combining inductive logic programming and Bayesian networks. In Proceedings of the Eleventh International Conference on Inductive Logic Programming (ILP-2001), Springer Lecture Notes in AI 2157, 2001.</text>

<url/>

</reference>

<text>Tushar Khot, Sriraam Natarajan, Kristian Kersting, and Jude Shavlik. Learning Markov logic networks via functional gradient boosting. In 2011 IEEE 11th international conference on data mining, pages 320-329. IEEE, 2011.</text>

<url/>

</reference>

<text>Angelika Kimmig, Bart Demoen, L De Raedt, V. Santos Costa, and Ricardo Rocha. On the implementation of the probabilistic logic programming language ProbLog. Theory and Practice of Logic Programming, 11(2-3):235-262, 2011. URL: https://doi.org/10.1017/S1471068410000566.</text>

<url>https://doi.org/10.1017/S1471068410000566</url>

</reference>

<text>Diederik P Kingma and Jimmy Ba. Adam: A method for stochastic optimization. arXiv preprint, 2014. URL: http://arxiv.org/abs/1412.6980.</text>

<url>http://arxiv.org/abs/1412.6980</url>

</reference>

<text>Thomas N Kipf and Max Welling. Semi-supervised classification with graph convolutional networks. arXiv preprint, 2016. URL: http://arxiv.org/abs/1609.02907.</text>

<url>http://arxiv.org/abs/1609.02907</url>

</reference>

<text>Thomas N Kipf and Max Welling. Variational graph auto-encoders. arXiv preprint, 2016. URL: http://arxiv.org/abs/1611.07308.</text>

<url>http://arxiv.org/abs/1611.07308</url>

</reference>

<text>Stanley Kok and Pedro Domingos. Learning the structure of markov logic networks. In Proceedings of the 22nd international conference on Machine learning, pages 441-448, 2005.</text>

<url/>

</reference>

<text>Daphne Koller and Nir Friedman. Probabilistic graphical models: principles and techniques. MIT press, 2009.</text>

<url/>

</reference>

<text>Kathryn Blackmond Laskey. MEBN: A language for first-order Bayesian knowledge bases. Artificial Intelligence, 172(2-3):140-178, 2008. URL: https://doi.org/10.1016/j.artint.2007.09.006.</text>

<url>https://doi.org/10.1016/j.artint.2007.09.006</url>

</reference>

<text>Kathryn Blackmond Laskey and Suzanne M. Mahoney. Network fragments: Representing knowledge for constructing probabilistic models. In Proceedings of the 13th Annual Conference on Uncertainty in Artificial Intelligence (UAI-97), pages 334-341, San Francisco, CA, 1997. Morgan Kaufmann Publishers.</text>

<url/>

</reference>

<text>Yujia Li, Oriol Vinyals, Chris Dyer, Razvan Pascanu, and Peter Battaglia. Learning deep generative models of graphs. arXiv preprint, 2018. URL: http://arxiv.org/abs/1803.03324.</text>

<url>http://arxiv.org/abs/1803.03324</url>

</reference>

<text>Yao Ma, Suhang Wang, Chara C Aggarwal, Dawei Yin, and Jiliang Tang. Multi-dimensional graph convolutional networks. In Proceedings of the 2019 SIAM International Conference on Data Mining, pages 657-665. SIAM, 2019.</text>

<url/>

</reference>

<text>Robin Manhaeve, Sebastijan Dumancic, Angelika Kimmig, Thomas Demeester, and Luc De Raedt. Deepproblog: Neural probabilistic logic programming. Advances in Neural Information Processing Systems, 31:3749-3759, 2018.</text>

<url/>

</reference>

<text>Lilyana Mihalkova and Raymond J Mooney. Bottom-up learning of Markov logic network structure. In Proceedings of the 24th international conference on Machine learning, pages 625-632, 2007.</text>

<url/>

</reference>

<text>Christopher Morris, Martin Ritzert, Matthias Fey, William L Hamilton, Jan Eric Lenssen, Gaurav Rattan, and Martin Grohe. Weisfeiler and leman go neural: Higher-order graph neural networks. In Proceedings of the AAAI Conference on Artificial Intelligence, pages 4602-4609, 2019.</text>

<url/>

</reference>

<text>Behnam Neyshabur, Zhiyuan Li, Srinadh Bhojanapalli, Yann LeCun, and Nathan Srebro. The role of over-parametrization in generalization of neural networks. In International Conference on Learning Representations, 2018.</text>

<url/>

</reference>

<text>L. Ngo and P. Haddawy. Probabilistic logic programming and Bayesian networks. In Algorithms, Concurrency and Knowledge (Proceedings ACSC95), Springer Lecture Notes in Computer Science 1023, pages 286-300, 1995.</text>

<url/>

</reference>

<text>Jorge Nocedal. Updating quasi-newton matrices with limited storage. Mathematics of computation, 35(151):773-782, 1980.</text>

<url/>

</reference>

<text>Giovanni Pellegrini, Alessandro Tibo, Paolo Frasconi, Andrea Passerini, and Manfred Jaeger. Learning aggregation functions. In Proceedings of the Thirty International Joint Conference on Artificial Intelligence (IJCAI-21). International Joint Conferences on Artificial Intelligence, 2021.</text>

<url/>

</reference>

<text>Trang Pham, Truyen Tran, Dinh Phung, and Svetha Venkatesh. Column networks for collective classification. In Thirty-first AAAI conference on artificial intelligence, 2017.</text>

<url/>

</reference>

<text>D. Poole. First-order probabilistic inference. In Proceedings of the 18th International Joint Conference on Artificial Intelligence (IJCAI-03), 2003.</text>

<url/>

</reference>

<text>David Poole. The independent choice logic for modelling multiple agents under uncertainty. Artificial Intelligence, 94(1-2):7-56, 1997.</text>

<url/>

</reference>

<text>Meng Qu, Yoshua Bengio, and Jian Tang. Gmnn: Graph Markov neural networks. In International conference on machine learning, pages 5241-5250. PMLR, 2019.</text>

<url/>

</reference>

<text>Luc De Raedt, Kristian Kersting, Sriraam Natarajan, and David Poole. Statistical relational artificial intelligence: Logic, probability, and computation. Synthesis lectures on artificial intelligence and machine learning, 10(2):1-189, 2016.</text>

<url/>

</reference>

<text>Marco Tulio Ribeiro, Sameer Singh, and Carlos Guestrin. "Why should I trust you?" explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pages 1135-1144, 2016.</text>

<url/>

</reference>

<text>M. Richardson and P. Domingos. Markov logic networks. Machine Learning, 62(1-2):107-136, 2006.</text>

<url/>

</reference>

<text>S. Russell and P. Norvig. Artificial Intelligence: A Modern Approach. Pearson, third edition edition, 2010.</text>

<url/>

</reference>

<text>Ryoma Sato. A survey on the expressive power of graph neural networks. arXiv preprint, 2020. URL: http://arxiv.org/abs/2003.04078.</text>

<url>http://arxiv.org/abs/2003.04078</url>

</reference>

<text>Ryoma Sato, Makoto Yamada, and Hisashi Kashima. Random features strengthen graph neural networks. In Proceedings of the 2021 SIAM International Conference on Data Mining (SDM), pages 333-341. SIAM, 2021.</text>

<url/>

</reference>

<text>T. Sato. A statistical learning method for logic programs with distribution semantics. In Proceedings of the 12th International Conference on Logic Programming (ICLP'95), pages 715-729, 1995.</text>

<url/>

</reference>

<text>Franco Scarselli, Marco Gori, Ah Chung Tsoi, Markus Hagenbuchner, and Gabriele Monfardini. The graph neural network model. IEEE transactions on neural networks, 20(1):61-80, 2008.</text>

<url/>

</reference>

<text>Michael Schlichtkrull, Thomas N Kipf, Peter Bloem, Rianne Van Den Berg, Ivan Titov, and Max Welling. Modeling relational data with graph convolutional networks. In European semantic web conference, pages 593-607. Springer, 2018.</text>

<url/>

</reference>

<text>Gustav Šourek, Filip Železnỳ, and Ondřej Kuželka. Beyond graph neural networks with lifted relational neural networks. Machine Learning, pages 1-44, 2021.</text>

<url/>

</reference>

<text>J. van Mill. Domain invariance. Encyclopedia of Mathematics. URL: http://encyclopediaofmath.org/index.php?title=Domain_invariance&oldid=16623.</text>

<url>http://encyclopediaofmath.org/index.php?title=Domain_invariance&oldid=16623</url>

</reference>

<text>Petar Veličković, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Liò, and Yoshua Bengio. Graph attention networks. In International Conference on Learning Representations, 2018.</text>

<url/>

</reference>

<text>Clément Vignac, Andreas Loukas, and Pascal Frossard. Building powerful and equivariant graph neural networks with structural message-passing. In NeurIPS, 2020.</text>

<url/>

</reference>

<text>Edward Wagstaff, Fabian Fuchs, Martin Engelcke, Ingmar Posner, and Michael A Osborne. On the limitations of representing functions on sets. In International Conference on Machine Learning, pages 6487-6494. PMLR, 2019.</text>

<url/>

</reference>

<text>Zonghan Wu, Shirui Pan, Fengwen Chen, Guodong Long, Chengqi Zhang, and S Yu Philip. A comprehensive survey on graph neural networks. IEEE transactions on neural networks and learning systems, 32(1):4-24, 2021.</text>

<url/>

</reference>

<text>Keyulu Xu, Weihua Hu, Jure Leskovec, and Stefanie Jegelka. How powerful are graph neural networks? In International Conference on Learning Representations, 2019.</text>

<url/>

</reference>

<text>Jiaxuan You, Rex Ying, Xiang Ren, William Hamilton, and Jure Leskovec. Graphrnn: Generating realistic graphs with deep auto-regressive models. In International conference on machine learning, pages 5708-5717. PMLR, 2018.</text>

<url/>

</reference>

<text>Manzil Zaheer, Satwik Kottur, Siamak Ravanbakhsh, Barnabas Poczos, Russ R Salakhutdinov, and Alexander J Smola. Deep sets. In I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, editors, Advances in Neural Information Processing Systems, volume 30. Curran Associates, Inc., 2017. URL: https://proceedings.neurips.cc/paper/2017/file/f22e4747da1aa27e363d86d40ff442fe-Paper.pdf.</text>

<url>https://proceedings.neurips.cc/paper/2017/file/f22e4747da1aa27e363d86d40ff442fe-Paper.pdf</url>

</reference>

<text>Muhan Zhang and Yixin Chen. Link prediction based on graph neural networks. Advances in Neural Information Processing Systems, 31:5165-5175, 2018.</text>

<url/>

</reference>

<text>Jie Zhou, Ganqu Cui, Shengding Hu, Zhengyan Zhang, Cheng Yang, Zhiyuan Liu, Lifeng Wang, Changcheng Li, and Maosong Sun. Graph neural networks: A review of methods and applications. AI Open, 1:57-81, 2020.</text>

<url/>

</reference>