Explaining SAT Solving Using Causal Reasoning

Yang, Jiong; Shaw, Arijit; Baluta, Teodora; Soos, Mate; Meel, Kuldeep S.

doi:10.4230/LIPIcs.SAT.2023.28

Abstract

The past three decades have witnessed notable success in designing efficient SAT solvers, with modern solvers capable of solving industrial benchmarks containing millions of variables in just a few seconds. The success of modern SAT solvers is owed to the widely used conflict-driven clause learning (CDCL) algorithm, which still lacks a comprehensive theoretical investigation. Furthermore, it has been observed that CDCL solvers still struggle with specific classes of benchmarks comprising only hundreds of variables, which contrasts with their widespread use in real-world applications. Consequently, there is an urgent need to uncover the inner workings of these seemingly weak yet powerful black boxes.

In this paper, we present a first step towards this goal by introducing an approach called CausalSAT, which employs causal reasoning to gain insights into the functioning of modern SAT solvers. CausalSAT first generates observational data from the execution of SAT solvers and learns a structured graph representing the causal relationships between the components of a SAT solver. Subsequently, given a query such as whether a clause with a low literal block distance (LBD) has a higher clause utility, CausalSAT calculates the causal effect of LBD on clause utility and provides an answer to the question. We use CausalSAT to quantitatively verify hypotheses previously regarded as "rules of thumb" or empirical findings, such as the query above or the notion that clauses with high LBD experience a rapid drop in utility over time. Moreover, CausalSAT can address previously unexplored questions, such as which branching heuristic leads to greater clause utility, in order to study the relationship between branching and clause management. Experimental evaluations using practical benchmarks demonstrate that CausalSAT effectively fits the data, verifies four "rules of thumb", and provides answers to three questions closely related to implementing modern solvers.
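
The kind of query described above (the causal effect of LBD on clause utility) can be illustrated with a minimal sketch. It is not the authors' implementation and uses no solver data: the records are synthetic, the single confounder (`small`, standing in for clause size) and all function names are hypothetical, and the causal graph is assumed rather than learned. It only shows why a causal estimate, here obtained by backdoor adjustment over the assumed confounder, can differ from a naive comparison of observed averages.

```python
import random
from collections import defaultdict


def simulate(n=20000, seed=0):
    """Generate synthetic (small, low_lbd, utility) records.

    Assumed toy model, not taken from the paper: `small` influences both
    whether a clause has low LBD and its utility, so it confounds the
    LBD -> utility relationship. The true effect of low LBD is +1.0.
    """
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        small = rng.random() < 0.5
        low_lbd = rng.random() < (0.8 if small else 0.2)
        utility = 1.0 * low_lbd + 2.0 * small + rng.gauss(0, 1)
        rows.append((small, low_lbd, utility))
    return rows


def mean(values):
    return sum(values) / len(values)


def naive_effect(rows):
    """Difference of observed means, ignoring the confounder."""
    treated = [u for _, t, u in rows if t]
    control = [u for _, t, u in rows if not t]
    return mean(treated) - mean(control)


def backdoor_effect(rows):
    """Backdoor adjustment: average the per-stratum difference in means,
    weighting each stratum of the confounder by its share of the data."""
    strata = defaultdict(lambda: {True: [], False: []})
    for z, t, u in rows:
        strata[z][t].append(u)
    total = len(rows)
    effect = 0.0
    for groups in strata.values():
        weight = (len(groups[True]) + len(groups[False])) / total
        effect += weight * (mean(groups[True]) - mean(groups[False]))
    return effect


if __name__ == "__main__":
    data = simulate()
    print("naive estimate:   ", round(naive_effect(data), 3))
    print("adjusted estimate:", round(backdoor_effect(data), 3))
```

In this toy model the naive estimate overstates the effect because small clauses are both more likely to have low LBD and more useful, while the adjusted estimate stays close to the true value of 1.0 built into the simulation.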

