Domain-Based Nucleic-Acid Minimum Free Energy: Algorithmic Hardness and Parameterized Bounds

Authors Erik D. Demaine, Timothy Gomez, Elise Grizzell, Markus Hecher, Jayson Lynch, Robert Schweller, Ahmed Shalaby, Damien Woods



PDF
Thumbnail PDF

File

LIPIcs.DNA.30.2.pdf
  • Filesize: 1.06 MB
  • 24 pages

Document Identifiers

Author Details

Erik D. Demaine
  • Massachusetts Institute of Technology, USA
Timothy Gomez
  • Massachusetts Institute of Technology, USA
Elise Grizzell
  • University of Texas Rio Grande Valley, USA
Markus Hecher
  • Massachusetts Institute of Technology, USA
Jayson Lynch
  • Massachusetts Institute of Technology, USA
Robert Schweller
  • University of Texas Rio Grande Valley, USA
Ahmed Shalaby
  • Hamilton Institute, Department of Computer , Science, Maynooth University, Ireland
Damien Woods
  • Hamilton Institute, Department of Computer , Science, Maynooth University, Ireland

Acknowledgements

We thank Jenny Diomidova, Marco Rodriguez, Tim Wylie for helpful discussions.

Cite AsGet BibTex

Erik D. Demaine, Timothy Gomez, Elise Grizzell, Markus Hecher, Jayson Lynch, Robert Schweller, Ahmed Shalaby, and Damien Woods. Domain-Based Nucleic-Acid Minimum Free Energy: Algorithmic Hardness and Parameterized Bounds. In 30th International Conference on DNA Computing and Molecular Programming (DNA 30). Leibniz International Proceedings in Informatics (LIPIcs), Volume 314, pp. 2:1-2:24, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2024)
https://doi.org/10.4230/LIPIcs.DNA.30.2

Abstract

Molecular programmers and nanostructure engineers use domain-level design to abstract away messy DNA/RNA sequence, chemical and geometric details. Such domain-level abstractions are enforced by sequence design principles and provide a key principle that allows scaling up of complex multistranded DNA/RNA programs and structures. Determining the most favoured secondary structure, or Minimum Free Energy (MFE), of a set of strands, is typically studied at the sequence level but has seen limited domain-level work. We analyse the computational complexity of MFE for multistranded systems in a simple setting were we allow only 1 or 2 domains per strand. On the one hand, with 2-domain strands, we find that the MFE decision problem is NP-complete, even without pseudoknots, and requires exponential time algorithms assuming SAT does. On the other hand, in the simplest case of 1-domain strands there are efficient MFE algorithms for various binding modes. However, even in this single-domain case, MFE is P-hard for promiscuous binding, where one domain may bind to multiple as experimentally used by Nikitin [Nat Chem., 2023], which in turn implies that strands consisting of a single domain efficiently implement arbitrary Boolean circuits.

Subject Classification

ACM Subject Classification
  • Theory of computation → Problems, reductions and completeness
  • Computer systems organization → Molecular computing
Keywords
  • Domain-based DNA designs
  • minimum free energy
  • efficient algorithms
  • NP-hard
  • P-hard
  • NC
  • fixed-parameter tractable

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads

References

  1. Tatsuya Akutsu. Dynamic programming algorithms for RNA secondary structure prediction with pseudoknots. Discrete Applied Mathematics, 104(1-3):45-62, 2000. Google Scholar
  2. Sanjeev Arora and Boaz Barak. Computational complexity: a modern approach. Cambridge University Press, 2009. Google Scholar
  3. Nikhil Bansal, Tim Oosterwijk, Tjark Vredeveld, and Ruben Van Der Zwaan. Approximating vector scheduling: almost matching upper and lower bounds. Algorithmica, 76:1077-1096, 2016. Google Scholar
  4. Robert D Barish, Rebecca Schulman, Paul WK Rothemund, and Erik Winfree. An information-bearing seed for nucleating algorithmic self-assembly. Proceedings of the National Academy of Sciences, 106(15):6054-6059, 2009. Google Scholar
  5. David A Mix Barrington, Neil Immerman, and Howard Straubing. On uniformity within NC¹. Journal of Computer and System Sciences, 41(3):274-306, 1990. Google Scholar
  6. Alexander Barvinok and Alex Samorodnitsky. Computing the partition function for perfect matchings in a hypergraph. Combinatorics, Probability and Computing, 20(6):815-835, 2011. Google Scholar
  7. Kimon Boehmer, Sarah J Berkemer, Sebastian Will, and Yann Ponty. Rna triplet repeats: Improved algorithms for structure prediction and interactions. In WABI 2024 - 24th Workshop on Algorithms in Bioinformatics, 2024. To appear. URL: https://hal.science/hal-04589903.
  8. Richard A Brualdi. Introductory combinatorics. Pearson Education India, 1977. Google Scholar
  9. Andrei Bulatov and Martin Grohe. The complexity of partition functions. Theoretical Computer Science, 348(2):148-186, 2005. Automata, Languages and Programming: Algorithms and Complexity (ICALP-A 2004). URL: https://doi.org/10.1016/j.tcs.2005.09.011.
  10. Luca Cardelli. Two-domain DNA strand displacement. Mathematical Structures in Computer Science, 23(2):247-271, 2013. Google Scholar
  11. Ho-Lin Chen, Anne Condon, and Hosna Jabbari. An O(n⁵) algorithm for MFE prediction of kissing hairpins and 4-chains in nucleic acids. Journal of Computational Biology, 16(6):803-815, 2009. Google Scholar
  12. Anne Condon, Monir Hajiaghayi, and Chris Thachuk. Predicting Minimum Free Energy Structures of Multi-Stranded Nucleic Acid Complexes Is APX-Hard. In Matthew R. Lakin and Petr Šulc, editors, 27th International Conference on DNA Computing and Molecular Programming (DNA 27), volume 205 of Leibniz International Proceedings in Informatics (LIPIcs), pages 9:1-9:21, Dagstuhl, Germany, 2021. Schloss Dagstuhl - Leibniz-Zentrum für Informatik. URL: https://doi.org/10.4230/LIPIcs.DNA.27.9.
  13. Robert M Dirks, Justin S Bois, Joseph M Schaeffer, Erik Winfree, and Niles A Pierce. Thermodynamic analysis of interacting nucleic acid strands. SIAM review, 49(1):65-88, 2007. Google Scholar
  14. Robert M Dirks and Niles A Pierce. A partition function algorithm for nucleic acid secondary structure including pseudoknots. Journal of computational chemistry, 24(13):1664-1677, 2003. Google Scholar
  15. Robert M Dirks and Niles A Pierce. An algorithm for computing nucleic acid base-pairing probabilities including pseudoknots. Journal of computational chemistry, 25(10):1295-1304, 2004. Google Scholar
  16. David Doty, Trent A Rogers, David Soloveichik, Chris Thachuk, and Damien Woods. Thermodynamic binding networks. In DNA23: The 23rd International Conference on DNA Computing and Molecular Programming, volume 10467 of LNCS, pages 249-266. Springer, 2017. Google Scholar
  17. Martin E. Dyer and Alan M. Frieze. Planar 3DM is NP-Complete. J. Algorithms, 7(2):174-184, 1986. URL: https://doi.org/10.1016/0196-6774(86)90002-7.
  18. Constantine G. Evans. Crystals that count! Physical principles and experimental investigations of DNA tile self-assembly. PhD thesis, Caltech, 2014. Google Scholar
  19. Constantine G Evans, Jackson O’Brien, Erik Winfree, and Arvind Murugan. Pattern recognition in the nucleation kinetics of non-equilibrium self-assembly. Nature, 625(7995):500-507, 2024. Google Scholar
  20. M. R. Garey and David S. Johnson. Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman, 1979. Google Scholar
  21. Michael R Garey and David S Johnson. "strong" NP-completeness results: Motivation, examples, and implications. Journal of the ACM (JACM), 25(3):499-508, 1978. Google Scholar
  22. Cody Geary, Paul WK Rothemund, and Ebbe S Andersen. A single-stranded architecture for cotranscriptional folding of RNA nanostructures. Science, 345(6198):799-804, 2014. Google Scholar
  23. Leslie M Goldschlager. The monotone and planar circuit value problems are LOG SPACE complete for P. ACM SIGACT news, 9(2):25-29, 1977. Google Scholar
  24. Raymond Greenlaw, H James Hoover, and Walter L Ruzzo. Limits to parallel computation: P-completeness theory. Oxford University Press, USA, 1995. Google Scholar
  25. Ivo L Hofacker, Christian M Reidys, and Peter F Stadler. Symmetric circular matchings and RNA folding. Discrete mathematics, 312(1):100-112, 2012. Google Scholar
  26. Russell Impagliazzo, Ramamohan Paturi, and Francis Zane. Which Problems Have Strongly Exponential Complexity? J. Comput. Syst. Sci., 63(4):512-530, 2001. URL: https://doi.org/10.1006/JCSS.2001.1774.
  27. Hosna Jabbari, Ian Wark, Carlo Montemagno, and Sebastian Will. Knotty: efficient and accurate prediction of complex RNA pseudoknot structures. Bioinformatics, 34(22):3849-3856, 2018. Google Scholar
  28. Ronny Lorenz, Stephan H Bernhart, Christian Höner zu Siederdissen, Hakim Tafer, Christoph Flamm, Peter F Stadler, and Ivo L Hofacker. ViennaRNA package 2.0. Algorithms for molecular biology, 6:1-14, 2011. Google Scholar
  29. Rune B Lyngsø and Christian NS Pedersen. Pseudoknots in RNA secondary structures. In Proceedings of the fourth annual international conference on Computational molecular biology, pages 201-209, 2000. Google Scholar
  30. Rune B Lyngsø and Christian NS Pedersen. RNA pseudoknot prediction in energy-based models. Journal of computational biology, 7(3-4):409-427, 2000. Google Scholar
  31. Rune B Lyngsø, Michael Zuker, and CN Pedersen. Fast evaluation of internal loops in RNA secondary structure prediction. Bioinformatics (Oxford, England), 15(6):440-445, 1999. Google Scholar
  32. John S McCaskill. The equilibrium partition function and base pair binding probabilities for RNA secondary structure. Biopolymers: Original Research on Biomolecules, 29(6-7):1105-1119, 1990. Google Scholar
  33. Cristopher Moore and Stephan Mertens. The Nature of Computation. Oxford University Press, 2011. Google Scholar
  34. Cristopher Moore and Stephan Mertens. The Nature of Computation. Oxford University Press, 2011. Google Scholar
  35. Maxim P Nikitin. Non-complementary strand commutation as a fundamental alternative for information processing by DNA and gene regulation. Nature Chemistry, 15(1):70-82, 2023. Google Scholar
  36. Ruth Nussinov and Ann B Jacobson. Fast algorithm for predicting the secondary structure of single-stranded RNA. Proceedings of the National Academy of Sciences, 77(11):6309-6313, 1980. Google Scholar
  37. Lulu Qian and Erik Winfree. Scaling up digital circuit computation with DNA strand displacement cascades. Science, 332(6034):1196-1201, 2011. Google Scholar
  38. Jens Reeder and Robert Giegerich. Design, implementation and evaluation of a practical pseudoknot folding algorithm based on thermodynamics. BMC bioinformatics, 5:1-12, 2004. Google Scholar
  39. Elena Rivas and Sean R Eddy. A dynamic programming algorithm for RNA structure prediction including pseudoknots. Journal of molecular biology, 285(5):2053-2068, 1999. Google Scholar
  40. Paul WK Rothemund. Folding DNA to create nanoscale shapes and patterns. Nature, 440(7082):297-302, 2006. Google Scholar
  41. John SantaLucia Jr and Donald Hicks. The thermodynamics of DNA structural motifs. Annu. Rev. Biophys. Biomol. Struct., 33:415-440, 2004. Google Scholar
  42. Thomas J. Schaefer. The Complexity of Satisfiability Problems. In Richard J. Lipton, Walter A. Burkhard, Walter J. Savitch, Emily P. Friedman, and Alfred V. Aho, editors, Proceedings of the 10th Annual ACM Symposium on Theory of Computing, May 1-3, 1978, San Diego, California, USA, pages 216-226. ACM, 1978. URL: https://doi.org/10.1145/800133.804350.
  43. Rebecca Schulman and Erik Winfree. Synthesis of crystals with a programmable kinetic barrier to nucleation. Proceedings of the National Academy of Sciences, 104(39):15236-15241, 2007. Google Scholar
  44. Ahmed Shalaby, Chris Thachuk, and Damien Woods. Minimum free energy, partition function and kinetics simulation algorithms for a multistranded scaffolded DNA computer. In Ho-Lin Chen and Constantine G. Evans, editors, 29th International Conference on DNA Computing and Molecular Programming (DNA 29), volume 276 of Leibniz International Proceedings in Informatics (LIPIcs), pages 1:1-1:22, Dagstuhl, Germany, 2023. Schloss Dagstuhl - Leibniz-Zentrum für Informatik. URL: https://doi.org/10.4230/LIPIcs.DNA.29.1.
  45. Ahmed Shalaby and Damien Woods. An efficient algorithm to compute the minimum free energy of interacting nucleic acid strands, 2024. Arxiv preprint: URL: https://arxiv.org/pdf/2407.09676.
  46. Friedrich C Simmel, Bernard Yurke, and Hari R Singh. Principles and applications of nucleic acid strand displacement reactions. Chemical reviews, 119(10):6326-6369, 2019. Google Scholar
  47. David Soloveichik, Georg Seelig, and Erik Winfree. DNA as a universal substrate for chemical kinetics. Proceedings of the National Academy of Sciences, 107(12):5393-5398, 2010. Google Scholar
  48. Niranjan Srinivas, James Parkin, Georg Seelig, Erik Winfree, and David Soloveichik. Enzyme-free nucleic acid dynamical systems. Science, 358(6369):eaal2052, 2017. Google Scholar
  49. Yasuo Uemura, Aki Hasegawa, Satoshi Kobayashi, and Takashi Yokomori. Tree adjoining grammars for RNA structure prediction. Theoretical computer science, 210(2):277-303, 1999. Google Scholar
  50. Klaus W Wagner. The complexity of combinatorial problems with succinct input representation. Acta informatica, 23:325-356, 1986. Google Scholar
  51. Boya Wang, Chris Thachuk, Andrew D Ellington, Erik Winfree, and David Soloveichik. Effective design principles for leakless strand displacement systems. Proceedings of the National Academy of Sciences, 115(52):E12182-E12191, 2018. Google Scholar
  52. Bryan Wei, Mingjie Dai, and Peng Yin. Complex shapes self-assembled from single-stranded DNA tiles. Nature, 485(7400):623-626, 2012. Google Scholar
  53. Erik Winfree, Furong Liu, Lisa A Wenzler, and Nadrian C Seeman. Design and self-assembly of two-dimensional DNA crystals. Nature, 394(6693):539-544, 1998. Google Scholar
  54. Damien Woods, David Doty, Cameron Myhrvold, Joy Hui, Felix Zhou, Peng Yin, and Erik Winfree. Diverse and robust molecular algorithms using reprogrammable DNA self-assembly. Nature, 567(7748):366-372, 2019. Google Scholar
  55. David Yu Zhang and Georg Seelig. Dynamic DNA nanotechnology using strand-displacement reactions. Nature chemistry, 3(2):103-113, 2011. Google Scholar
  56. Michael Zuker and Patrick Stiegler. Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information. Nucleic acids research, 9(1):133-148, 1981. Google Scholar
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail