,
Laurent Bulteau
,
Yann Ponty
Creative Commons Attribution 4.0 International license
Inverse folding is a classic instance of negative RNA design which consists in finding a sequence that uniquely folds into a target secondary structure with respect to energy minimization. A breakthrough result of Bonnet et al. shows that, even in simple base pairs-based (BP) models, the decision version of a mildly constrained version of inverse folding is NP-hard.
In this work, we show that inverse folding can be solved in linear time for a large collection of targets, including every structure that contains no isolated BP and no isolated stack (or, equivalently, when all helices consist of 3^{+} base pairs). For structures featuring shorter helices, our linear algorithm is no longer guaranteed to produce a solution, but still does so for a large proportion of instances.
Our approach introduces a notion of modulo m-separability, generalizing a property pioneered by Hales et al. Separability is a sufficient condition for the existence of a solution to the inverse folding problem. We show that, for any input secondary structure of length n, a modulo m-separated sequence can be produced in time 𝒪(n 2^m) anytime such a sequence exists. Meanwhile, we show that any structure consisting of 3^{+} base pairs is either trivially non-designable, or always admits a modulo-2 separated solution (m = 2). Solution sequences can thus be produced in linear time, and even be uniformly generated within the set of modulo-2 separable sequences.
@InProceedings{boury_et_al:LIPIcs.WABI.2024.19,
author = {Boury, Th\'{e}o and Bulteau, Laurent and Ponty, Yann},
title = {{RNA Inverse Folding Can Be Solved in Linear Time for Structures Without Isolated Stacks or Base Pairs}},
booktitle = {24th International Workshop on Algorithms in Bioinformatics (WABI 2024)},
pages = {19:1--19:23},
series = {Leibniz International Proceedings in Informatics (LIPIcs)},
ISBN = {978-3-95977-340-9},
ISSN = {1868-8969},
year = {2024},
volume = {312},
editor = {Pissis, Solon P. and Sung, Wing-Kin},
publisher = {Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
address = {Dagstuhl, Germany},
URL = {https://drops.dagstuhl.de/entities/document/10.4230/LIPIcs.WABI.2024.19},
URN = {urn:nbn:de:0030-drops-206632},
doi = {10.4230/LIPIcs.WABI.2024.19},
annote = {Keywords: RNA structure, String Design, Parameterized Complexity, Uniform Sampling}
}
archived version
archived version