Linear Insertion Deletion Codes in the High-Noise and High-Rate Regimes

Authors Kuan Cheng , Zhengzhong Jin, Xin Li , Zhide Wei, Yu Zheng



PDF
Thumbnail PDF

File

LIPIcs.ICALP.2023.41.pdf
  • Filesize: 0.76 MB
  • 17 pages

Document Identifiers

Author Details

Kuan Cheng
  • Department of Computer Science, Peking University, Beijing, China
Zhengzhong Jin
  • Massachusetts Institute of Technology, Cambridge, MA, USA
Xin Li
  • Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
Zhide Wei
  • Department of Computer Science, Peking University, Beijing, China
Yu Zheng
  • Meta Platforms Inc

Cite AsGet BibTex

Kuan Cheng, Zhengzhong Jin, Xin Li, Zhide Wei, and Yu Zheng. Linear Insertion Deletion Codes in the High-Noise and High-Rate Regimes. In 50th International Colloquium on Automata, Languages, and Programming (ICALP 2023). Leibniz International Proceedings in Informatics (LIPIcs), Volume 261, pp. 41:1-41:17, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2023)
https://doi.org/10.4230/LIPIcs.ICALP.2023.41

Abstract

This work continues the study of linear error correcting codes against adversarial insertion deletion errors (insdel errors). Previously, the work of Cheng, Guruswami, Haeupler, and Li [Kuan Cheng et al., 2021] showed the existence of asymptotically good linear insdel codes that can correct arbitrarily close to 1 fraction of errors over some constant size alphabet, or achieve rate arbitrarily close to 1/2 even over the binary alphabet. As shown in [Kuan Cheng et al., 2021], these bounds are also the best possible. However, known explicit constructions in [Kuan Cheng et al., 2021], and subsequent improved constructions by Con, Shpilka, and Tamo [Con et al., 2022] all fall short of meeting these bounds. Over any constant size alphabet, they can only achieve rate < 1/8 or correct < 1/4 fraction of errors; over the binary alphabet, they can only achieve rate < 1/1216 or correct < 1/54 fraction of errors. Apparently, previous techniques face inherent barriers to achieve rate better than 1/4 or correct more than 1/2 fraction of errors. In this work we give new constructions of such codes that meet these bounds, namely, asymptotically good linear insdel codes that can correct arbitrarily close to 1 fraction of errors over some constant size alphabet, and binary asymptotically good linear insdel codes that can achieve rate arbitrarily close to 1/2. All our constructions are efficiently encodable and decodable. Our constructions are based on a novel approach of code concatenation, which embeds the index information implicitly into codewords. This significantly differs from previous techniques and may be of independent interest. Finally, we also prove the existence of linear concatenated insdel codes with parameters that match random linear codes, and propose a conjecture about linear insdel codes.

Subject Classification

ACM Subject Classification
  • Theory of computation → Error-correcting codes
Keywords
  • Error correcting code
  • Edit distance
  • Pseudorandomness
  • Derandomization

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads

References

  1. Khaled A.S. Abdel-Ghaffar, Hendrik C. Ferreira, and Ling Cheng. On linear and cyclic codes for correcting deletions. In 2007 IEEE International Symposium on Information Theory (ISIT), pages 851-855, 2007. Google Scholar
  2. Noga Alon, Oded Goldreich, Johan Håstad, and René Peralta. Simple constructions of almost k-wise independent random variables. Random Structures & Algorithms, 3(3):289-304, 1992. Google Scholar
  3. J. Bornholt, R. Lopez, D. M. Carmean, L. Ceze, G. Seelig, and K. Strauss. A dna-based archival storage system. ACM SIGARCH Comput. Archit. News, 44:637-649, 2016. Google Scholar
  4. Joshua Brakensiek, Venkatesan Guruswami, and Samuel Zbarsky. Efficient low-redundancy codes for correcting multiple deletions. IEEE Transactions on Information Theory, 64(5):3403-3410, 2018. Preliminary version in SODA 2016. Google Scholar
  5. Boris Bukh, Venkatesan Guruswami, and Johan Håstad. An improved bound on the fraction of correctable deletions. IEEE Trans. Information Theory, 63(1):93-103, 2017. Preliminary version in SODA 2016. URL: https://doi.org/10.1109/TIT.2016.2621044.
  6. Kuan Cheng, Venkatesan Guruswami, Bernhard Haeupler, and Xin Li. Efficient linear and affine codes for correcting insertions/deletions. In Proceedings of the 2021 ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1-20, 2021. URL: https://doi.org/10.1137/1.9781611976465.1.
  7. Kuan Cheng, Zhengzhong Jin, Xin Li, and Ke Wu. Block Edit Errors with Transpositions: Deterministic Document Exchange Protocols and Almost Optimal Binary Codes. In Christel Baier, Ioannis Chatzigiannakis, Paola Flocchini, and Stefano Leonardi, editors, 46th International Colloquium on Automata, Languages, and Programming (ICALP 2019), volume 132 of Leibniz International Proceedings in Informatics (LIPIcs), pages 37:1-37:15, Dagstuhl, Germany, 2019. Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik. URL: https://doi.org/10.4230/LIPIcs.ICALP.2019.37.
  8. Kuan Cheng, Zhengzhong Jin, Xin Li, and Ke Wu. Deterministic document exchange protocols, and almost optimal binary codes for edit errors. Journal of the ACM (JACM), 69(6):1-39, 2022. Preliminary version in 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS). Google Scholar
  9. Roni Con, Amir Shpilka, and Itzhak Tamo. Explicit and efficient constructions of linear codes against adversarial insertions and deletions. IEEE Transactions on Information Theory, 68(10):6516-6526, 2022. URL: https://doi.org/10.1109/TIT.2022.3173185.
  10. Roni Con, Amir Shpilka, and Itzhak Tamo. Reed solomon codes against adversarial insertions and deletions. In 2022 IEEE International Symposium on Information Theory (ISIT), pages 2940-2945, 2022. URL: https://doi.org/10.1109/ISIT50566.2022.9834672.
  11. V. Guruswami and M. Sudan. Improved decoding of reed-solomon and algebraic-geometry codes. IEEE Transactions on Information Theory, 45(6):1757-1767, 1999. URL: https://doi.org/10.1109/18.782097.
  12. Venkatesan Guruswami, Bernhard Haeupler, and Amirbehshad Shahrasbi. Optimally resilient codes for list-decoding from insertions and deletions. In Proccedings of the 52nd Annual ACM Symposium on Theory of Computing, pages 524-537, 2020. URL: https://doi.org/10.1145/3357713.3384262.
  13. Venkatesan Guruswami, Xiaoyu He, and Ray Li. The zero-rate threshold for adversarial bit-deletions is less than 1/2. IEEE Transactions on Information Theory, pages 1-1, 2022. URL: https://doi.org/10.1109/TIT.2022.3223023.
  14. Venkatesan Guruswami and Carol Wang. Deletion codes in the high-noise and high-rate regimes. IEEE Trans. Information Theory, 63(4):1961-1970, 2017. Google Scholar
  15. Bernhard Haeupler. Optimal document exchange and new codes for insertions and deletions. In 60th IEEE Annual Symposium on Foundations of Computer Science, pages 334-347, 2019. Google Scholar
  16. Bernhard Haeupler, Aviad Rubinstein, and Amirbehshad Shahrasbi. Near-Linear Time Insertion-Deletion Codes and (1+eps)-Approximating Edit Distance via Indexing. Proceeding of the ACM Symposium on Theory of Computing (STOC), pages 697-708, 2019. Google Scholar
  17. Bernhard Haeupler and Amirbehshad Shahrasbi. Synchronization strings: codes for insertions and deletions approaching the singleton bound. Journal of the ACM (JACM), 68(5):1-39, 2021. Preliminary version in 49th Annual ACM SIGACT Symposium on Theory of Computing (STOC). Google Scholar
  18. Bernhard Haeupler, Amirbehshad Shahrasbi, and Madhu Sudan. Synchronization strings: List decoding for insertions and deletions. Proceeding of the International Colloquium on Automata, Languages and Programming (ICALP), pages 76:1-76:14, 2018. URL: https://doi.org/10.4230/LIPIcs.ICALP.2018.76.
  19. Bernhard Haeupler, Amirbehshad Shahrasbi, and Ellen Vitercik. Synchronization strings: Channel simulations and interactive coding for insertions and deletions. Proceeding of the International Colloquium on Automata, Languages and Programming (ICALP), pages 75:1-75:14, 2018. URL: https://doi.org/10.4230/LIPIcs.ICALP.2018.75.
  20. Tomohiro Hayashi and Kenji Yasunaga. On the list decodability of insertions and deletions. In 2018 IEEE International Symposium on Information Theory (ISIT), pages 86-90. IEEE, 2018. Google Scholar
  21. Leonard J. Schulman and David Zuckerman. Asymptotically good codes correcting insertions, deletions, and transpositions. IEEE Trans. Inf. Theory, 45(7):2552-2557, 1999. Preliminary version in SODA 1997. URL: https://doi.org/10.1109/18.796406.
  22. Jin Sima and Jehoshua Bruck. Optimal k-deletion correcting codes. In IEEE International Symposium on Information Theory, pages 847-851, 2019. URL: https://doi.org/10.1109/ISIT.2019.8849750.
  23. C. Thommesen. The existence of binary linear concatenated codes with reed - solomon outer codes which asymptotically meet the gilbert- varshamov bound. IEEE Transactions on Information Theory, 29(6):850-853, 1983. URL: https://doi.org/10.1109/TIT.1983.1056765.
  24. Antonia Wachter-Zeh. List decoding of insertions and deletions. IEEE Transactions on Information Theory, 64(9):6297-6304, 2017. Google Scholar
  25. S. M. Hossein Tabatabaei Yazdi, Ryan Gabrys, and Olgica Milenkovic. Portable and error-free dna-based data storage. Scientific Reports, 7:2045-2322, 2017. URL: https://doi.org/10.1038/s41598-017-05188-1.
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail