Linear Insertion Deletion Codes in the High-Noise and High-Rate Regimes

Cheng, Kuan; Jin, Zhengzhong; Li, Xin; Wei, Zhide; Zheng, Yu

doi:10.4230/LIPIcs.ICALP.2023.41

File

LIPIcs.ICALP.2023.41.pdf

Filesize: 0.76 MB
17 pages

Document Identifiers

DOI: 10.4230/LIPIcs.ICALP.2023.41
URN: urn:nbn:de:0030-drops-180931

Author Details

Kuan Cheng

Department of Computer Science, Peking University, Beijing, China

Zhengzhong Jin

Massachusetts Institute of Technology, Cambridge, MA, USA

Xin Li

Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA

Zhide Wei

Department of Computer Science, Peking University, Beijing, China

Yu Zheng

Meta Platforms Inc

Cite AsGet BibTex

Kuan Cheng, Zhengzhong Jin, Xin Li, Zhide Wei, and Yu Zheng. Linear Insertion Deletion Codes in the High-Noise and High-Rate Regimes. In 50th International Colloquium on Automata, Languages, and Programming (ICALP 2023). Leibniz International Proceedings in Informatics (LIPIcs), Volume 261, pp. 41:1-41:17, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2023)
https://doi.org/10.4230/LIPIcs.ICALP.2023.41

Abstract

This work continues the study of linear error correcting codes against adversarial insertion deletion errors (insdel errors). Previously, the work of Cheng, Guruswami, Haeupler, and Li [Kuan Cheng et al., 2021] showed the existence of asymptotically good linear insdel codes that can correct arbitrarily close to 1 fraction of errors over some constant size alphabet, or achieve rate arbitrarily close to 1/2 even over the binary alphabet. As shown in [Kuan Cheng et al., 2021], these bounds are also the best possible. However, known explicit constructions in [Kuan Cheng et al., 2021], and subsequent improved constructions by Con, Shpilka, and Tamo [Con et al., 2022] all fall short of meeting these bounds. Over any constant size alphabet, they can only achieve rate < 1/8 or correct < 1/4 fraction of errors; over the binary alphabet, they can only achieve rate < 1/1216 or correct < 1/54 fraction of errors. Apparently, previous techniques face inherent barriers to achieve rate better than 1/4 or correct more than 1/2 fraction of errors. In this work we give new constructions of such codes that meet these bounds, namely, asymptotically good linear insdel codes that can correct arbitrarily close to 1 fraction of errors over some constant size alphabet, and binary asymptotically good linear insdel codes that can achieve rate arbitrarily close to 1/2. All our constructions are efficiently encodable and decodable. Our constructions are based on a novel approach of code concatenation, which embeds the index information implicitly into codewords. This significantly differs from previous techniques and may be of independent interest. Finally, we also prove the existence of linear concatenated insdel codes with parameters that match random linear codes, and propose a conjecture about linear insdel codes.

Subject Classification

ACM Subject Classification

Theory of computation → Error-correcting codes

Keywords

Error correcting code
Edit distance
Pseudorandomness
Derandomization

Metrics

Access Statistics
Total Accesses (updated on a weekly basis)

0

PDF Downloads

0

Metadata Views

References

Khaled A.S. Abdel-Ghaffar, Hendrik C. Ferreira, and Ling Cheng. On linear and cyclic codes for correcting deletions. In 2007 IEEE International Symposium on Information Theory (ISIT), pages 851-855, 2007.
Noga Alon, Oded Goldreich, Johan Håstad, and René Peralta. Simple constructions of almost k-wise independent random variables. Random Structures & Algorithms, 3(3):289-304, 1992.
J. Bornholt, R. Lopez, D. M. Carmean, L. Ceze, G. Seelig, and K. Strauss. A dna-based archival storage system. ACM SIGARCH Comput. Archit. News, 44:637-649, 2016.
Joshua Brakensiek, Venkatesan Guruswami, and Samuel Zbarsky. Efficient low-redundancy codes for correcting multiple deletions. IEEE Transactions on Information Theory, 64(5):3403-3410, 2018. Preliminary version in SODA 2016.
Boris Bukh, Venkatesan Guruswami, and Johan Håstad. An improved bound on the fraction of correctable deletions. IEEE Trans. Information Theory, 63(1):93-103, 2017. Preliminary version in SODA 2016. URL: https://doi.org/10.1109/TIT.2016.2621044.
Kuan Cheng, Venkatesan Guruswami, Bernhard Haeupler, and Xin Li. Efficient linear and affine codes for correcting insertions/deletions. In Proceedings of the 2021 ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1-20, 2021. URL: https://doi.org/10.1137/1.9781611976465.1.
Kuan Cheng, Zhengzhong Jin, Xin Li, and Ke Wu. Block Edit Errors with Transpositions: Deterministic Document Exchange Protocols and Almost Optimal Binary Codes. In Christel Baier, Ioannis Chatzigiannakis, Paola Flocchini, and Stefano Leonardi, editors, 46th International Colloquium on Automata, Languages, and Programming (ICALP 2019), volume 132 of Leibniz International Proceedings in Informatics (LIPIcs), pages 37:1-37:15, Dagstuhl, Germany, 2019. Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik. URL: https://doi.org/10.4230/LIPIcs.ICALP.2019.37.
Kuan Cheng, Zhengzhong Jin, Xin Li, and Ke Wu. Deterministic document exchange protocols, and almost optimal binary codes for edit errors. Journal of the ACM (JACM), 69(6):1-39, 2022. Preliminary version in 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS).
Roni Con, Amir Shpilka, and Itzhak Tamo. Explicit and efficient constructions of linear codes against adversarial insertions and deletions. IEEE Transactions on Information Theory, 68(10):6516-6526, 2022. URL: https://doi.org/10.1109/TIT.2022.3173185.
Roni Con, Amir Shpilka, and Itzhak Tamo. Reed solomon codes against adversarial insertions and deletions. In 2022 IEEE International Symposium on Information Theory (ISIT), pages 2940-2945, 2022. URL: https://doi.org/10.1109/ISIT50566.2022.9834672.
V. Guruswami and M. Sudan. Improved decoding of reed-solomon and algebraic-geometry codes. IEEE Transactions on Information Theory, 45(6):1757-1767, 1999. URL: https://doi.org/10.1109/18.782097.
Venkatesan Guruswami, Bernhard Haeupler, and Amirbehshad Shahrasbi. Optimally resilient codes for list-decoding from insertions and deletions. In Proccedings of the 52nd Annual ACM Symposium on Theory of Computing, pages 524-537, 2020. URL: https://doi.org/10.1145/3357713.3384262.
Venkatesan Guruswami, Xiaoyu He, and Ray Li. The zero-rate threshold for adversarial bit-deletions is less than 1/2. IEEE Transactions on Information Theory, pages 1-1, 2022. URL: https://doi.org/10.1109/TIT.2022.3223023.
Venkatesan Guruswami and Carol Wang. Deletion codes in the high-noise and high-rate regimes. IEEE Trans. Information Theory, 63(4):1961-1970, 2017.
Bernhard Haeupler. Optimal document exchange and new codes for insertions and deletions. In 60th IEEE Annual Symposium on Foundations of Computer Science, pages 334-347, 2019.
Bernhard Haeupler, Aviad Rubinstein, and Amirbehshad Shahrasbi. Near-Linear Time Insertion-Deletion Codes and (1+eps)-Approximating Edit Distance via Indexing. Proceeding of the ACM Symposium on Theory of Computing (STOC), pages 697-708, 2019.
Bernhard Haeupler and Amirbehshad Shahrasbi. Synchronization strings: codes for insertions and deletions approaching the singleton bound. Journal of the ACM (JACM), 68(5):1-39, 2021. Preliminary version in 49th Annual ACM SIGACT Symposium on Theory of Computing (STOC).
Bernhard Haeupler, Amirbehshad Shahrasbi, and Madhu Sudan. Synchronization strings: List decoding for insertions and deletions. Proceeding of the International Colloquium on Automata, Languages and Programming (ICALP), pages 76:1-76:14, 2018. URL: https://doi.org/10.4230/LIPIcs.ICALP.2018.76.
Bernhard Haeupler, Amirbehshad Shahrasbi, and Ellen Vitercik. Synchronization strings: Channel simulations and interactive coding for insertions and deletions. Proceeding of the International Colloquium on Automata, Languages and Programming (ICALP), pages 75:1-75:14, 2018. URL: https://doi.org/10.4230/LIPIcs.ICALP.2018.75.
Tomohiro Hayashi and Kenji Yasunaga. On the list decodability of insertions and deletions. In 2018 IEEE International Symposium on Information Theory (ISIT), pages 86-90. IEEE, 2018.
Leonard J. Schulman and David Zuckerman. Asymptotically good codes correcting insertions, deletions, and transpositions. IEEE Trans. Inf. Theory, 45(7):2552-2557, 1999. Preliminary version in SODA 1997. URL: https://doi.org/10.1109/18.796406.
Jin Sima and Jehoshua Bruck. Optimal k-deletion correcting codes. In IEEE International Symposium on Information Theory, pages 847-851, 2019. URL: https://doi.org/10.1109/ISIT.2019.8849750.
C. Thommesen. The existence of binary linear concatenated codes with reed - solomon outer codes which asymptotically meet the gilbert- varshamov bound. IEEE Transactions on Information Theory, 29(6):850-853, 1983. URL: https://doi.org/10.1109/TIT.1983.1056765.
Antonia Wachter-Zeh. List decoding of insertions and deletions. IEEE Transactions on Information Theory, 64(9):6297-6304, 2017.
S. M. Hossein Tabatabaei Yazdi, Ryan Gabrys, and Olgica Milenkovic. Portable and error-free dna-based data storage. Scientific Reports, 7:2045-2322, 2017. URL: https://doi.org/10.1038/s41598-017-05188-1.

Linear Insertion Deletion Codes in the High-Noise and High-Rate Regimes

Authors Kuan Cheng , Zhengzhong Jin, Xin Li , Zhide Wei, Yu Zheng

File

Document Identifiers

Author Details

Cite AsGet BibTex

Abstract

Subject Classification

ACM Subject Classification

Keywords

Metrics

References

Thanks for your feedback!

Could not send message

Linear Insertion Deletion Codes in the High-Noise and High-Rate Regimes

Authors Kuan Cheng , Zhengzhong Jin, Xin Li , Zhide Wei, Yu Zheng

File

Document Identifiers

Author Details

Funding

Cite AsGet BibTex

Abstract

Subject Classification

ACM Subject Classification

Keywords

Metrics

Related Versions

References

Thanks for your feedback!

Could not send message