Retrospective: Avoiding the Disk Bottleneck in the Data Domain Deduplication File System

Author Kai Li



PDF
Thumbnail PDF

File

LIPIcs.FUN.2024.33.pdf
  • Filesize: 444 kB
  • 4 pages

Document Identifiers

Author Details

Kai Li
  • Department of Computer Science, Princeton University, NJ, USA

Cite AsGet BibTex

Kai Li. Retrospective: Avoiding the Disk Bottleneck in the Data Domain Deduplication File System. In 12th International Conference on Fun with Algorithms (FUN 2024). Leibniz International Proceedings in Informatics (LIPIcs), Volume 291, pp. 33:1-33:4, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2024)
https://doi.org/10.4230/LIPIcs.FUN.2024.33

Abstract

The paper titled "Avoiding the Disk Bottleneck in the Data Domain Deduplication File System" [Zhu et al., 2008] describes several fundamental ideas behind the file system that drives Data Domain’s deduplication storage products. Initially submitted to the 2007 ACM SIGOPS Symposium on Operating System Principles (SOSP), the paper was rejected by its program committee. It was subsequently submitted and accepted for publication at the USENIX Conference on File And Storage Technologies (FAST) in 2008. Twelve years later, it was honored with the USENIX Test-of-Time Award. This retrospective explores the paper’s historical significance and impact, analyzes the reasons behind its initial rejection, and suggests methods to enhance the paper review process in the academic community.

Subject Classification

ACM Subject Classification
  • Hardware → Communication hardware, interfaces and storage
Keywords
  • Deduplication
  • file systems
  • compression

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads

References

  1. Athicha Muthitacharoen, Benjie Chen, and David Mazieres. A low-bandwidth network file system. In Proceedings of the eighteenth ACM symposium on Operating systems principles, pages 174-187, 2001. Google Scholar
  2. Sean Quinlan and Sean Dorward. Venti: A new approach to archival data storage. In Conference on File and Storage Technologies (FAST 02), Monterey, CA, January 2002. USENIX Association. URL: https://www.usenix.org/conference/fast-02/venti-new-approach-archival-data-storage.
  3. Benjamin Zhu, Kai Li, and R Hugo Patterson. Avoiding the disk bottleneck in the data domain deduplication file system. In Fast, volume 8, pages 1-14, 2008. Google Scholar
  4. Jacob Ziv and Abraham Lempel. Compression of individual sequences via variable-rate coding. IEEE transactions on Information Theory, 24(5):530-536, 1978. Google Scholar