4.7 Article

A Comprehensive Study of the Past, Present, and Future of Data Deduplication

期刊

PROCEEDINGS OF THE IEEE
卷 104, 期 9, 页码 1681-1710

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JPROC.2016.2571298

关键词

Data compression; data deduplication; data reduction; delta compression; storage security; storage systems

资金

  1. National Science Foundation of China (NSFC) [61502190, 61232004]
  2. 863 Project [2013AA013203]
  3. State Key Laboratory of Computer Architecture [CARCH201505]
  4. Fundamental Research Funds for the Central Universities, HUST [2015MS073]
  5. U.S. National Science Foundation (NSF) [CNS-1116606, CNS-1016609]
  6. Key Laboratory of Information Storage System, Ministry of Education, China
  7. EMC Corporation
  8. Direct For Computer & Info Scie & Enginr
  9. Division Of Computer and Network Systems [1116606] Funding Source: National Science Foundation

向作者/读者索取更多资源

Data deduplication, an efficient approach to data reduction, has gained increasing attention and popularity in large-scale storage systems due to the explosive growth of digital data. It eliminates redundant data at the file or subfile level and identifies duplicate content by its cryptographically secure hash signature (i.e., collision-resistant fingerprint), which is shown to be much more computationally efficient than the traditional compression approaches in large-scale storage systems. In this paper, we first review the background and key features of data deduplication, then summarize and classify the state-of-the-art research in data deduplication according to the key workflow of the data deduplication process. The summary and taxonomy of the state of the art on deduplication help identify and understand the most important design considerations for data deduplication systems. In addition, we discuss the main applications and industry trend of data deduplication, and provide a list of the publicly available sources for deduplication research and studies. Finally, we outline the open problems and future research directions facing deduplication-based storage systems.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据