4.6 Article

De-Frag: an efficient scheme to improve deduplication performance via reducing data placement de-linearization

出版社

SPRINGER
DOI: 10.1007/s10586-014-0397-5

关键词

Data deduplication; Data placement de-linearization; Spatial locality

资金

  1. Central Universities Fundamental Research Foundation of China [106112013CDJZR180009, CDJZR14185501]
  2. Chongqing Basic and Frontier Research Project of China [cstc2013jcyjA40016, cstc2012ggC40005, cstc2013jcyjA40025]
  3. Research Fund for the Doctoral Program of Higher Education of China [20130191120031, 20130191120030]
  4. National Natural Science Foundation of China [61309004]
  5. National High Technology Research and Development (863 Program) of China [2013AA013202]
  6. National Basic Research 973 Program of China [2011CB302301]
  7. NSFC [61025008]
  8. U.S. National Science Foundation (NSF) [CCF-1102624, CNS-1218960]
  9. Direct For Computer & Info Scie & Enginr
  10. Division Of Computer and Network Systems [1218960] Funding Source: National Science Foundation

向作者/读者索取更多资源

Data deduplication has become a commodity in large-scale storage systems, especially in data backup and archival systems. However, due to the removal of redundant data, data deduplication de-linearizes data placement and forces the data chunks of the same data object to be divided into multiple separate units. In our preliminary study, we found that the de-linearization of data placement compromises the data spatial locality that is used to improve data read performance, deduplication throughput and deduplication efficiency in some deduplication approaches, which significantly affects deduplication performance and makes some deduplication approaches become less effective. In this paper, we first analyze the negative effect of data placement de-linearization to deduplication performance, and then propose an effective approach called De-Frag to reduce the de-linearization of data placement. The key idea of De-Frag is to choose some redundant data to be written to the disks rather than be removed. It quantifies the spatial locality of each chunk group by spatial locality level (SPL for short) and writes the redundant chunks to disks when SPL value is smaller than a preset value, thus to reduce the de-linearization of data placement and enhance the spatial locality. As shown in our experimental results driven by real world datasets, De-Frag effectively enhances data spatial locality and improves deduplication throughput, deduplication efficiency, and data read performance, at the cost of slightly lower compression ratios.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据