
Leveraging Data Deduplication to Improve the Performance of Primary Storage Systems in the Cloud
Abstract
In addition, directly applying data deduplication to primary cloud storage systems is likely to cause space contention in memory and data fragmentation on disks. Based on these observations, we propose a performance-oriented I/O deduplication scheme, called POD, rather than a capacity-oriented one, exemplified by iDedup, to improve the I/O performance of primary cloud storage systems without sacrificing the capacity savings that the latter provides.
Advantages
- POD significantly improves the I/O performance of primary storage systems in the Cloud while also saving storage capacity.
Disadvantages
- From a performance perspective, existing data deduplication schemes fail to consider the workload characteristics of primary storage systems, and so miss the opportunity to address one of the most important issues in primary storage: performance.
System Configuration
Platform: cloud computing
Conclusion
In this paper, we propose POD, a performance-oriented deduplication scheme, to improve the performance of primary cloud storage systems by leveraging data deduplication on the I/O path to remove redundant write requests while also saving storage space. It takes a request-based selective deduplication approach (Select-Dedupe) to deduplicate redundant I/O on the critical I/O path in a way that minimizes data fragmentation.
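
The request-based selective idea can be illustrated with a small in-memory sketch. This is not the POD implementation. It is a simplified model under stated assumptions: 4 KB chunks, SHA-1 fingerprints, and a policy that removes a write request only when every chunk in it is already stored and those stored copies are physically consecutive. The class name SelectDedupeSketch and all of its fields are hypothetical names introduced here for illustration.

```python
import hashlib

CHUNK_SIZE = 4096  # assumed chunk size in bytes (illustrative, not from the source)


class SelectDedupeSketch:
    """Toy in-memory model of request-based selective deduplication."""

    def __init__(self):
        self.index = {}          # fingerprint -> physical block number
        self.blocks = []         # simulated disk: physical block number -> bytes
        self.lba_map = {}        # logical block address -> physical block number
        self.writes_removed = 0  # write requests eliminated from the I/O path

    @staticmethod
    def _chunks(data):
        return [data[i:i + CHUNK_SIZE] for i in range(0, len(data), CHUNK_SIZE)]

    def write(self, lba, data):
        """Handle one write request; return True if data was written to disk."""
        chunks = self._chunks(data)
        fps = [hashlib.sha1(c).digest() for c in chunks]
        pbns = [self.index.get(fp) for fp in fps]

        # Assumed policy: deduplicate the request as a whole, and only when
        # the existing copies are laid out sequentially, so that later reads
        # of this logical range do not become fragmented.
        fully_redundant = bool(pbns) and all(p is not None for p in pbns)
        sequential = fully_redundant and all(
            b - a == 1 for a, b in zip(pbns, pbns[1:])
        )
        if sequential:
            for offset, pbn in enumerate(pbns):
                self.lba_map[lba + offset] = pbn  # remap only, no disk write
            self.writes_removed += 1
            return False

        # Otherwise write the whole request contiguously, even if some of
        # its chunks are duplicates, trading capacity for sequential layout.
        for offset, (fp, chunk) in enumerate(zip(fps, chunks)):
            pbn = len(self.blocks)
            self.blocks.append(chunk)
            self.index[fp] = pbn
            self.lba_map[lba + offset] = pbn
        return True

    def read(self, lba, nblocks):
        return b"".join(self.blocks[self.lba_map[lba + i]] for i in range(nblocks))
```

In this sketch, a second write of identical data is removed from the I/O path and only the logical-to-physical mapping is updated, while a partially redundant request is written in full so that its blocks stay contiguous. A capacity-oriented scheme would instead deduplicate every redundant chunk, at the cost of scattering a request's blocks across the disk.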