Reducing Fragmentation for In-line Deduplication Backup Storage via Exploiting Backup History and Cache Knowledge

0
888
Reducing Fragmentation for In-line Deduplication Backup Storage via Exploiting Backup History and Cache Knowledge

Reducing Fragmentation for In-line Deduplication Backup Storage via Exploiting Backup History and Cache Knowledge

Abstract

The chunks of each backup are physically scattered after deduplication in backup systems, which causes a challenging fragmentation problem. Reducing Fragmentation for In-line Deduplication Backup Storage via Exploiting Backup History and Cache Knowledge We observe the fragmentation in sparse and out-of-order containers. The sparse container decreases efficiency in restore performance and collection of garbage, while the out – of-order container decreases performance in restore if the restore cache is small.

 

We propose Reducing Fragmentation for In-line Deduplication Backup Storage via Exploiting Backup History and Cache Knowledge. HAR exploits historical information in backup systems to accurately identify and reduce sparse containers, and CAF exploits restore cache knowledge to identify out – of-order containers that hurt performance restore. In datasets where out – of-order containers are dominant, CAF efficiently complements HAR.

System Configuration

H/W System Configuration
Speed                   : 1.1 GHz
RAM                      : 256 MB(min)
Hard Disk              : 20 GB
Floppy Drive          : 1.44 MB
Key Board             : Standard Windows Keyboard
Mouse                  : Two or Three Button Mouse
Monitor                : SVGA
S/W System Configuration

Platform                     :  cloud computing

Operating system       : Windows Xp,7,
Server                       : WAMP/Apache
Working on                : Browser Like Firefox, IE

Conclusion

Hybrid cloud storage is useful to further improve performance in datasets where out – of-charge containers are dominant. To avoid a significant decrease in the hybrid scheme’s deduplication ratio, we develop a two-algorithm such as container marker algorithm and history-conscious rewriting algorithm to exploit backup history and cache knowledge. With the help of CMA, the hybrid scheme significantly improves the deduplication ratio without reducing restore performance.