Figure 5 Data stored after deduplication
In actual practice, data deduplication is often used in conjunction with other forms of data reduction
such as conventional data compression to offer the greatest reduction in data volume stored.
Technology types
There are a number of technology types that use different deduplication algorithms. The HP D2D
applies on-the-fly deduplication to chunks of backup data as they are written to the unit. It does not
matter which application was used to create the backup. For further information about deduplication
techniques and relative benefits, refer to the white papers at http://www.hp.com.
Data deduplication and the HP D2D
Data deduplication is applied per library device. When you configure the library, it defaults to
deduplication enabled. If you disable it, deduplication cannot be selectively applied to any data on
the library device. Compression is also disabled, if deduplication is disabled.
A library is associated with a host server and deduplication allows a greater amount of backup history
to be stored for that host. A larger number of full backups can be achieved, which makes possible a
rotation strategy with a longer retention history. It does not increase the number of host servers that
may be connected. The deduplication factor that has been applied to a library device is calculated
and displayed on the Web Interface. This figure is dynamic, it updates automatically as more data
is written to the library.
Benefits of HP D2D deduplication
The main benefit is that it is possible to store more data and retain data for longer on each cartridge
within a library. The host using the library device has access to a greater depth of historical data that
would otherwise have been archived offline.
Deduplication:
•
Provides efficient use of storage capacity by eliminating duplicate data
•
Reduces disk expenditures by reducing storage space required. You can get more backups for
each host on the same size of storage (HP D2D device)
•
Prolongs disk data retention periods
•
Reduces the volume of data that must be sent across a WAN for remote backups, replication, and
disaster recovery – reducing both risk and operational costs
D2D Backup System HP D2D4112
27
Need help?
Do you have a question about the StorageWorks D2D - Backup Systems and is the answer not in the manual?
Questions and answers