Albireo Software Development Kit (SDK)
for Virtual Environment Backup
Both server and desktop virtualization have compelling propositions for IT teams. The use of Virtual Machines (VMs) has caused organizations of all sizes to rethink their data protection strategies. Many companies have realized that having a server backup and disaster recovery plan built specifically for virtual environments is essential. New data protection software offerings have emerged to address these specific requirements, and one key element of a strong VM backup strategy is deduplication.
Since the backup needs of virtual environments involve backing up images of each VM, the application
of deduplication technology lends itself perfectly because there is a high duplicate ratio. A dedupe rate of 35X or higher is very common, saving storage space and reducing the impact on backup cycles. Albireo provides extremely granular (4 KB chunk) deduplication at a very low resource cost, so when applied to VM backup operations Albireo delivers extremely high backup efficiency and the highest possible cost reduction while improving overall operations (including recovery cycles).
Permabit Albireo is the industry’s first purpose-built OEM data deduplication software designed to meet the needs of hardware, software, and service providers who wish to expand their existing solutions without negatively impacting existing differentiating capabilities or reducing overall performance. Albireo delivers deduplication at the sub-file level and can be flexibly integrated into current or next-generation storage and platform architectures. Albireo deduplication is seamlessly deployed in primary, archive, and backup storage across the data center and the cloud. With Albireo, OEMs leverage their existing R&D investments while accelerating time to market for must-have, industry-leading data optimization capabilities.
An initial deduplication implementation for VM backup software can be completed in a matter of days using the Albireo SDK. As shown in Figure 1, Albireo operates outside the data path of the storage application software as a duplicate advisory service. This ensures that data integrity is never at risk and that there is zero performance impact.

Figure 1: Albireo Process Flow
Deduplication Efficiency
The Albireo architecture supports both fixed and variable block deduplication. Data can be intelligently segmented into chunks of variable length based on analysis of its content. Albireo uses content “scanners” to identify and optimize deduplication of objects within specific compound data formats (e.g. Microsoft® Office® documents, ZIP, PDF, tar). An API is available for vendors to implement their own application-specific scanners for further savings. Independent validation has demonstrated that Albireo offers 7X data reduction for general purpose environments, while data reduction rates of over 35X are seen for virtualization images generated by products such as VMware®.
High Performance Index
The Albireo index utilizes advanced techniques to identify duplicates across massive storage pools using memory-resident information over 99% of the time, avoiding costly disk access and eliminating the largest single bottleneck in storage deduplication. Index operations average less than 10 microseconds — orders of magnitude faster than other deduplication solutions. This enables sustainable ingestion rates of 11 GB per second with a single processor core (64 KB chunks and hardware-accelerated hashing) and scales out linearly by leveraging Albireo grid technology.
Flexible Deployment Options
Albireo is integrated by the OEM as an inline, post-process, or parallel solution, depending on which is best suited to address the customer requirements for the vendor’s architecture. Source-based deduplication deployment (where data is deduplicated at the source for bandwidth optimization) is supported along with the more traditional target-based approach.
Complete SDK for Easy Implementation
Albireo is delivered as a Software Development Kit (SDK). The SDK contains the Albireo software library, full API documentation, code samples, and application notes for integration with primary, archive, and backup storage solutions. Permabit provides technical assistance to ensure rapid and seamless integration. OEM customers typically implement Albireo deduplication (and begin performance optimization and testing) in under 5 days.
Features and Benefits

Table 1: Albireo — Features and Benefits
Find out More
To learn more about Permabit Albireo technology, or to license our products, visit our website at permabit.com or call us directly at 617.252.9600.