DISASTER RECOVERY 
JOURNAL


P. O. Box 510110
St. Louis, MO 63151
(314) 894-0276 
Fax: (314) 894-7474
Internet
www.drj.com 
E-mail
drj@drj.com

PUBLISHER
Richard L. Arnold, CBCP
richard@drj.com

EDITOR-IN-CHIEF
Jon Seals
jon@drj.com

SENIOR EDITOR
Janette Ballman
janette@drj.com

COPY EDITORS
Richard Sandhofer
richards@drj.com
Pamela Clifton
pamelaclifton@hotmail.com

ADVERTISING 
Robert Arnold
bob@drj.com

_____________

Corporate

President/CEO
Richard L. Arnold, CBCP
richard@drj.com

Vice President 
Robert Arnold
bob@drj.com

CONFERENCE COORDINATOR
Patti Fitzgerald, CBCP
patti@drj.com

CONFERENCE REGISTRAR
Merce Knese
mercedes@drj.com

CIRCULATION
Laura Baugh
laurab@drj.com

EXECUTIVE COUNCIL
Jeff Dato, MBCP, KPMG
John Jackson, J Albright Advisors
Edward Devlin, E.S. Devlin & Associates
James Hammill, CBCP, JMH Consulting
Pat McAnally, SunGard Availability
Brian Turley, Strohl Systems
Belinda Wilson, Hewlett-Packard


INTERNATIONAL
CONTACTS
England: Thom Hetherington
Business Continuity 
Phone: 0161-237-1007
thomh@tempus.demon.co.uk

Australia: Anthony J. Harvey
Journal of Business Continuity
Phone: 0011-613-953-0055-8
fax: 0011-613-953-0528
sector@notability.com.au

Japan: Shinji Hosotsubo
Quake Japan Co., Ltd.
Phone: 03-3215-2880
fax: 03-3215-2881


 

Click Here for a Printable Version

By MAX LYDAVINSKY

Virtualization technology presents a classic dilemma for IT departments. On the positive side, virtualization offers clear economic and operational benefits for consolidating servers, supporting legacy applications, and managing pre-production test environments. Yet it also presents a disaster-recovery challenge. Virtualization not only puts many eggs in one basket, magnifying the impact of hardware failures; it also complicates backup and recovery. Using simple back-up techniques on a server that hosts multiple virtual machines isn’t sufficient to ensure that the state of individual virtual machines is fully recoverable. IT managers that are planning to employ virtualization are discovering that they need data protection approaches and disaster recovery plans that handle these realities and afford virtual machines the same safeguards as traditional servers.
These data-protection plans need not be vastly different than those organizations already in place. Many companies already use imaging technology to protect and restore their file, database, and application servers. These same imaging technologies, in conjunction with agents that run within individual virtual-machine instances, can provide a comprehensive, reliable, and cost-effective data-recovery solution that protects virtual as well as physical servers. This article outlines common virtualization scenarios and best practices for protecting and recovering the critical corporate data present in each.

Virtualization is Coming Around Again
Virtualization technology has many variants. IT veterans will recall that time-sharing, a form of virtualization, has been used since the 1960s to allocate precious mainframe resources amongst different companies or departments. On today’s PC servers, virtualization has been gaining in appeal as IT organizations seek to trim the cost of managing and maintaining large numbers of physical servers. The trend is being fueled by increasingly powerful hardware and a proliferation of capable virtualization products.
PC-server virtualization products implement virtualization by running a virtual server application on top of a “host” operating system to divide the hardware resources of the physical server – CPU, RAM, video, network – into separate and independent “guest” environments. Each guest environment can include one or more virtual hard disks and can run its own operating system and set of applications.
Data centers can realize numerous benefits from virtualization. Instead of being limited to a single operating system on each physical computer, companies can support legacy applications by deploying multiple environments on the same server. Companies can use virtual servers to eliminate costs of managing and upgrading legacy hardware by migrating older applications onto virtual machines running on new, reliable hardware. They can also consolidate low-use departmental servers onto a single physical server to decrease management complexity.
Additionally, virtualization continues to play a key role in pre-production test environments. Software-development and testing groups have long used virtual machines to simplify the creation and re-creation of realistic test environments. Because virtual machine partitions are encapsulated as single files on the host machine, they can be moved and copied easily, which makes them easy to deploy for any given test scenario.
It is in production environments, however, where the virtualization makes the disaster-recovery challenges most acute. In production environments, it’s often impractical to power down a virtual machine so that it can be copied wholesale. Production environments are also subject to constant changes – not just data, but applications and operating systems as well, thanks to the need to apply hotfixes and other updates.

Protecting Dynamic Production Environments
Virtual machines used for production purposes, like any production server, contain a constantly changing set of user data, settings, applications, and operating system files that must be protected. Gone are the days when the operating system and applications stayed essentially constant so administrators only needed to worry about protecting user data.
Security concerns, OS patches, anti-virus and malware updates, along with the desire and ability to upgrade features more frequently has led operating system, security, and application vendors into a steady stream of updates. If administrators wish to create data protection strategies that lead to successful and safe restores, they must not only have current versions of user data but also up-to-date images of these dynamic operating system and applications. Restoring an out-of-date operating system with vulnerabilities during a recovery effort could introduce significant security risks at a time when adminstrators need a stable foundation.
Some companies attempt to use virtual machines alone for disaster recovery. Since virtual hard disks are typically comprised of a single file holding the operating system, settings, and application stack, administators might plan to deploy this single file onto a different physical server when disaster recovery becomes necessary. For example, a company might have an alternate data center location with any number of virtual hard disk files that mirror a production server configuration. In the case of any service disruption, the virtual backups can be put into service. Many virtualization vendors even include tools to create virtual hard disk files from a physical disk. The problem with these approaches is that these virtual machine files can quickly become out-of-date and copying new versions requires shutting down the production machine.

Back Up From ‘Inside’ Virtual Machines
A better approach is to apply imaging technologies and products that can protect virtual machines even when they’re running by creating frequent images of the live production data, operating system, and applications.
Two mechanisms for protecting the virtual server are possible in principle. One is to backup the files that comprise the virtual machines from “outside” – that is, from the host operating system. The other is to backup the virtual machines from “inside,” treating each guest VM as a distinct physical server that needs to be backed up independently.
In practice, backing up production virtual machines from outside is insufficient. Because running virtual machines keep state information in memory, simply backing up the virtual hard disks cannot reliably capture the complete state of a running virtual machine. Again, just as with copying virtual hard disk files, it is possible to back up virtual machines when they are not running. But that would mean shutting them down during the back-up cycle – rarely a realistic option in a production environment. Backing up the live environments from within each virtual machine is thus the preferred approach in a production environment.
Specifically, administrators need to use imaging software that can run both on the host and within each virtual machine. Administrators can then back up on demand or schedule backups for the host and each virtual machine as desired, without incurring downtime. One common approach for protecting multiple virtual machines in this manner is to use lightweight distributed agents. One agent runs on each virtual machine; a central management console controls on-demand and scheduled image creation on any virtual machine or physical server with an agent installed.

Imaging is Fast and Efficient
Imaging itself is a fast and efficient alternative to using file-based back-up solutions. By using disk-to-disk techniques along with incremental or differential imaging technologies, administrators can protect the operating system, applications, settings, and data for the host operating system and for each virtual machine in a manner that is faster, more comprehensive, and simpler than traditional file-by-file backup to tape devices. The resulting back-up image can be stored on a separate physical drive attached to the virtual server or on any network-accessible drive. Imaging shares some of the simplicity of virtual machine files in that the image is just a single file that can be moved around easily.
By creating and verifying complete images of live hard disks and partitions, administrators can be certain that all data, settings, applications, and operating system files are protected. In a typical scenario, an administrator might schedule a full image creation for each virtual machine once each week and incremental or differential images daily throughout the week. An administrator can, of course, schedule or launch incremental or differential images or perform backups more or less frequently as required.
Imaging offerings today take different approaches to restoring to dissimilar hardware. The classical approach is to strip out the security identifier (SID) and all information about the network environment. You can think of it as the Microsoft Sysprep approach. This way, once the data is restored to the new hardware, an engineer will still have to reconfigure the system manually in order for it to be recognized on the network.
A more effective way to handle this is to maintain the SID and network domain information when restoring the system. When the system reboots, it will be network-aware and part of its appropriate domain, significantly reducing the amount of downtime and possible technical-induced errors from misconfiguring the network.
But creating an image is only the beginning – restoring images must be simple, fast, and trouble-free. If a data center suffers a catastrophic failure, such as a fire that destroys several mission-critical servers for example, the IT manager will want the best engineers solving that problem. Imaging software that employs an easy-to-use GUI and straightforward restore commands, such as Acronis True Image Enterprise Server, allows the IT manager to have technicians restore physical and virtual servers while the high-level engineers repair the catastrophic failures.

Old and New Threats
Imaging protects companies from unplanned downtime and data loss. Virtual servers are subject to the same variety of loss scenarios as traditional servers, as well as some additional ones that arise from the nature of the virtualization technology. These loss scenarios include:

  • Complete hardware loss due to theft, fire, flooding, or similar disasters;
  • Hard-disk corruption or failure;
  • Compromise of host operating system, whether by virus or similar malware, software failure, intentional hacking, or human error;
  • Compromise of guest operating system, by any of the mechanisms that can compromise the host OS;
  • Human error, including accidental deletion or modification of a virtual machine or virtual hard disk or its files on the host.

Disaster Recovery
When using imaging technologies, when any part of a production virtual machine fails, administrators need fast and simple recovery steps. There are typically a number of recovery options designed to get the production virtual machines running again:

  • For data and non-system partitions within a virtual machine, administrators can recover the partition in minutes using the latest image file for that partition;
  • For individual files within a virtual machine, most imaging products provide a selective file restore feature;
  • For system partitions within a virtual machine, administrators can restore the partition easily using a bootable rescue CD and then pointing to the latest image file;
  • In the case of full system loss, administrators can restore the host operating system and images back to the repaired hardware or new hardware and then restore each running virtual machine as outlined above.

The Future of Virtualization
Virtualization provides major cost and management benefits for corporate data centers. With advances in 64-bit hardware and multi-processor servers and the accompanying increase in CPU power and RAM capacity along with cost-effective clustering solutions, servers will increasingly be capable of supporting larger numbers of virtual machines.
Agent-based imaging technology enables IT managers to provide these virtual machines with the same protection as physical servers, so they can rest assured their data, settings, applications, and operating systems are protected against disaster. By addressing a live production environment where downtime is an expensive option, imaging technologies provide a more realistic protection and disaster recovery approach than simply copying virtual hard disk files. When disaster does hit, having an up-to-date image of the compromised server will ensure a smooth recovery.


Max Lydavinsky is the director of engineering for Acronis, Inc.


 

©Copyright Systems Support Inc. All rights reserved. Reproduction in whole or in part in any form or medium without the express written permission of System Support Inc. is prohibited.