DISASTER RECOVERY 
JOURNAL


P. O. Box 510110
St. Louis, MO 63151
(314) 894-0276 
Fax: (314) 894-7474
Internet
www.drj.com 
E-mail
drj@drj.com

PUBLISHER
Richard L. Arnold, CBCP
richard@drj.com

EDITOR-IN-CHIEF
Jon Seals
jon@drj.com

SENIOR EDITOR
Janette Ballman
janette@drj.com

COPY EDITORS
Richard Sandhofer
richards@drj.com
Pamela Clifton
pamelaclifton@hotmail.com

ADVERTISING 
Robert Arnold
bob@drj.com

_____________

Corporate

President/CEO
Richard L. Arnold, CBCP
richard@drj.com

Vice President 
Robert Arnold
bob@drj.com

CONFERENCE COORDINATOR
Patti Fitzgerald, CBCP
patti@drj.com

CONFERENCE REGISTRAR
Merce Knese
mercedes@drj.com

CIRCULATION
Laura Baugh
laurab@drj.com

EXECUTIVE COUNCIL
Jeff Dato, MBCP, KPMG
John Jackson, J Albright Advisors
Edward Devlin, E.S. Devlin & Associates
James Hammill, CBCP, JMH Consulting
Pat McAnally, SunGard Availability
Brian Turley, Strohl Systems
Belinda Wilson, Hewlett-Packard


INTERNATIONAL
CONTACTS
England: Thom Hetherington
Business Continuity 
Phone: 0161-237-1007
thomh@tempus.demon.co.uk

Australia: Anthony J. Harvey
Journal of Business Continuity
Phone: 0011-613-953-0055-8
fax: 0011-613-953-0528
sector@notability.com.au

Japan: Shinji Hosotsubo
Quake Japan Co., Ltd.
Phone: 03-3215-2880
fax: 03-3215-2881


 

Click Here for a Printable Version

Building a Comprehensive Disaster Recovery Plan
Integrating People, Processes and Technology to Prepare for Business Disruptions

By LINDA CERNI, CBCP

Disaster recovery has been top-of-mind for many IT managers as events that cause unplanned business downtime continue to surprise us; 2005 was no exception. Natural disasters, human conflicts and constant exposures to security breaches and attacks have driven organizations of all types and sizes to recognize the need to implement or improve their comprehensive business continuity plan (BCP) that includes a robust IT disaster recovery plan. What exactly does a disaster recovery plan entail? Backup polices? Recovery scripts? Security breach escalation procedures? An effective disaster recovery plan addresses not only the protection and recovery of technology, but as part of a broader BCP, also includes the people, processes, and procedures necessary to bring about the true end game: the ability for end users to manage corporate risk, respond to a potential disruption, and complete new business transactions while protecting historical transactions that together keep a company viable when faced with a disaster event.
That’s a tall order for most organizations. Many IT managers are prepared to take the first step to protect and secure their data; however, few are completely prepared to enable the resumption of critical business processes within a desired timeframe. According to a survey conducted by Applied Research, among 500 IT Managers, 70 percent of those surveyed have deployed data backup, replication, and recovery technologies. However, only slightly more than 54 percent of IT managers in the survey had a complete disaster recovery plan in place. Having recovered data but not applications and end users is a lot like having the sheet music but no instruments, musicians, concert hall or conductor to play the symphony. It’s just not going to get the job done.
According to Infonetics Research, large companies lose up to 16 percent of annual revenue due to unplanned network downtime. So what should IT consider while planning for disasters, events, or crises?

Understand and Communicate Needs
About one-third of respondents in Applied Research’s survey indicated they did not see a need for a disaster recovery plan. Awareness and communication, at the IT management level as well as executive level, is critical. Some leading questions that may help managers understand and communicate the need for an effective disaster recovery plan:

  • How long can we suffer a security attack or IT outage before there is a significant impact to customers/partners/corporate viability?
  • Is information currently available to everyone who is authorized to access it — and protected from everyone else?
  • Are the right policies in place to ensure that information is protected from both internal and external threats — and effectively recovered following an emergency?
  • Is the right data backed up, archived, and easily accessible for regulatory purposes — while unneeded information is permanently and securely deleted?
  • Is the data information put through a formal data information refresh process to ensure that data that is often stored at off-site facilities for extended periods of time is tested for integrity and can be used following technology refreshes and upgrades?
  • Is there a regularly maintained inventory of all purchased applications, tracking license numbers, most current version, versions maintained, and copies owned, with a prioritization in place for restoration?
  • Are application priorities known and agreed to? Are inter-dependencies between applications understood, and inter-dependencies between applications and business processes understood? Are these documented and agreed to by stakeholders?
  • Can improvements be made to the security and availability posture ahead of the most recent vulnerabilities and threats?
  • If an emergency arises that takes all or part of the company’s network down, how quickly can the organization return IT operations to a normal state?
  • Is the organization prepared to quickly and seamlessly restart business processes in order to continue operations?
  • Are we testing the plan to drive out deficiencies or to show success for audit or visibility purposes?
  • Do we truly understand the business needs to meet those service levels through proper investment to close the gap between expectations, realistic capabilities, and risk appetite?

IT managers should prepare a business case and communication with their executive team to obtain resources for disaster recovery plan development and execution. IT managers should also work with executive leadership to maintain an ongoing commitment to testing and change management.

Make It Happen
Thirty-three percent of respondents in Applied Research’s survey stated that the primary reason they did not have a plan in place was because of a lack of resources. Additionally, cost was the most frequently cited challenge for creating a disaster recovery plan.
It is important to note that the cost of developing a disaster recovery plan can be quite low, relative to the impact on recovery. Simply placing critical information such as employee and vendor contact lists, IT equipment lists, existing network diagrams and application manuals is a step in the right direction. You would be surprised how many businesses have not done this. Spending a half hour during a regular staff meeting to discuss who would be in charge at time of disaster and who would take responsibility for what, is another simple step. Asking management to approve of automatic higher spending limits for IT managers in the event of a disaster is another step. Disaster recovery planning is all about taking another step. It is never perfected but can always be improved, regardless of the level of available resources or funding.
Respondents in the Applied Research study consistently cited several motives for creating a disaster recovery plan (respondents could choose more than one): 61 percent cited data corruption, 59 percent cited component/application failure, 54 percent cited site outage, and 54 percent cited downtime required for maintenance.
Once IT has a clear understanding of the environment, key motivators, and possible exposures, take action. The first step is to prioritize identified risks and fill gaps that may exist via mitigation, acceptance, or assignment. These may be gaps in policy, technology, personnel, capabilities, processes, people, or all of the above. Filling these gaps helps harden the infrastructure against potential downtime. Without this step in place, IT may spend too much time recovering from emergencies, somewhat like bailing water out of a boat without repairing the leaks.
Action also includes ensuring the proper technologies, processes, and procedures are in place to recover from the unexpected. Regardless of how well IT might prepare, something can always happen. A ruptured sprinkler line in the server room can create a very bad day for IT personnel – which can turn into a nightmare without a solid, un-tested recovery plan in place.

Maintaining Control
Control is about managing the IT infrastructure for the highest level of resilience in the future. It’s about maintaining the highest operational state within the infrastructure, from servers to workstations to laptops. This starts from the moment a new piece of hardware or software is introduced into the environment. IT administrators need to be able to maintain control over the IT infrastructure to continuously ensure that client devices are secure, available and compliant with established corporate standards. Control means IT knows – not thinks, but knows – it can maintain the infrastructure in a known good state. It also means not just keeping up, but staying ahead.

Summary
Everyone wants to protect and guard against potential incidences – whether it’s a simple server outage or a catastrophic event. But in addition to the right technology, personnel, processes, and procedures, organizations must have a commitment to an effective business continuity plan, inclusive of more than just a disaster recovery plan, to make it work. This includes crossing some traditional boundaries to ensure that information is always secure and readily available. The entire system only works if all the parts are properly connected. The invisible “wall of silence” that often exists between the IT security and IT operations teams must drop in order for the IT organization to be able to fully understand, act, and control. It is all risk management at the core. In fact, with a broader view on the entire company’s operations, IT must communicate better and more often with the entire company. Creating a governance board comprised of senior IT, corporate, and business managers will greatly assist in building and maintaining an effective awareness and prioritization of business continuity with an organization.
Organizations should ask some important business questions before embarking on disaster recovery planning in order to prioritize investment spending and communicate the need to relevant parties. Additionally, business and IT executives should remember that building a disaster recovery plan is all well and good, but it is equally important to regularly test any disaster recovery plan in order to ensure that operations can be established as documented. Testing ensures that there are no surprises should an unplanned event occur.
While no organization can guarantee 100 percent resilience, they can take appropriate steps through proper planning that will help them quickly recover from any disruption to its infrastructure. By incorporating disaster recovery plans into the greater enterprise business continuity plan, organizations can protect corporate viability and ensure a continuity of operations to customers, partners and investors.


Linda Cerni, CBCP, is the worldwide business continuity practice director for Symantec Corporation. Cerni has been in the disaster planning, response, and recovery industry for more than 16 years. Her former employers include the American Red Cross, the Federal Emergency Management Agency, and the United Nations.


©Copyright Systems Support Inc. All rights reserved. Reproduction in whole or in part in any form or medium without the express written permission of System Support Inc. is prohibited.