|
DISASTER
RECOVERY
JOURNAL
P. O. Box 510110
St. Louis, MO 63151
(314) 894-0276
Fax: (314) 894-7474
Internet
www.drj.com
E-mail drj@drj.com
PUBLISHER &
EDITOR-IN-CHIEF
Richard L. Arnold, CBCP
richard@drj.com
SENIOR EDITOR
Janette Ballman
janette@drj.com
MANAGING EDITOR
Jon Seals
jon@drj.com
COPY EDITORS
Richard Sandhofer
richards@drj.com Pamela
Clifton
pamelaclifton@hotmail.com
ADVERTISING
Robert Arnold
bob@drj.com
_____________
Corporate
President/CEO
Richard L. Arnold, CBCP
richard@drj.com
Vice
President
Robert Arnold
bob@drj.com
CONFERENCE COORDINATOR
Patti Fitzgerald, CBCP
patti@drj.com
CONFERENCE REGISTRAR
Merce Knese
mercedes@drj.com
CIRCULATION
Laura Baugh
laurab@drj.com
INTERNATIONAL
CONTACTS
England: Thom Hetherington
Business Continuity
Phone: 0161-237-1007
thomh@tempus.demon.co.uk
Australia: Anthony J. Harvey
Journal of Business Continuity
Phone: 0011-613-953-0055-8
fax: 0011-613-953-0528
sector@notability.com.au
Japan: Shinji Hosotsubo
Quake Japan Co., Ltd.
Phone: 03-3215-2880
fax: 03-3215-2881
Brazil:
Jose Carlos Ferreira
Disaster Recovery Mercosul
Phone: 55
11 3666-9506
conc2000@uol.com.br
www.drms.com.br
|
|
Click
Here for a Printable Version

An
Interview With Robert Burns
of EverGreen Data Continuity, Inc.
By ELIZABETH M. FARRARINI
Storage infrastructures
have started to resemble the layers of a perpetually growing onion.
The good stuff data resides at the core. However, before
you can get to it, you have to unwrap each layer. You have storage devices,
systems that support these devices, networks that support systems, applications
that sit on the systems, and business units that need the systems. Robert
Burns, president and founder of EverGreen Data Continuity, Inc., Newbury,
Mass., says, You cant improve your storage infrastructure
unless you understand every one of the layers that make up your infrastructure.
Burns recently provided a glimpse at how he conducts these assessments,
what holes theyve uncovered, and how hed solve certain problems.
Farrarini: Where do organizations fall short when it comes to doing
their own assessments?
Burns: Most companies dont have the tools to evaluate their storage
environments. Most IT professionals dont understand all of the
issues. They just look at how much data they have and guess at what
theyre going to need. People just dont manage their data.
For example, our assessment usually shows that less than half of the
data on dedicated servers is active. Some customers have bought storage
management tools and never used them.
Management thought these tools would be a good idea, but the systems
administrators thought otherwise.
Farrarini: Can you
explain the methodology you use for an assessment?
Burns: To provide better data protection, you need to look at the storage
problem. Thats what we do. We breakdown a customers storage
infrastructure into modules, such as storage management, and, within
each module, rate their practices against our 356 best practices for
storage management.
For example, the storage management module looks at backup and recovery
practices, high availability, storage growth capability, storage management
performance concerning the network, and disaster preparedness.
Say a customer had a recovery time of two hours. We might rate them
from a one to five (strong) how they meet this recovery time as compared
to our best practice.
Well also provide an explanation of the best practice. Take backup.
Well describe our best practice for doing full backups. Well
explain how they are doing with backups relative to, for example, a
full backup, which would allow them to recover in the time objective
theyve defined.
Farrarini: Whats
the real meat of an assessment?
Burns: It has to be the design architecture phase, which may or may
not happen.
Let me clarify. If a customer doesnt agree with the findings in
the current storage infrastructure part of our assessment or doesnt
have the budget for it, then we dont want to recommend a new storage
architecture.
If a customer opts for it, then well dig deep into what the customer
has. By looking at the enterprise storage environment, we can start
to develop recovery criteria based on what theyve defined by for
their environment. The criteria deal with whats needed for storage
scalability.
From here, we can develop the architecture. We really get into why SANs
make sense. Weve developed evaluation criteria for all the functions
theyll require in a new storage architecture, as well the way
various classes of products stack up against each other.
Farrarini: Since your
assessment of a customers storage infrastructure covers many areas,
can you describe how you go about getting all of the information you
need?
Burns: The trick includes putting together a hierarchy of sources. We
begin by having a two-hour to a four-hour discussion with the CIO and
the top managers. We learn what they expect from the business critical
applications, such as recovery objectives.
Next, we like to speak with their internal customers, usually business
unit leaders, to get an understanding of their needs.
We speak with application managers, followed by individuals running
the data centers. We break the data center group into platforms, such
as Unix and Windows NT.
We next talk to the system architects responsible for the design of
the storage. Of course, we also include the operations personnel who
run the help desk.
Farrarini: Do you
find any inconsistencies among the way each group perceives the storage
environment? Let me add, do CIOs really know their storage environment?
Burns: Plenty of inconsistencies! Thats why we interview these
groups separately. Each group seems to have its own perceptions of the
way things are.
The biggest inconsistency occurs between what top management thinks
is happening versus what the architects say the systems will do, and
what the operations personnel have to cope with everyday.
Most of the time, CIOs dont know a lot about what works and what
doesnt work in their storage environment. These executives, however,
prefer to focus on the storage cost implications to the overall organization.
Farrarini: You mentioned
that your customers really dont have a good handle on how their
data is being used. Given this scenario, how do you get them to give
you this information?
Burns: The hardest thing to find out is what data they have, and how
much they have. For example, How much data do you backup weekly?
may seem like an easy question to answer. Its not. In fact, we
tend to flag this area as the most serious issue we have.
We give them a table with 13 columns. We ask for a servers RAID
levels, operating systems, backup frequency, and number of users, as
well as other information. We also want to know the types of backups
and when they occur.
One difficulty is that not all systems are backed up the same way. Another
problem is that its hard to get this information from reporting
part of backup software.
We can spend several weeks prying this information from the customer.
Most of the time, they dont have the information they say they
do.
Farrarini: When it
comes to storage policies and procedures, such as backup and recovery,
where are your customers usually the weakest?
Burns: You said it. Backup and recovery! Most companies have adequate
backup procedures. They dont have a good handle on how to handle
failures that occur in their backup process. In some cases, the person
doing the backups doesnt have a good methodology for how to recover
the data if something goes wrong.
Another problem is that some organizations have backup schedules that
dont complement the recovery objectives. For example, most organizations
do nightly backups. But if you have a recovery time of two hours, and
you have a failure, you can lose 23 hours worth of data because youre
not backing up every two hours. Thats why in the assessment we
take a good look at how people are doing things.
Farrarini: Prior to
doing the assessment, where are most of your customers with disaster
recovery?
Burns: The large organizations have out-of-date disaster recovery plans.
Many midsize organizations dont have a plan. Organizations dont
test what plan they have.
Financial organizations, on the other hand, are the exception. Federal
law mandates these organizations test their plans once a year. However,
most of this testing concerns mainframe systems. Most organizations
are still trying to figure how best to handle disaster recovery for
complex distributed system environments.
Farrarini: If a large
company isnt prepared for disaster recovery, then what preventative
measures do you recommend for small to midsize companies?
Burns: Organizations with either no IT department or a small centralized
one might want to consider outsourcing the management of their IT infrastructures
to managed service providers. These operations have facilities with
excellent disaster recovery capabilities such as redundant power
supplies, and redundant generators.
Another alternative would be to outsource the backups to a local service
provider that specializes in disk-to-disk backups and disaster recovery.
Before going this route, the organization needs to consider if a recovery
can take place within an acceptable time. Does the organization have
to wait for the service provider to ship a tape?
Farrarini: If a customer
had a recovery goal of zero downtime, what would you recommend?
Burns: You probably should be mirroring your data to a local facility
and an off-site facility. These should be high availability applications
only and both applications should be sitting on each site. You
also need to have some sort of a heartbeat monitoring system tied between
the two sites. If the second site has to kick in, the data has to be
in the same format as the data in the primary system.
Farrarini: Apart from
zero downtime, a lot of companies are considering mirroring to an offsite
location, perhaps a third-party site? What are some of recommendations
you make to customers about mirroring?
Burns: Since a lot of our customers have more than one site, we tend
to recommend they mirror or replicate data to one of their sites rather
than go with a third-party. Thats if the customer is equipped
to do it.
We recommend double mirroring. The first mirror creates operational
data to use if your disks go down. The mirroring occurs on-site. The
second mirror takes place at an off-site location owned by the organization.
This mirror gets used if a disaster occurs.
The two facilities should be between 20 to 30 miles away so you dont
have a regional power problem. The second site should have a data center
that can be used for processing.
The pecking order for application processing priority has to be clearly
defined. If the local site goes down, the computers at the other site
can be cut over to handle the high priority applications. The low priority
applications are either shutdown or reduced during the outage.
Farrarini: As a former
manager of large data centers, you lived through both centralized and
distributed control of storage. Wheres the best place to put storage?
Burns: For the past 12 years, Ive been telling everyone this:
storage ought to be in the network. A SAN puts storage in the network.
Shared storage in the network architecture provides the most direct
route for getting at your data. Storage should be within the site that
owns it.
Im not in favor of moving your primary storage into a co-located
facility managed by someone else. Id want control over my data.
Farrarini: Before
deploying an enterprise SAN, what changes should an organization make
to its storage infrastructure, especially in the data center?
Burns: Until you start consolidating your facilities, your SAN architecture
will be all over the place. We recommend centralizing as much as possible
to minimize bandwidth needs. Put as many applications as you can into
those servers. Also, bring as many of those servers into one standard
facility, rather than scattering them everywhere. Use the power of an
Internet portal to access servers in a common site.
Farrarini: Although
you folks dont get into how a customer purchases storage, what
recommendations would you suggest for the way an organization goes about
it?
Burns: We like the idea of centralizing purchases, as long as its
done intelligently. Ever wonder why a lot of data centers have one of
everything?
For example, applications engineers may recommend something that doesnt
fit with what the data center folks would use. Any type of a purchasing
task force, however, has to have the knowledge and understanding of
the problems they are trying collectively to solve. Otherwise, its
a failure.
Elizabeth Farrarini is a freelance writer based in Boston, Mass.
To comment on this article, go
to 1502-ask at www.drj.com/feedback.
|