| DISASTER
RECOVERY
JOURNAL
P. O. Box 510110
St. Louis, MO 63151
(314) 894-0276
Fax: (314) 894-7474
Internet
www.drj.com
E-mail drj@drj.com
PUBLISHER &
EDITOR-IN-CHIEF
Richard L. Arnold, CBCP
richard@drj.com
SENIOR EDITOR
Janette Ballman
janette@drj.com
MANAGING EDITOR
Jon Seals
jon@drj.com
COPY EDITORS
Richard Sandhofer
richards@drj.com
Pamela Clifton
pamelaclifton@hotmail.com
ADVERTISING
Robert Arnold
bob@drj.com
_____________
Corporate
President/CEO
Richard L. Arnold, CBCP
richard@drj.com
Vice
President
Robert Arnold
bob@drj.com
CONFERENCE COORDINATOR
Patti Fitzgerald, CBCP
patti@drj.com
CONFERENCE REGISTRAR
Merce Knese
mercedes@drj.com
CIRCULATION
Laura Baugh
laurab@drj.com
EXECUTIVE
COUNCIL
Jeff Dato, MBCP, KPMG
John Jackson, J Albright Advisors
Edward Devlin, E.S. Devlin & Associates
James Hammill, CBCP, JMH Consulting
Pat McAnally, SunGard Availability
Brian Turley, Strohl Systems
Belinda Wilson, Hewlett-Packard
INTERNATIONAL
CONTACTS
England: Thom Hetherington
Business Continuity
Phone: 0161-237-1007
thomh@tempus.demon.co.uk
Australia: Anthony J. Harvey
Journal of Business Continuity
Phone: 0011-613-953-0055-8
fax: 0011-613-953-0528
sector@notability.com.au
Japan: Shinji Hosotsubo
Quake Japan Co., Ltd.
Phone: 03-3215-2880
fax: 03-3215-2881
Brazil:
Jose Carlos Ferreira
Disaster Recovery Mercosul
Phone: 55
11 3666-9506
conc2000@uol.com.br
www.drms.com.br
|
|
Click
Here for a Printable Version
The Key to Information Lifecycle
Management is Cost-Effective Backup
By ERAN FARAJUN
Information lifecycle management (ILM) is a relatively
young concept and the industry will continue to encounter pitfalls as
ILM solutions develop. As it stands at the moment, ILM has two aims:
to reduce administration costs and to make the most efficient use of
storage hardware. But in order to achieve these, ILM needs to rely on
an ILM-aware backup system. A backup architecture lacking the ability
to maximize the use of ILM will reduce any chance of successfully realizing
the goals of cost reduction and greater storage efficiency.
Why The Need For ILM
Reliable and secure data storage is crucial to business continuity plans.
With the dependence enterprises have on information about their technical
processes, data storage is becoming somewhat of a headache for IT executives
and storage administrators. Many industries such as finance and healthcare
are facing new regulations that require enterprises to conform to evolving
regulations about data retention. These requirements, as well as the
increasing amount of data that enterprises store, mean that the costs
of managing information can grow up to 20 to 30 percent per year. With
estimates such as these – the need for cost-effective data storage,
and its management, becomes of paramount importance to enterprises and
IT managers.
What Is ILM
Definitions of ILM can vary, but ILM will be defined as a data archiving
process which moves data automatically to the most cost-effective storage
media available and is based on prescribed policies of accessibility,
security, and long-term storage. This automatic transferral of data
requires no manual intervention; reducing hardware and real estate costs,
therefore ILM vendors are able to promise a significant return of investment
(ROI).
The data generated by an enterprise can be placed into two categories:
Critical information is the data that is used for day-to-day operations
and is located within the enterprise’s primary storage system,
allowing for fast access.
Important information is the data that can be archived to secondary
storage, typically lower cost disks or tapes at an off-site location.
This information is historical, legal, and regulatory.
Critical data is accessed frequently, yet over time a file will be accessed
more sporadically, thus the file’s status changes from critical
to important. A prescribed policy can also determine a set length of
time by which a file ceases to be critical, such as after 90 days. The
ILM solution then automatically archives this data to secondary storage,
without manual assistance from IT personnel. ILM solutions then create
a “pointer” that contains the metadata for every file that
has been automatically moved to secondary storage. If the file’s
status then ever returns to critical status, the pointer directs the
user straight to the file’s new location to be retrieved for use.
The efficacy of ILM can be compared with systems libraries have used
to manage the thousands of books in their collections. It is fairly
easy and cheap to buy books, yet expensive to manage storage of the
books so you know where each book is at any point in time. Additionally,
a system needs to be set up to manually manage the movement of these
books as well as a system of categorizing the books. As new books are
added to the collection (i.e. critical data) they need to be categorized
and stored correctly. As books decrease in demand, they are filed away
to an archive (i.e. important data). An ILM system would automatically
categorize and store the new data books accordingly, as well as re-shelve
the low-demand books elsewhere, therefore negating the need for such
time-consuming management.
Where Does A Problem Arise With Backup?
Enterprises are recognizing through media “hype” surrounding
ILM that it is something worth investing in, and are quite rightly looking
to this new concept to improve the efficiency of their data storage
management. But in doing so, enterprises can forget to take into account
their existing back-up system and fail to ensure that the stored data
isn’t duplicated.
The typical architecture of a back-up system saves files from primary
(critical status) storage on a low-cost disk or tape on a daily basis.
If one given file remains critical, this frequent backing up remains
in process.
The ILM archiving of data is distinct from back-up operations as ILM
archiving moves the operational, non-critical data into long-term storage,
whereas backup protects critical data before it can be archived.
Back-up systems that are not ILM-aware will continue to store backed
up files on tape or secondary disks regardless of the data already archived
elsewhere. This is an important oversight as both sets of data must
now be managed, incurring an increase in costs and reduction in efficiency.
The result is a lower return on ILM investment than the IT directors
would have expected.
Referring back to the library analogy, this duplication problem could
occur if a library decided to ensure that its bestseller books are always
available for borrowing and made copies of a bestseller book each time
it was loaned out. The benefit of this is that the book is always available.
However, once the book is no longer a bestseller (no longer critical
data) and all the copies have been returned, the librarian would have
to ensure there is space on the archive shelves for all these duplications.
Although an important process has been put in place, it has proved costly
to the library. In the same way, a non-ILM distributed back-up system
can waste valuable storage space.
How To Counteract The Problem
A realistic and efficient solution to this major failing of backup is
to implement an ILM-aware backup, such as distributed backup. Distributed
backup removes entirely the need for daily backups of critical data
onto costly tapes, thereby automatically reducing the level of storage
management required by an enterprise.
A distributed back-up system collects the data from the network clients
and sends it to offsite disk storage in a compressed and encrypted format.
When the data is needed for a restore, the system will retrieve the
data as required. The process is fully automated and ensures fast and
multiple backup without duplicity. The back-up process is efficient
and the user can be assured of achieving the anticipated ROI.
This ILM-aware distributed backup makes efficient use of ILM’s
archive pointers by retaining one copy of a file on either backup or
secondary storage. The pointers enable the backup to decipher which
files have been archived and allow it to automatically remove these
excess files from the back-up disks. This improves cost-efficiency by
removing the problem of file duplication and uneconomical use of storage
space.
ILM-aware distributed backup is able to do this by locating and recognizing
a given file’s pointer in the back-up data (received from the
client) and automatically searches the back-up disk for the original
file, deleting it and saving the pointer.
A librarian could use pointers in the same way in order to solve the
problem of having to store multiple copies of a bestseller each time
one is made. A stamp (a pointer), for example, on the original copy
would automatically tell the librarian that any other returned copies
of the same book are not this original. The librarian can then discard
these excess versions each time they are returned to the library so
that they don’t have to find storage space on the shelf for more
than one copy. The library’s indexing system will automatically
detect the stamp on the original book and ensure that it is shelved
accordingly.
This system means that current data in primary storage is backed up
to disk, minimizing disk size and cost. Distributed backup results in
faster, more frequent backups and simpler restore operations, while
reducing hardware and storage costs and the necessity for daily administration.
It is important to realize that the life of a backup-file is separate
and distinct at whatever stage of life it is at: from when it is born;
to when it is kept on different tiers of storage media; to when the
backup-file is deleted.
Eran Farajun is executive vice president for Asigra Inc., the multi-site
backup/recovery specialist. His role at Asigra includes marketing and
strategic business development. Farajun holds a law degree from the
University of Sheffield in the UK.
©Copyright
Systems Support Inc. All rights reserved. Reproduction in whole or in
part in any form or medium without the express written permission of
System Support Inc. is prohibited.
|