9.2 Choosing a Tape Drive
Consider
the following issues when choosing a tape drive:
- Capacity
-
The single most important
consideration. Get a drive that can back up all data on one tape,
allowing for some growth. If your data set exceeds the capacity of
one tape, you may find that drive is no longer usable (if no one is
available to change tapes during an overnight backup) or that it has
suddenly become very expensive to use (because you must buy twice as
many tapes). In such a case, the only alternative to replacing the
drive is to use a backup scheme that mixes full and incremental or
differential partial backups, which is riskier for your data.
Most tape drive manufacturers arbitrarily rate their drives at double
actual native capacity, assuming you will use software or hardware
compression to double the amount of data that fits on a tape. AIT
drive specifications go further still, assuming 2.6X compression. The
actual compression ratio you experience depends on the data mix
(e.g., documents and spreadsheets compress well; executables, images,
and archives much less so), the backup software you use, and
sometimes on the speed of the computer where the drive resides. We
find that real-world data sets typically compress at 1.5:1 to 1.7:1,
so plan accordingly.
 |
Having adequate tape drive capacity does not necessarily mean being
able to back up your entire hard drive (or drive farm) to one tape in
one pass. For example, our network contains more than a terabyte
(1000 GB) of disk space with hundreds of gigabytes in use. Yet for
backups we use only two DDS-3 drives that store 12 GB natively, one
DDS-4 drive that stores 20 GB natively and one Travan NS-20 drive
that stores 10 GB natively. We are well protected, despite our total
tape drive capacity being much smaller than the amount of data we
have stored. Why? Because much of that data doesn't
need to be backed up frequently, if at all.
Some is archived
"real" data—things such as
previous editions of this book. We back up that data periodically to
inexpensive DDS-3 tapes as well as CD-R and DVD+R/RW discs. We need
to renew those backups only every few months when we move material
from our working data directories to our archive directories.
Hundreds of gigabytes of data need not be backed up at all. That
comprises copies of operating systems and applications that we keep
on network volumes for convenience, ISOs of numerous older Linux
distributions, MP3s we ripped from our CD collection (which we plan
to re-rip as OGG files anyway), movies we recorded and watched but
haven't yet gotten around to deleting, and so on. In
other words, stuff that we wouldn't miss if we lost
it, or that we could easily reproduce if necessary. Our actual
working data fits easily on one DDS-3 tape, so for us DDS-3 suffices.
|
|
- Speed
-
This may or may not be a critical factor, depending on your own
environment and practices. If you have a limited backup window
available, speed may be as important as capacity. If you can simply
start a backup when you finish work for the day and allow it to run
overnight, speed may be a minor factor.
Actual throughput depends on the drive mechanism, the interface, the
speed of the computer in which the drive is installed, and the data
set being backed up, but will likely be lower than the drive
manufacturer advertises. Compression may also have a significant
impact on throughput, for better or worse. For example, our Seagate
Scorpion 40 DDS-4 drive is rated at 165 MB/min native and 330 MB/min
compressed, but we typically get only 275 MB/min with compression
enabled. Similarly, our Seagate Travan NS20 SCSI drive is rated at 60
MB/min native and 120 MB/min compressed, but we actually get about
100 MB/min. The lower-than-expected throughput with compression is no
fault of the drives, but results from our data being less
compressible than the assumed 2:1. For these drives, compression
improves throughput, albeit not to the expected extent. Conversely,
when we tested an OnStream DI30 drive that is rated at 60 MB/min
native and 120 MB/min with compression, we actually got 45-50 MB/min
native, but only 15-17 MB/min when using the bundled Echo software
with compression enabled. In that case, it turned out the compression
software was poorly designed, a problem that was fixed in a later
release. All of these figures are for backing up local volumes.
Backing up data across a network, even a fast network, commonly cuts
throughput by half or more due to operating system overhead,
filesystem overhead, and network latency.
 |
If your tape drive provides hardware compression and your backup
application has software compression, don't use both
methods together. Either works well by itself, but using both may
actually reduce throughput, as the drive tries to compress data
that's already been compressed by the backup
software. In general, hardware compression is faster than software
compression. There are exceptions, though, and which compression
method is more efficient may depend on the drive, backup software,
and mix of data being backed up. The only way to determine which
method is faster is to try both, but be sure to try them one at a
time.
|
|
- Media cost
-
Travan and OnStream ADR drives are constructed with loose tolerances,
and are accordingly inexpensive, but require expensive tapes built to
close tolerances. DDS drives, conversely, are expensive because they
are built to tight tolerances, which allows them to use inexpensive,
loose-tolerance tapes. AIT drives are expensive and use expensive
tapes, but their very high capacity and throughput may make them the
only alternative when DDS-4 isn't large enough or
fast enough. A typical tape rotation may require from four to 50 or
more tapes. Tapes must be replaced periodically (on the schedule
recommended by the drive and/or tape manufacturer—trying to
stretch the lifetime of tapes is a foolish economy). Tape drives have
a realistic service life of perhaps two to three years with heavy
use, and four or five years with moderate use (by which time the
drive is likely no longer adequate for your needs anyway). Expect to
spend from as much to several times as much as the cost of the drive
to buy tapes over the drive's life.
- Interface
-
Internal
tape drives are available with ATAPI/IDE or SCSI interfaces. External
tape drives are available with SCSI, Parallel, USB 1.1, USB 2.0, or
FireWire (IEEE-1394) interfaces.
- ATAPI
-
ADR and
Travan drives are available with ATAPI interfaces. ATAPI drives are
typically less expensive than those using other interfaces, can use
the ubiquitous IDE interface present on any modern motherboard,
provide reasonably high throughput, and are easy to install. Choose
an ATAPI drive for convenience, ease of installation, or when cost is
an overriding issue.
- SCSI
-
ADR,
Travan, DDS, and AIT drives are available with SCSI interfaces. SCSI
drives typically sell for at least a $50 to $100 premium over similar
ATAPI models, require adding a $100+ SCSI interface card if the PC is
not already so equipped, and are more complicated to install and
configure than ATAPI models. SCSI drives typically provide much
higher throughput and much lower CPU utilization than ATAPI models.
The largest and most feature-laden drives are available only in SCSI.
Most SCSI-only models are designed for use on servers, and are
therefore better built and more reliable than ATAPI drives designed
for the mass market. Choose a SCSI drive for highest capacity,
performance, durability, and reliability. SCSI is the only option if
there are no available ATAPI connections, or if you require capacity
and/or features available only in a SCSI model.
- Parallel
-
Drives
that use the parallel interface typically have half or less the
throughput of SCSI or ATAPI/IDE, but may be a reasonable choice if
you must use one drive to back up local data on multiple standalone
PCs that do not have recent USB ports. On a small network, it is
usually better to map a drive on the server where a tape drive
resides for each local workstation volume, and back up centrally to
an ATAPI or SCSI drive. If you are considering a parallel drive
because you have several standalone PCs that must be backed up,
consider instead connecting those PCs with a simple network and using
an internal server-based tape drive.
- USB
-
USB 1.1
drives typically have between half and two-thirds the throughput of
ATAPI/IDE and SCSI drives, and are a better choice than parallel port
drives for backing up local data on multiple standalone PCs if all of
those PCs have reasonably recent USB ports. USB 1.1 tape drives have
a theoretical maximum throughput of about 90 MB/min and a typical
actual throughput of 60 MB/min or less. USB 2.0 models are limited by
the speed of the drive mechanism rather than the speed of the
interface.
- FireWire
-
There are
a few external tape drives available that use the FireWire interface,
none of which we have tested. FireWire tape drives offer Plug and
Play compatibility similar to USB models, but are much faster than
USB 1.1 drives. The relatively high cost of FireWire tape drives and
the fact that few PCs have FireWire interfaces make them a poor
choice for most people. The exception is if you have desktop PCs and,
particularly, notebook PCs that have a FireWire interface installed.
In that case, a FireWire tape drive may be the fastest, easiest tool
for backing up.
- Cross-drive compatibility
-
Here's a dirty little
secret that drive manufacturers don't talk much
about. You might reasonably assume that a tape you created in one
drive would be readable in a similar drive, but
that's not always the case. In particular, we have
found that some Travan TR-4 drives produce tapes that cannot be read
by another drive, even one of the identical make and model. We seldom
encounter that problem on AIT, DDS, Travan NS8/NS20, and OnStream ADR
drives, although we have had infrequent reports of such compatibility
problems occurring with them.
Unless you use a tape drive to transfer large quantities of data
between computers, this may seem a minor issue. It can be critical,
however, if your computer is stolen or damaged by flood or fire. Even
if your backup tapes are safely locked away, you may find that a
replacement drive of the same model is unable to read them. If your
data is important enough to warrant extreme precautions, buy two
identical tape drives and verify that a tape written in either drive
is readable by the other. Repeat this verification periodically
because drives do start marching to their own drummer as they age.
Alternatively, consider backing up key data frequently to CD-R or
writable DVD and verifying that the disc is readable.
- Operating system compatibility
-
Hardware-level compatibility and application compatibility are
important issues in choosing a tape drive. At the hardware level, any
current tape drive is almost certainly compatible with all recent
versions of Windows. Most current tape drives also support Linux, but
there are exceptions, so check hardware compatibility and driver
availability for your specific Linux distribution and version before
you buy a tape drive. Linux drivers may be available for a particular
tape drive, but using them may require recompiling the kernel or
taking other steps that you may not be comfortable performing.
Application compatibility varies. Many tape drives are bundled with
backup software that is targeted at a specific market. For example, a
drive intended for desktop systems may include backup software that
runs under Windows 9X, 2000 Professional, and XP Professional but not
under Windows 2000 Server or Linux. Conversely, a tape drive targeted
at servers may include bundled backup software that runs under one or
several server operating systems. If you intend to use the tape drive
with Linux or an unsupported Windows version, you must obtain backup
software separately. That software may be free, as are several Linux
backup programs, or it may be expensive enough to make a different
tape drive a better choice.
 |