Redundant Array of Inexpensive Disks. The term was coined in a paper by Patterson, Gibson and Katz in 1988
"While the capacity of Single Large Expensive
Disks (SLED) has grown rapidly, the performance improvement of SLED has
been modest. Redundant Arrays of Inexpensive Disks (RAID), based on the
magnetic disk technology developed for personal computers, offers an
attractive alternative to SLED, promising improvements of an order of
magnitude in performance, reliability, power consumption, and
scalability."
The main idea of RAID is to provide a group of
drives in a system that will continue to provide valid uninterrupted
data even if one (or sometimes more) drives fail. Secondary benefits are
faster access speeds and data transfer as well as the ability to
publish virtual disk drives to the host system that are far larger than a
single drive.
RAID comes in several different
configurations, which are notated RAID X, with X being a number. The
most commonly used are RAID 0, RAID 1, and RAID 5, with RAID 3 and RAID
0+1 making occasional appearances. RAID 2 and 4 are almost never used.
Each has its own strengths and weaknesses which I will now explain.
RAID 0: Striping
The first thing to realise with RAID 0 is that it
doesn't provide redundancy, so by definition it isn't truly RAID. If
even a single drive in a RAID 0 set goes faulty, then all of the data on
the set is lost. However, RAID 0 does provide the fastest IO rates of
the RAID levels.
RAID 0 gets its performance from striping the
data across all of the drives in the set. One way of explaining this
would be to imagine a controller (connected to a group of drives) which
spreads the data across all of the disks. Because it is sending data to
several drives at once, it is able to do so quickly. The same is true
for reading.
Characteristics
Striped disk array, the data is broken down into blocks and each block is written to a separate disk drive.
I/O performance is greatly improved by spreading the I/O load across many channels and drives.
Fastest and most efficient array type but offers no fault tolerance.
RAID 0 requires at least one drive
Recommended Use
Video production and editing: Image editing: Prepress applications: Any application requiring high bandwidth.
RAID 1: Disk Mirroring
This is a very simple concept. Each drive has a
complete mirror of itself on another drive. There is little or no
performance benefit from using RAID 1, in fact, there can be a
degradation to writes due to the overhead of error checking the data.
Characteristics
Better read transaction rate than single disks, same write transaction rate as single disks.
100% redundancy of data means that in case of disk failure, a
rebuild of data is unnecessary. Just make a copy to a replacement disk.
All the paired disks have the same data.
RAID 1 requires at least two drives.
Recommended Use
Accounting: Payroll: Financial: Any application requiring high availability.
|
RAID 0+1: Striping and Mirroring
This is striping and mirroring combined. Data
is striped across several disks, each disk having a partner with exactly
the same data on it.
You get the benefits of fast data access as in
RAID 0, with the fault tolerance of RAID 1. You need double the number
of disks as a RAID 0, half for each side of the mirror.
Characteristics
Optimal speed and reliability.
High use of drives, RAID 0+1 requires twice as many drives as RAID 0.
Recommended Use
Any application requiring both high bandwidth and high fault tolerance.
RAID 3: Stripe with Parity
RAID 1 required twice as many drives as normal to
provide fault tolerance. This can be expensive. RAID 3 addresses this
and requires one only extra drive to hold parity information. If a drive
fails, the data can be reassembled using the parity information. The
drawback is that this parity information has to be calculated, which
takes time and processor power on the RAID controller.
Characteristics
The data is striped across a set of disks with parity information calculated and written to a separate disk.
Very high read and write data transfer rates.
Disk failure has low impact on throughput.
RAID 3 requires at least three drives.
Recommended Use
Video production and live streaming:Image editing:
Video Editing: Any application requiring high throughput and
availability.
RAID 5: Striped Data, Striped Parity
An adaptation of RAID 3, RAID 5 spreads the
parity information across all of the drives instead of using a dedicated
drive. This gives a performance benefit compared to RAID 3, where the
dedicated parity drive is written to on every write operation. With RAID
5 this load is spread across all of the drives. This still only
requires an overhead of one drive.
RAID 5 is the most commonly chosen RAID configuration, giving fault tolerance and good performance.
Characteristics
The data is striped across a set of disks and parity information calculated and striped as well.
Highest read data rate, medium write rate.
RAID 5 requires a minimum of three drives.
Recommended Use
File and application servers: Database servers: Intranet servers: Most versatile RAID level.
About the Author
This article is reprinted with kind permission
from JAD Logic Ltd, a UK company of RAID experts www.jadlog.demon.co.uk/
The graphics, created locally, are based upon Accusys images.
Reprinted from the June 2003 issue of PC Update, the magazine of Melbourne PC User Group, Australia
|