Documente Academic
Documente Profesional
Documente Cultură
Some of the Amazing Features in Sun's Open Source ZFS File System
Paul Rubens with Enterprise Networking Planet lists what he believes are the top
10 features of Sun's open source ZFS file system. Briefs taken from Rubens on
each of these features are listed below:
The file system uses a 256-bit checksum, which is stored as metadata separate
from the data it relates to, when it writes information to disk. Unlike a simple disk
block checksum, this can detect phantom writes, misdirected reads and writes,
DMA parity errors, driver bugs and accidental overwrites as well as traditional "bit rot."
Copy on Write
Copy-on-write means that when data is changed it is not overwritten -- it is always written to a new block and
checksummed before pointers to the data are changed. The old data may be retained, creating snapshots of
the file system through time as changes are made.
Time Slider offers a view of the file system with a slider which can be moved back along a timeline to earlier
snapshot times. As this is done, the view changes to show the state of the file system or the contents of a
folder at the corresponding snapshot time. Recovering a file that has been overwritten by mistake or rolling
back the system after an unsuccessful update is then just a matter of moving the slider back to the appropriate
snapshot time.
ZFS takes available storage drives and pools them together as a single resource, called a zpool. This can be
optimized for capacity, or I/O performance, or redundancy, using striping, mirroring or some form of RAID. If
more storage is needed then more drives can simply be added to the zpool -- ZFS sees the new capacity and
starts using it automatically, balancing I/O and maximizing throughput.
Users need not be concerned with the RAID 5 write hole anymore. RAIDZ gets around this problem by using a
variable width stripe, so every write is effectively a full stripe write. This, together with ZFS's copy on write
characteristic eliminates the RAID 5 write hole completely. RAIDZ2 works in a similar way, but can tolerate the
loss of two disks in the array using double parity. Setting up a RAIDZ (or RAIDZ2) array is very easy and
involves issuing one command.
High performance SSDs can be added to a storage pool to create a hybrid storage pool. When these are
configured as high performance cache disks, ZFS uses them to hold frequently accessed data to improve
performance. It also uses a technology called L2 ARC (adaptive replacement cache) to write data that has to
be stored immediately. This can slowly be moved over to conventional hard drives for more permanent storage
when time and resources allow.
Capacity
ZFS is a 128-bit file system, which means that in theory it could store 256 quadrillion ZB (a ZB is a billion TB.)
In practice this is larger than would ever be necessary, for the foreseeable future at least.
Data Scrubbing
ZFS can be made to scrub all the data in a storage pool, checking each piece of data with its corresponding
checksum to verify its integrity, detect any silent data corruption, and to correct any errors in encounters where
possible.When the data is stored in a redundant fashion -- in a mirrored or RAID-type array -- it can correct any
corrupt data it detects invisibly and without any administrator intervention. Since data corruption is logged, ZFS
can bring to light defects in memory modules (or other hardware) which cause data to be stored on hard drives
incorrectly.
Using ZFS commands you can administer a system with short, efficient commands. For example, a five disk
RAIDZ array could be set up with the single command:
Interested in trying ZFS? Start by downloading OpenSolaris or a related distro from http://opensolaris.org
/os/downloads/.
There is also a project that has ported ZFS to the FUSE framework for the Linux operating system. Visit the
ZFS for FUSE/Linux project.
More Information
Things Are Getting Easier with ZFS, Live Upgrade and now, Flash Archive