Time-Limited Error Recovery
Encyclopedia
Time-Limited Error Recovery (TLER) is a name used by Western Digital
Western Digital
Western Digital Corporation is one of the largest computer hard disk drive manufacturers in the world. It has a long history in the electronics industry as an integrated circuit maker and a storage products company. Western Digital was founded on April 23, 1970 by Alvin B...

 for a hard disk drive firmware bugfix (to a misfeature) that allows improved error handling in a RAID
RAID
RAID is a storage technology that combines multiple disk drive components into a logical unit...

 environment. In some cases, there is a conflict as to whether error handling should be undertaken by the hard drive or by the RAID controller, which leads to drives being marked as unusable and significant performance degradation, when this could otherwise have been avoided. Similar technologies are called Error Recovery Control (ERC), used by competitor Seagate
Seagate Technology
Seagate Technology is one of the world's largest manufacturers of hard disk drives. Incorporated in 1978 as Shugart Technology, Seagate is currently incorporated in Dublin, Ireland and has its principal executive offices in Scotts Valley, California, United States.-1970s:On November 1, 1979...

, and Command Completion Time Limit (CCTL), used by Samsung
Samsung
The Samsung Group is a South Korean multinational conglomerate corporation headquartered in Samsung Town, Seoul, South Korea...

 and Hitachi
Hitachi, Ltd.
is a Japanese multinational conglomerate headquartered in Marunouchi 1-chome, Chiyoda, Tokyo, Japan. The company is the parent of the Hitachi Group as part of the larger DKB Group companies...

.

Overview

Modern hard drives feature an ability to recover from some read/write errors by internally remapping sectors
Disk sector
In computer disk storage, a sector is a subdivision of a track on a magnetic disk or optical disc. Each sector stores a fixed amount of user data. Traditional formatting of these storage media provides space for 512 bytes or 2048 bytes of user-accessible data per sector...

 and other forms of self test and recovery. The process for this can sometimes take several seconds or (under heavy usage) minutes, during which time the drive is unresponsive. RAID controllers are designed to recognise a drive which does not respond within a few seconds, and mark it as unreliable, indicating that it should be withdrawn from use and the array rebuilt from parity data. This is a long process, degrades performance, and if more drives fail under the resulting additional workload, it may be catastrophic.

If the drive itself is inherently reliable but has some bad sectors, then TLER and similar features prevent a disk from being unnecessarily marked as 'failed' by limiting the time spent on correcting detected errors before advising the array controller of a failed operation. The array controller can then handle the data recovery for the limited amount involved, rather than marking the entire drive as faulty.

Desktop Computers and TLER Effect

Effectively, TLER and similar features limit the performance of on-drive error handling, to allow RAID controllers to handle the error if problematic. In a non-RAID environment, such features are unhelpful, and manufacturers do not recommend their use.

Generally, Western Digital enterprise drives such as Raptor, Caviar RE2 and RE2-GP (RAID Edition) come with TLER Read "Enabled" (7 seconds) and TLER Write "Disabled" (0 seconds) while desktop drives such as Caviar SE, SE16, and GP come with TLER Read and Write Disabled (0 seconds).

The ability to change the TLER option on Western Digital Desktop and Enterprise hard drives coincides with the current trends in desktop computing allowing the user to have the flexibility of buying a start-up computer with a single desktop class hard drive and then upgrading to two or more drives in the future. Most of the motherboard manufacturers these days use chipsets from Intel and nVidia
NVIDIA
Nvidia is an American global technology company based in Santa Clara, California. Nvidia is best known for its graphics processors . Nvidia and chief rival AMD Graphics Techonologies have dominated the high performance GPU market, pushing other manufacturers to smaller, niche roles...

 that include on-board RAID controllers as a standard option and anywhere from 4, 6, or up to 10 SATA ports allowing for easy upgrades to multiple hard drives and RAID capability without the requirement of purchasing specialized add-on cards. Additionally, having the choice of changing the TLER option to disabled, gives the user the flexibility of purchasing the high performance drives that have TLER enabled and that are designed to run in a RAID array by using them as stand-alone single drives in their desktop computers and disabling the TLER option to enable better data recovery and lower chance of corruption and drive failures.

Stand-Alone vs RAID Hard Disk Usage Considerations

It is best for TLER to be "Enabled" when in a RAID array to prevent the recovery time from a disk read or write error from exceeding the RAID controller's timeout threshold. If a drive times out, the hard disk will need to be manually re-added to the array, requiring a re-build and re-synchronization of the hard disk. Enabling TLER seeks to prevent this by interrupting error correction before timeout, to report failures only for data segments. The result is increased reliability in a RAID array.

In a stand-alone configuration TLER should be disabled. As the drive is not redundant, reporting segments as failed will only increase manual intervention. Without a RAID controller to drop the disk, normal (no TLER) recovery ability is most stable.
Model TLER Default ( Read / Write ) Stand-Alone Recommendation RAID Recommendation
Caviar, SE, SE16, GP, Raptor Disabled ( 0s / 0s ) Default Enabled
Caviar RE2, RE2-GP Enabled ( 7s / 0s ) Disabled Default

Western Digital Time Limit Error Recovery Utility - WDTLER.EXE

The WDTLER utility allows for the enabling or disabling of the TLER parameter in the hard disk's firmware settings allowing the user to determine the best setting for his particular usage as either a stand-alone or RAID drive. This utility is written for DOS
DOS
DOS, short for "Disk Operating System", is an acronym for several closely related operating systems that dominated the IBM PC compatible market between 1981 and 1995, or until about 2000 if one includes the partially DOS-based Microsoft Windows versions 95, 98, and Millennium Edition.Related...

 and you will require a DOS bootable disk with this utility on it to use it.

The WDTLER utility works on and makes changes to all the connected and compatible Western Digital hard drives to the computer. It is important to remember that any change will affect all the hard drives. If you only wish to change specific hard drives on your computer then you should disconnect the other hard drives before you use this utility, then reconnect them after you are finished.

The WDTLER utility comes with three batch files, TLERSCAN.BAT to get the current state of the TLER setting on all the hard drives, TLER-ON.BAT to Enable TLER, and TLER-OFF.BAT to Disable TLER. The included TLER-ON.BAT will set the Read & Write TLER time to 7 seconds. If you wish to use a custom timeout value, you can use the WDTLER.EXE utility directly with the -r# -w# parameters to specify how many seconds the Time Limit value should be.

Western Digital now claims that using the WDTLER.EXE tool on newer drives can damage the firmware and make the disk unusable. The WDTLER.EXE tool is no longer available from Western Digital, and new disks will not be able to have the TLER setting changed. RE disks are only suitable for RAID arrays and Caviar are only suitable for non-RAID use. The utility still works for older disks.

Below is the WDTLER output for Western Digital Caviar SE16 320 GB and 500 GB hard disk for the default TLER configuration before and after TLER has been Enabled.

Before - TLER Read & Write: Disabled


WDTLER Version 1.03
Copyright (C) 2004-2006 Western Digital Corporation
Western Digital Time Limit Error Recovery Utility

Model: WDC WD3200KS-00PFB0 Serial Number: WD-WCAPD1234567
Read TLER is disabled.
Write TLER is disabled.

Model: WDC WD3200KS-00PFB0 Serial Number: WD-WCAPD1234567
Read TLER is disabled.
Write TLER is disabled.

Model: WDC WD5000KS-00MNB0 Serial Number: WD-WMANU1234567
Read TLER is disabled.
Write TLER is disabled.

Model: WDC WD5000KS-00MNB0 Serial Number: WD-WMANU1234567
Read TLER is disabled.
Write TLER is disabled.

Legend: WD3200KS - Western Digital Caviar SE16 320 GB, WD5000KS - Western Digital Caviar SE16 500 GB
After - TLER Read & Write: 7 seconds


WDTLER Version 1.03
Copyright (C) 2004-2006 Western Digital Corporation
Western Digital Time Limit Error Recovery Utility

Model: WDC WD3200KS-00PFB0 Serial Number: WD-WCAPD1234567
Read TLER time is 7.000 seconds.
Write TLER time is 7.000 seconds.

Model: WDC WD3200KS-00PFB0 Serial Number: WD-WCAPD1234567
Read TLER time is 7.000 seconds.
Write TLER time is 7.000 seconds.

Model: WDC WD5000KS-00MNB0 Serial Number: WD-WMANU1234567
Read TLER time is 7.000 seconds.
Write TLER time is 7.000 seconds.

Model: WDC WD5000KS-00MNB0 Serial Number: WD-WMANU1234567
Read TLER time is 7.000 seconds.
Write TLER time is 7.000 seconds.

Legend: WD3200KS - Western Digital Caviar SE16 320 GB, WD5000KS - Western Digital Caviar SE16 500 GB
Note: Western Digital (1.5TB Green Power) WD15EADS-00P8B0 (Nov 2009) drives do not support TLER. WD15EADS-00S2B0 (Feb 2010) models do support TLER.

Raid Controllers

Hardware raid controllers
Disk array controller
A disk array controller is a device which manages the physical disk drives and presents them to the computer as logical units. It almost always implements hardware RAID, thus it is sometimes referred to as RAID controller. It also often provides additional disk cache.A disk array controller name is...

 disconnect timeout value may vary from vendor to vendor. TLER should trigger before the controller times out the drive. For example,
  • 3ware 9650SE: 20 seconds
  • LSI Logic(for IBM x-series): 10 seconds (see BIOS Raid Config Utility > Advanced Device Properties)


Note:
  • Intel Matrix RAID / Intel Rapid Storage Technology embedded in Intel server motherboards and modern desktop motherboards is a pseudo-hardware controller, not real hardware raid.

Software Raid

  • GEOM
    GEOM
    GEOM is the main storage framework for the FreeBSD operating system. It is available in FreeBSD 5.0 and higher and provides a standardized way to access storage layers. GEOM is modular and allows for geom modules to connect to the framework. For example, the geom_mirror module will provide RAID1 or...

    : 4 seconds (kern.geom.mirror.timeout: 4)
  • Linux mdadm
    Mdadm
    mdadm is a Linux utility used to manage software RAID devices.The name is derived from the md device nodes it administers or manages, and it replaced a previous utility mdctl...

    simply holds and lets the drive complete its recovery - however, the default command timeout for the SCSI Disk layer (/sys/block/sd?/device/timeout) is 30 seconds, after which it will attempt to reset the drive, and if that fails, offline the drive

External links

The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK