TideLog Archive for the “PC Repair” Category

I’ve had this problem a few times on my laptop. It occurs mostly when the power suddenly goes off and it switches to battery. You lose all capacity monitoring, and can’t tell how much is left. The system tray icon changes to this:

no battery detected

Microsoft’s forums are hilarious. Their “Most Valuable Professionals” give the funniest canned cut ‘n’ paste responses, from, “Your power driver is corrupt” to your “Windows needs reinstalling!”. I know exactly what causes it, and it ain’t anything to do with “power drivers” or corrupt Windows. It’s the little monitoring chip in the battery. Like a lot of integrated electronics, it sometimes gets confused. Sudden switchovers from mains to battery tend to cause it, especially if there’s any surges from the battery as it kicks in.

The age old advice of “Reboot!” is the wise advice. If that doesn’t cure it, turn your machine off, remove the mains and battery, and hold your power button down to discharge the circuitry in your device (apart from the RTC circuit, but this doesn’t matter), that should cure it. Removing the battery opens the circuit to the sensing system in the battery, and resets it.

Simples. I hate MVP’s, they go on a 5 day course and think that gives them a Professional title? I’ve done MVP courses, but have the skills and years of software and electrical experience to further and back them up

Comments No Comments »

When a hard disk is manufactured, there are areas on the platter that have bad sectors. Considering that on a 2 TB hard disk there are 4 billion sectors, then a few bad sectors is only a tiny proportion of the total number of sectors on the drive. During the test phases of a hard disk, the platters are scanned at the factory and the bad sectors are mapped out – these are generally called ‘Primary Defects’. The primary defects are stored in tables in the firmware zone, or in some cases the ROM of a hard disk. When you buy a brand new hard disk, you will most likely be completely unaware of these bad sectors and the numbers because they are ‘mapped out’ using ‘translator‘ algorithms.

Modern hard disks use Logical Block Addressing or LBA, this describes the sector numbering system on the hard disk, and goes in sequence
0,1,2,3,4,5,…..n-1,n (where n is the last sector on the drive.

Spare sector pools

All modern hard disk drives have a spare sector pool. This is used when bad sectors develop during the normal life of the hard disk and any newly found bad sectors are ‘replaced’ with good ones from the spare sector pool. This process is invisible to the user and they will probably never know that anything has changed.

How Bad Sector Mapping Works:

There are at least two methods of bad sector re-mapping (or translation) these are P-List and G-List.

  • P-list are defects found during manufacture and are also know as Primary Defects
  • G-List are defects that develop in normal use of the drive and are known as Grown Defects

There are other defect lists found in modern drives but the principles are similar. For example, you may find a T-List or a Track defect list, or an S-List or System area defect list.

P-List Remapping
Lets get into how these defect lists actually work, so let’s say we have a small hard disk with 100 sectors and a 10 sector spares pool.

When bad sectors are found at the factory, shift-points are entered into the P-List, if we take the following LBA sequence 0,1,2,3,4,5,6,7,8,9,10 …99, 100 Lets say that Sectors 3, 6 and 9 are found to be bad. When the first bad sector is found, the first part of the re-mapping process will look like this

0,1,2,B,4,5,6,7,8,9,10 ..

What happens here is the bad sector at position 3 is recorded in the P-List. The new map now looks like this;

0,1,2,P,3,4,5,6,7,8,9,10 ..  You can see now that 3 is where 4 was.

The next bad sector at LBA 6 is now found

0,1,2,P,3,4,5,B,7 and is again mapped out giving 0,1,2,P,3,4,5,P,6,7

When the whole sequence is complete, our final map looks like this.

0,1,2,P,3,4,5,P,6,7,8,P,9,10

Because these sectors are mapped out, the user will never be aware that they exist. If you want to look at sector 6, the drive will translate that to physical sector 8. It takes the 6 and adds the shift points to it, +1 for the bad sector at LBA3 and +1 for the bad sector at LBA 6. When the testing gets to the end of the drive, in order that it is of the correct size of 100 sectors, it allocates the sectors from the spare sector pool completely concealing the fact that there are bad sectors on the media. To all intents and purposes the drive looks just like the original as 1,2,3,4,5,6,7,8,9,10. However, our spare pool has reduced in size and there are now 7 sectors remaining in the spares pool.

After using the drive for a while some bad sectors develop the drive takes care of these using a grown defect list.

G-List Remapping
The grown defect list or G-List is a table containing the location of bad sector defects found during normal operation of the hard disk drive. When a bad sector occurs during normal use of the drive, something a similar process to P-List generation occurs – resulting with the bad sectors being mapped out. The process for G-List mapping out is slightly different. Lets say our hard disk develops a bad sector at the current LBA 6. What happens in this case is first the bad sector is mapped out. Giving; 0,1,2,3,4,5,G,7,8,9,10 .. A sector from the spare pool is allocated in the bad sectors place. We used 3 of these sectors in factory testing, so the next available bad sector is 104 this now becomes mapped to LBA 6 so our sequence would look like this; 0,1,2,3,4,5,104,7,8,9,10

Again, this process is completely invisible to the user and will still look like the original sequence of 0,1,2,3,4,5,6,7,8,9,10

You might ask, ‘why don’t the new defects get added to the P-List?‘ the answer is that if you add a grown defect to the P-List it has the effect of shifting the data up the drive for each sector from the point where the new bad sector is found. If you look again at the methodology behind the P-List it will help you understand this.

Where a G-List entry can help to revive hard disk, if there was data stored in the original sector attempts then usually it is lost. This may appear to the user as a file that not longer opens, or a a program that doesn’t run anymore or some other errant behaviour. This will not become apparent until the next time the file is attempted to be opened. It may also be that it is such a long time since it was opened that a backup plan means there are no backups of the working version. So bear this in mind when developing you backup plan.

Defect Mapping in a live system
When a hard disk is powered up, the p-list and g-list are usually loaded into RAM on the controller card. As requests for data come through, the location where the data is required from is passed to the translator, which makes the calculations necessary so as to determine which sectors to actually read in order to get to the actual data requested. In our example above, if we wanted the data from LBA 6 the translator would first run through the p-list and add 2 sectors to the count for the two bad sectors found at the factory, it then checks this value in the G-list and finds it has been re-allocated to sector 104. It then reads sector 104 and presents you with the data.

All the magic that goes unnoticed by normal people 🙂

Comments No Comments »

Many electronics manufacturers, including HDD manufacturers like Seagate, have been using the industry standard “Mean Time Between Failures” (MTBF) to quantify disk drive average failure rates. MTBF has proven useful in the past, but it is flawed.

To address issues of reliability, Seagate is changing to another standard: “Annualized Failure Rate” (AFR).

MTBF is a statistical term relating to reliability as expressed in power on hours (p.o.h.) and is often a specification associated with hard drive mechanisms.
It was originally developed for the military and can be calculated several different ways, each yielding substantially different results. It is common to see MTBF ratings between 300,000 to 1,200,000 hours for hard disk drive mechanisms, which might lead one to conclude that the specification promises between 30 and 120 years of continuous operation. This is not the case! The specification is based on a large (statistically significant) number of drives running continuously at a test site, with data extrapolated according to various known statistical models to yield the results.

Based on the observed error rate over a few weeks or months, the MTBF is estimated and not representative of how long your individual drive, or any individual product, is likely to last. Nor is the MTBF a warranty – it is representative of the relative reliability of a family of products. A higher MTBF merely suggests a generally more reliable and robust family of mechanisms (depending upon the consistency of the statistical models used). Historically, the field MTBF, which includes all returns regardless of cause, is typically 50-60% of projected MTBF.

Seagate’s new standard is AFR. AFR is similar to MTBF and differs only in units. While MTBF is the probable average number of service hours between failures, AFR is the probable percent of failures per year, based on the manufacturer’s total number of installed units of similar type. AFR is an estimate of the percentage of products that will fail in the field due to a supplier cause in one year. Seagate has transitioned from average measures to percentage measures.

MTBF quantifies the probability of failure for a product, however, when a product is first introduced: this rate is often a predicted number, and only after a substantial amount of testing or extensive use in the field can a manufacturer provide demonstrated or actual MTBF measurements. AFR will better allow service plans and spare unit strategies to be set.

Hard drive reliability is closely related to temperature. By operational design, the ambient temperature is 86°F. Temperatures above 122°F or below 41°F, decrease reliability. Directed airflow up to 150 linear feet/min. is recommended for high speed drives.

The failure rate does not include drive returns with “no trouble found”, excessive shock failure, or handling damage.

Here is an example excerpt from a Product Manual, in this case for the Barracuda ES.2 Near-Line Serial ATA drive, which we installed in a backup server at Kana’s datacentre:

The product shall achieve an Annualized Failure Rate – AFR – of 0.73% (Mean Time Between Failures – MTBF – of 1.2 Million hrs) when operated in an environment that ensures the HDA case temperatures do not exceed 40°C. Operation at case temperatures outside the specifications in Section 2.9 may increase the product Annualized Failure Rate (decrease MTBF). AFR and MTBF are population statistics that are not relevant to individual units.
AFR and MTBF specifications are based on the following assumptions for business critical storage system environments:

  • 8,760 power-on-hours per year.
  • 250 average motor start/stop cycles per year.
  • Operations at nominal voltages.
  • Systems will provide adequate cooling to ensure the case temperatures do not exceed 40°C. Temperatures outside the specifications in Section 2.9 will increase the product AFR and decrease MTBF.

1.2 million hours MTBF? I’d have expected that kind of lifetime from an older hard drive, from when they were made to LAST, from the days of manufacturers like Connor and ExelStor, but you certainly won’t get THAT kind of running hours from a modern drive, certainly not 1.2 million hours CONSTANT running!

Comments No Comments »

Bad sectors are little clusters of data on your hard disk that cannot be read. More than that, though, they have the potential to cause real damage to your hard drive (catastrophic failure) if they build up over time, stressing your hard drive’s arm, which contains the read/write head, there are two for each platter, one for each side. Bad sectors are fairly common with normal computer use and the imperfections of the world we live in. Like chip fabrication and LCD panel manufacturing, HDD manufacture is a very critical, precise process, and like a TFT with bad pixels from the factory, you do get bad sectors with a HDD due to imperfections when it’s made. The manufacturers make legal allowances for a certain limit to these imperfections before warranty claims can be made, like the legal limit of 5 dead pixels on a TFT. However, there are several simple steps you can take to prevent HDD bad sectors and to repair any that you do have. Having bad sectors will slow down computer performance as well, as your drive takes time attempting to read them. Here is a step-by-step guide. The most common questions I get as a computer engineer are “What is a sector?”, and “How are HDD bad sectors created?”

A sector is simply a unit of information stored on your hard disk. Rather than being a mass of fluid information, your hard disk stores things neatly into “sectors”, a bit like us humans putting things into boxes, and the box only holds so much, and all boxes are the same size. The standard sector size is 512 bytes.

There are various problems that can cause HDD bad sectors:

  • Improper shutdown of Windows, especially power loss while the HDD is writing data;
  • Defects of the hard disk, including general surface wear, pollution of the air inside the unit due to a dirty or clogged air filter, or the head touching the surface of the disk;
  • Other poor quality or aging hardware, including dodgy data cables, an overheated hard drive, and even a power supply problem, if your drive’s power is erratic;
  • Malware.

Hard and soft bad sectors

There are two types of bad sectors – hard and soft.

Hard bad sectors are the ones that are physically damaged (that can happen because of a head crash if your drive is dropped while running and writing data), or in a fixed magnetic state. If your computer is bumped while the hard disk is writing data, is exposed to extreme heat, or simply has a faulty mechanical part that is allowing the head to contact the disk surface, a “hard bad sector” might be created. Hard bad sectors cannot be repaired, but they can be prevented. The heads of a hard drive float on the air cushion generated by the platters spinning, they fly less than the width of a human hair away from the platters, even a small speck of dust is like a mountain, so knocks are definitely to be avoided.

Soft bad sectors occur when an error correction code (ECC) found in the sector does not match the content of the sector. Whenever a file is written to a sector, the drive calculates a “checksum”, which is used to verify the data, if it doesn’t match upon read, the drive knows the sector is weak. A soft bad sector is sometimes explained as the “hard drive formatting wearing out”, in other words the magnetic field is weakening, like an old video cassette – they are logical errors, not physical damage ones. These are repairable by overwriting everything on the disk with zeros. Like tapes and CD’s, the magnetic surface on a hard disk is not infinite, it is affected by other magnetic fields around it, so data recovery guys like me recommend regularly imaging a drive directly to another, frequently, to keep the data fresh and readable.

Preventing bad sectors

You can help prevent bad sectors (always better than trying to repair them, as they say prevention is better than cure!) by paying attention to both the hardware and the software on your computer.

Preventing bad sectors caused by hardware:

  • Make sure your computer is kept cool and dust free;
  • Make sure you buy good quality hardware from respected brands. Cheap RAM and power supplies are my biggest culprits from experience;
  • Always move your computer carefully, and make sure it is TURNED OFF, not in Sleep mode, it can wake up while being moved, especially a laptop;
  • Keep your data cables as short as possible;
  • Always shut down your computer correctly – use an uninterrupted power supply if your house is prone to blackouts.

Preventing bad sectors using software

  • Use a quality disk defragmenter program with automated scheduling to help prevent head crashes (head crashes can create hard bad sectors). Disk defragmentation reduces hard drive wear and tear, thus prolonging its lifetime and preventing bad sectors;
  • Run a quality anti-virus and anti-malware software and keep the programs updated.

Monitoring bad sectors

If you use a tool like HD Sentinel, or CrystalDiskInfo, and you notice bad sectors on your drive, keep an eye on it. A few sectors bad is not normally a problem, as I mentioned at the start of the article, up to 5 bad pixels on a new TFT is allowed before it becomes a warranty claim, hard drives are allowed a few bad sectors due to the imperfections of their manufacturing process. They are manufactured with what are known as “reserved sectors”, a spare area of the disk only accessible by the controller board. If a sector is weak, the controller will attempt to move the data to the reserved area, if this is successful it then attempts a quick read/write test on the old sector (takes less than a few milliseconds), if it fails it marks it as bad in the sector map, also stored in the drive reserved area, along with drive firmware, so that it doesn’t attempt to use it again.

If the number of bad sectors starts increasing, or you start to experience other symptoms, such as the drive dropping out completely as if you unplugged it, or any clicking, and data taking longer to read or copy, this could indicate a fault with the read/write heads, or the control circuitry. Stop using it immediately and back up any important data to another drive. If the failing drive is under warranty, print a log off from HD Sentinel and take it along with you to return the drive, as evidence.

S.M.A.R.T Values to look for

When looking at S.M.A.R.T (Smart Monitoring And Reporting Tool) analysis, the two main areas to look out for are:

Reallocated Sector Count

This shows how many of the drive’s Reserved sectors have been used. If too many of these are used it generally indicates a problem with the disk surface.

Current Pending Sector

This shows how many bad sectors are currently pending a rewrite. A hard drive will always try to rewrite the sector, if it fails, the sector is reallocated into the reserved, the drive adds the sector on to the Reallocated Sector Count, and the original sector is then marked as unusable. If the rewrite is successful, the Pending Sector count will drop.

 

Comments No Comments »

Western Digital make really good hard drives, but where their Elements, Passport and MyBook drives are concerned, they’ve taken a wrong turn. The 2.5″ versions all have proprietary PCB’s on the drives themselves, so there’s no standard micro SATA data and power connectors like you’d expect. The USB connector and LED, plus the interface controller, are on the single board as well! This means you can’t just take the drive out and connect it to another USB to SATA enclosure.

A lot of very modern WD Elements, MyBook and Passport enclosures are now also encrypted, meaning the data can only be accessed when the control board is functioning correctly. In this article I’ll show you how to recover data from a WD Passport (laptop sized drive) enclosure, if the USB connector gets damaged.

1. Disassemble the enclosure, remove the drive, then remove the PCB from the bottom of the drive using a Torx screwdriver.

2. Flip the drive board over, you’ll see the following capacitors. Remove them using a soldering iron or a heatgun, being careful not to overheat or damage anything:

usb-only-western-digital-drive-capacitors

3. Next you need to take a standard SATA connector from another drive, or from a parts supplier (eBay has them in droves, search for COMAX SATA connector). Once you have it, take a look at it, you’ll see long pins and short pins. All the long ones are GROUND pins:

sata-connector-ground-pins

4. From the back side of the PCB (the componentless side which faces away from the drive when fitted), you will see pins E71, E72, E73 and E74, these belong to the SATA data pins. The other four pins marked with a red square belong to ground pins:

usb-only-western-digital-drive-E-pins

5. Now solder everything together, using this pinout:

E71 – Tx+
E72 – Tx-
E73 – Rx-
E74 – Rx+

The SATA standard uses two lines, a positive and negative, for Data TX (Transmit), and two for Data RX (Recieve), each having a separate ground on the ground lines. Use my picture below as a wiring reference:

usb-only-western-digital-drive-finished-wiring

Now all you need to do is use a standard USB cable to power the drive (if your connector is broken you can try soldering the power lines of a USB cable to the port power pins), connect via SATA to your PC, and it should work. NOTE: This WILL NOT work if your drive uses encryption, as that runs through the USB data lines, because we’re bypassing it, it won’t work.

You may get some “USB device not recognized” errors. Try connecting the SATA drive to a SATA hotplug port, connecting the data cable first, then the power, once Windows has started. Hotplug ports are usually purple or orange, it depends on the board manufacturer, Gigabytes are purple.

Comments 25 Comments »