Suspect Drive Revived - Thanks, SpinRite

  • DNS Benchmark v2 is Finished and Available!
    Guest:
    That's right. It took an entire year, but the result far more accurate and feature laden than we originally planned. The world now has a universal, multi-protocol, super-accurate, DNS resolver performance-measuring tool. This major second version is not free. But the deal is, purchase it once for $9.95 and you own it — and it's entire future — without ever being asked to pay anything more. For an overview list of features and more, please see The DNS Benchmark page at GRC. If you decide to make it your own, thanks in advance. It's a piece of work I'm proud to offer for sale. And if you should have any questions, many of the people who have been using and testing it throughout the past year often hang out here.
    /Steve.
  • Be sure to checkout “Tips & Tricks”
    Dear Guest Visitor → Once you register and log-in please checkout the “Tips & Tricks” page for some very handy tips!

    /Steve.
  • BootAble – FreeDOS boot testing freeware

    To obtain direct, low-level access to a system's mass storage drives, SpinRite runs under a GRC-customized version of FreeDOS which has been modified to add compatibility with all file systems. In order to run SpinRite it must first be possible to boot FreeDOS.

    GRC's “BootAble” freeware allows anyone to easily create BIOS-bootable media in order to workout and confirm the details of getting a machine to boot FreeDOS through a BIOS. Once the means of doing that has been determined, the media created by SpinRite can be booted and run in the same way.

    The participants here, who have taken the time to share their knowledge and experience, their successes and some frustrations with booting their computers into FreeDOS, have created a valuable knowledgebase which will benefit everyone who follows.

    You may click on the image to the right to obtain your own copy of BootAble. Then use the knowledge and experience documented here to boot your computer(s) into FreeDOS. And please do not hesitate to ask questions – nowhere else can better answers be found.

    (You may permanently close this reminder with the 'X' in the upper right.)

Festeron

New member
Oct 12, 2024
3
0
Yesterday I flipped my main machine into CSM/BIOS mode, and ran SpinRite 6.1 R4 for the very first time. I had a 2TB drive that I had taken out of my StableBit DrivePool, because StableBit Scanner told me there were about 3,500 unreadable sectors. Windows Format found them as well.

SpinRite level 3 ran for 10 hours, and reported zero errors on the disk(!) The only red elements were in the SMART report that showed ECC corrected current/max as 110/114. The horizontal bar chart showed mostly blue squares and then two red dots [Dots in the log, but red squares in the SMART screen. Tiny bug?]

Back in Windows, I asked Scanner to examine what it thought were bad sectors, and it now reports that all is OK. I've put the drive back into the DrivePool.

So what has happened here? Did SpinRite test the bad sectors, and determine that they were OK after all? Or did it agree with Scanner, and substitute good sectors from an unused pool of good sectors, mapping them onto the bad ones? Either way, why were all the error numbers zero?
 
The SRLOG file (if you had logging on) should indicate what happened.

The fact that SpinRite reported zero errors on the disk suggests that SpinRite was able to successfully read all sectors and and successfully rewrite all slow reading sectors thus refreshing them and correcting the previous read issues.

If any sectors were reallocated all of that activity would be logged in the SRLOG file (if logging was on)
 
The SRLOG file (if you had logging on) should indicate what happened.

The fact that SpinRite reported zero errors on the disk suggests that SpinRite was able to successfully read all sectors and and successfully rewrite all slow reading sectors thus refreshing them and correcting the previous read issues.

If any sectors were reallocated all of that activity would be logged in the SRLOG file (if logging was on)

As I said in the original post, level 3.

Followup:
Despite SR telling me all was OK, those same sectors have become unreadable again. Unfortunately, this time they contained files, and those files were lost. I am unwilling to run SR again.

Now I've taken that drive out of the pool, and removed it from the machine. It's in my pile of dead drives awaiting destruction.
 
Despite SR telling me all was OK, those same sectors have become unreadable again. Unfortunately, this time they contained files, and those files were lost. I am unwilling to run SR again.
The drive was/is marginal. SpinRite is not magic. Generally failing hardware can't be repaired by making it work harder... so your outcome is not unexpected (to me anyway.)
 
Aww, I hate it when a hard drive fails - ouch!

But, correct, SpinRite is just a tool, offering:

data recovery in place,​
and​
drive maintenance.​

When ANY program 'refreshes' any sector, and
that sector then reads back OK, there is no way,
contemporaneously, to know if that sector will
really be good or not in the future.

In the early days of low level formatting, drive
maintenance programs built a database of bad
sectors-turned-good, and the programmers
created a threshold over which the program
stopped turning them good over and over, and
instead started leaving them marked as bad
even though they were recoverable.

Nowadays, drives maintain themselves, so all a
recovery-and-maintenance program can do is
ask and report, but the drive is ultimately the
boss.

In the opening post's case, yeah, the drive
seems untrustworthy.

That is a decision we must make as users.

That is not a decision that SpinRite or any
program can make, especially if the data comes
back, and the sector 'passes' a re-write and
re-read test.

When in the future that sector will fail, if ever,
is not information that is available to SpinRite
or any program.

Everything fails sometime, though I have
40-year-old drives that suggest otherwise.

We ask for SpinRite logs because there is
information that others see that the user may
not be sensitive to, such as drive make, model,
interface, attachment, timing, and other
details which we can compare to our own
experience.

"... a 2TB drive that I had taken out of my
StableBit DrivePool, because StableBit Scanner
told me there were about 3,500 unreadable
sectors
..."

Let me Google this for us:​
Q: Google, what is StableBit DrivePool,
StableBit Scanner?
A: StableBit DrivePool and StableBit Scanner
are two distinct software tools developed by
StableBit for managing and monitoring hard
drives.
DrivePool creates a single virtual drive from
multiple physical drives, allowing for file
duplication and easy data management.
Scanner monitors the health of hard drives,
performing surface scans and file system
checks to detect potential issues early on.
Here's a more detailed look at each:
StableBit DrivePool:
• Disk Pooling: Combines multiple physical
hard drives into a single, large virtual drive.
• File Duplication: Allows for redundant
storage of files across multiple drives,
protecting against data loss from drive
failures.
• File Management: Automatically moves
data from failing drives to healthy drives,
simplifying data migration.
• Flexibility: Supports various drive types and
sizes and can be easily expanded as needed.
StableBit Scanner:
• Disk Health Monitoring: Regularly checks
SMART data and performs surface scans to
identify potential drive problems.
• Surface Scanning: Detects unreadable sectors
on hard drives, helping to prevent data loss.
• File System Check: Ensures the integrity of
the file system, identifying potential corruption.
• Early Warning System: Alerts users to
potential drive failures, allowing for timely
data backup and drive replacement.
In essence, DrivePool provides a way to create
a large, resilient storage pool, while Scanner
provides a way to monitor and maintain the
health of the drives within that pool.

StableBit's web site claims:

"... Continuously scans all of your hard drives,
ensuring that every precious bit remains
readable ..."


That sounds like it does on-line and
in-real-time what SpinRite does off-line.

"... those same sectors have become unreadable
again. Unfortunately, this time they contained
files, and those files were lost
..."

Sounds like a a failure of StableBit to fulfill
their promise to make sure there was a backup.

"... I am unwilling to run SR again ..."

Is unrelated, because the drive is no longer
used for data, so you can run SpinRite on it
forever now, uninterrupted.

So use the 'bad' drive for testing, use it to
learn about the drives, and use it to learn
about the tests.

Essentially, you are testing the drives, and you
are testing StableBit, as well as testing SpinRite.

And SpinRite can help by providing on outside
look by which to compare.

And then run SpinRite on all the other drives,
and especially run SpinRite on any drive that is
new to you so you can get a sense of both of
their behaviors doing what they promises to
do - write and read back data.

By watching the screens, and reading the
details, we see if sectors are slow, if there are
a lot of error corrections to produce the
promised data storage density ( though
Seagate seems to use ECC differently ).

Hence I run SpinRite Level 5 on all drives new
to me before I use them, so I know what to
compare to as they age over time and I retest
later.

- - - - -

And about StableBit's "... checks SMART data ..."
... everybody knows that S.M.A.R.T. is incredibly
stupid, and so, StableBit ought to know, too,
so there's that.

I have drives that are so overwhelmed with
self-ECC data recovery that they read at
5 MB / second, yet produce no S.M.A.R.T.
errors.

I have drives that sing along at full speed,
reliable for years, yet have critical S.M.A.R.T.
errors on boot that warn that drive failure is
imminent.

Just about the only thing S.M.A.R.T. is reliably
good for is temperature, and yet, some drives
are crappy at reporting that, and some are
even hard-coded to report appropriate drive
temperature regardless of having no
temperature chips, so there's that.

StableBit, and Backblaze, and others say they
depend on S.M.A.R.T., yet we all know
S.M.A.R.T. has nothing to offer that is
dependable.

- - - - -

So, what 2 TB drive was that?

And please do share the SpinRite log so we all
can learn.

Also run free ValiDrive
https://www.grc.com/validrive.htm on the drive
attached to a Windows computer via a USB
adapter, and share the log - ValiDrive is like a
mini-SpinRite Level 5 across 576 regions of a
USB-attached drive.

Seeing both SpinRite and ValiDrive logs will let
us look over your shoulder and see what you
see, and more.

Thanks.
 
Last edited:
More on "... everybody knows that S.M.A.R.T.
is incredibly stupid
..."

Let me Google that for us, too:

Q: Google, Why is HDD and SSD SMART data
not useful?
A: While SMART data can be a useful indicator
of a drive's health, it's not a foolproof predictor
of failure and can sometimes be misleading.
It's important to understand that SMART data
primarily reflects past issues and doesn't
always catch sudden, catastrophic failures.
Furthermore, inconsistencies in how different
manufacturers implement and report SMART
attributes can make interpreting the data
challenging.
Here's a more detailed breakdown:
Limitations of SMART data:
Not a complete picture:
SMART data focuses on specific attributes
that can indicate potential problems, but it
doesn't cover all possible failure modes.
Drives can still fail suddenly without
triggering any SMART warnings.
Inconsistent reporting:
Different manufacturers may use different
SMART attributes or have varying
interpretations of the same attributes,
leading to inconsistencies and making it
difficult to compare data across drives.
Predicting sudden failure is difficult:
SMART is better at predicting gradual
degradation, but it's not reliable for
catching sudden, catastrophic failures that
can happen without warning.
May trigger false alarms:
SMART data can sometimes report errors
that don't have a significant impact on the
drive's performance or lifespan, leading to
unnecessary replacements.
Encrypted drives:
If a drive is encrypted, a sudden failure
could render the encryption keys
inaccessible, leading to complete data loss,
regardless of any SMART warnings.
What SMART data is good for:
Identifying gradual degradation:
SMART can be helpful in identifying trends
of declining performance or increasing
error rates, which could indicate an
impending failure.
Initial troubleshooting:
SMART data can be a good starting point
for diagnosing potential drive issues.
Data backup:
SMART data can help users prioritize data
backups before a drive fails, especially if
it's showing signs of deterioration.
In conclusion:
SMART data is a valuable tool for
monitoring drive health, but it should not
be solely relied upon for predicting or
preventing failures. Users should combine
SMART data analysis with other diagnostic
tools and practices, such as regular backups
and testing, to ensure data safety and
minimize the risk of data loss.