Hey everyone,

I recently built my first NAS. It was bough used with SAS hardware. I’ve finally got past all the roadblocks and problems that were in my way (I basically bricked a whole SAS drive, a hero of a lemmy user helped me fix it).

Now after filling the 15 TB of RAIDZ2 with around 100gb of data. One of the drives started waiving its white flag and wants to die on me.

I am a complete beginner with no experience with these things.

Is my drive dying and should be replaced? or can it be fixed?

This is the output of the 507 errors that TrueNAS received form it and labelled the vDev as degraded and the drive as faulted:

Output of zpool status and sudo smartctl -a /dev/sdd

As a beginner it looks like this drive is cooked, please let me know if it needs replacing so I can order a new one and replace it right away.

Thank you sooo much!

Edit: SAS not SATA drives

  • SapphironZA@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    2
    ·
    edit-2
    5 hours ago

    My rule for older hardware, before trusting the ZFS fault reporting, I would follow the following steps.

    (Note these are homelabber steps and not what I would do in the enterprise, where risk and time is a lot more expensive than replacing hardware)

    1. Check the Smart data of the drive. If it reports the drive as faulty, replace it.

    2. Zpool clear the error and see if it comes back. Sometimes drive errors are not cause by the drive itself

    3. Reseat the drive and the cables between the motherboard and the drive. Clear errors after this step. Especially with older hardware and it having travelled from its previous owner to you, something might not be seated properly.

    4. Move the drive to another drive bay, or swap it with another drive. If the errors move with the drive, the drive is faulty. If the errors move to the bay, you probably have a good drive, but a faulty drive bay/cable.