IT Survey: Preemptive Drive Replacement in RAID Arrays
-
@Breffni-Potter said:
For the hard to access station, they should have spares on a shelf, but in theory, when you buy a drive and store it for 3 years, what happens with the warranty if you put it in and it dies after a month?
You could do hot spares or even cold spares in a chassis so that you can do many drive replacements without needing to be physically at the location.
Although how many places are remote AND unmanned?
-
@MattSpeller said:
@Breffni-Potter spares are a luxury unless you use them on a regular basis
That would only be the case in a situation where you were not comparing to preemptive replacement which is many times (orders of magnitude most likely) more expensive than spares, even tons of spares, even 100% of spares. Preemptive replacement of healthy drives means you have to have spares and use them over and over again, even when the original drives have not failed!
So every luxury of spares PLUS the luxury of just throwing out good drives for the fun of throwing them out!
-
@Dashrender said:
@Breffni-Potter said:
For the hard to access station, they should have spares on a shelf, but in theory, when you buy a drive and store it for 3 years, what happens with the warranty if you put it in and it dies after a month?
It would be out of warranty. But this wouldn't be the situation as @MattSpeller is describing. If they only visit the site say once every 3 months, presumably they would bring drives with them.
But really, you wouldn't setup a system that relied on this type of solution in this scenerio, you'd choose something with more robustness built in. Though I can't tell you what that would look like. Perhaps 2 or even three equal sized arrays kept in sync with redundant data paths, etc. If the data is that important, but you can only visit the site once every three months, you can't just use the day to day setup in most cases.
I'm not sure that is true. You might balance SSD and Winchester drives to alter robustness for the scenario in question, but avoiding RAID might not make sense. With failure rates without spares reaching into the tens and hundreds of thousands of years MTBF on RAID 10, going with a RAID 10, even a large one, with a number of hot spares could give an unmanned station decades of reliable operation time before arrays need to be replaced - likely longer than the equipment is viable.
-
@Dashrender said:
@MattSpeller said:
@Dashrender exactly, there are much better ways to set that kinda thing up - I think we're still looking for a scenario where dude-buddy-guy from SW forums would be right. He may just be 100% wrong.
well, again, my friends suggested reason, lack of personnel resources in times of emergency, could be a reason.
No, it makes investment in spares make sense but still doesn't justify preemptive, it would do the opposite. Having spares is what you do when you don't have available labour, not burning them up and throwing them out.