July 13th, 2021 19:00
Isilon H500 data protect test
Hello
I have an H500 system with 12 nodes. The SmartPools protection level is +2d:1n.
How can we simulate a node failure or a drive failure? We would like to test that the system's protection works as expected.
Can I smartfail a node or a drive for this? If so, how do I add the node or drive back to the system after the smartfail?
Thanks



DELL-Sam L
Moderator
July 14th, 2021 10:00
Hello engchang,
Here is a link to a video that explains how to do a failover/failback. https://dell.to/3yS1qRY
Peter_Sero
4 Operator
July 15th, 2021 07:00
@engchang
Keep in mind that smartfail rebuilds protection BEFORE the affected disk or node is removed from the cluster. Its purpose is 'graceful decommissioning', in the sense that you never run into under-protection.
If you want to see whether the actual protection of your cluster works as intended to protect against data loss from sudden component failures, you would need to simulate such a failure. Now I wouldn't suggest removing a disk from a powered-on node on a system under support. But of course you can power down a single node and see whether the system behaves as expected. If your system is not yet in regular production, power down a second node to get some experience with a situation where +2d:1n protection is not sufficient and some data will be unavailable. See how things clear up when at least one node is brought back online.
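As a rough way to plan which test scenarios should stay within protection and which should exceed it, here is a minimal Python sketch. It is only a simplified model, not how OneFS evaluates protection internally: it assumes +2d:1n means the data survives either up to two simultaneous drive failures or one node failure, and it conservatively treats any mix of node and drive failures as exceeding protection. The names `Protection`, `parse_protection` and `data_available` are made up for illustration.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Protection:
    drives: int  # simultaneous drive failures tolerated
    nodes: int   # simultaneous node failures tolerated


def parse_protection(level: str) -> Protection:
    """Parse a '+Md:Nn' style string such as '+2d:1n' (hypothetical helper)."""
    match = re.fullmatch(r"\+(\d+)d:(\d+)n", level)
    if match is None:
        raise ValueError(f"unsupported protection level: {level!r}")
    return Protection(drives=int(match.group(1)), nodes=int(match.group(2)))


def data_available(level: str, failed_nodes: int, failed_drives: int) -> bool:
    """Simplified check: is the failure scenario within the protection level?

    failed_drives counts failed drives in nodes that are still up.
    Assumption: mixed node + drive failures are treated as exceeding
    protection, which is deliberately conservative.
    """
    protection = parse_protection(level)
    if failed_nodes == 0:
        return failed_drives <= protection.drives
    if failed_drives == 0:
        return failed_nodes <= protection.nodes
    return False


if __name__ == "__main__":
    scenarios = [(1, 0), (2, 0), (0, 2), (0, 3)]
    for nodes, drives in scenarios:
        ok = data_available("+2d:1n", nodes, drives)
        print(f"nodes down={nodes}, drives down={drives}: within protection={ok}")
```

In this model, one node down is within protection, while a second node down is the case where some data is expected to become unavailable until a node comes back.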
To simulate one or more failed disks, remove the disk(s) from a powered-down node and then power up the node.
Alternatively you can use the 'stopfail' feature for disks or nodes (syntax as for smartfail). HOWEVER, unlike smartfail, stopfail will take down the affected disk or node immediately (as an actual HW or power failure would), and only AFTER this will the protection be rebuilt.
In either case, if your cluster is under support (and does 'phone-home' for critical events), check with support before doing those maneuvers. Fwiw, performing those maneuvers is highly instructive and will give you confidence for running OneFS in production.
hth
-- Peter
Phil.Lam
3 Apprentice
July 26th, 2021 15:00
I would do this as part of a POC at the beginning, not when it's in production.
Power up an Isilon/PowerScale simulator and play with it. It's your sandbox.