Equallogic PS6210 Battery Replacement
Hello,
We have an EqualLogic PS6210 whose battery needs to be replaced.
Can I just restart the active controller (Control Module Slot 1) so that it automatically fails over to the top one (Control Module Slot 0) with no downtime? This is a production system, so I need to be careful.
Thanks
dwilliam62
1 Rookie
1.5K Posts
0
December 10th, 2020 10:00
Hello,
Yes, you can. You should have set the disk timeout values according to the OS considerations guide that's located with the EQL firmware downloads. If you install the HIT/ME or HIT/LE software, that will be set automatically.
The failover time depends on the EQL firmware version, the model (6210s are fast), and the load on the array at the time.
These settings should be in place to handle unexpected failovers and firmware updates, so setting them is a good thing to do regardless. If they are not already set, some OSes, such as Windows, require a restart for the new values to take effect.
Regards,
Don
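For readers who want to verify the disk timeout on a Linux host before triggering a failover, here is a minimal sketch. It assumes the standard sysfs layout, where each SCSI disk exposes its timeout in seconds at /sys/block/<device>/device/timeout. The 60-second threshold and the function names are assumptions for illustration only; the OS considerations guide mentioned above is the authority for the correct value on your OS.

```python
from pathlib import Path

# Assumed threshold in seconds; take the real value from the OS considerations guide.
ASSUMED_MIN_TIMEOUT = 60


def read_disk_timeouts(sys_block=Path("/sys/block")):
    """Return {device_name: timeout_seconds} for block devices that expose a SCSI timeout."""
    timeouts = {}
    for dev in sorted(Path(sys_block).iterdir()):
        timeout_file = dev / "device" / "timeout"
        if timeout_file.is_file():
            timeouts[dev.name] = int(timeout_file.read_text().strip())
    return timeouts


def devices_below(timeouts, minimum=ASSUMED_MIN_TIMEOUT):
    """Return the sorted names of devices whose timeout is lower than the minimum."""
    return sorted(name for name, seconds in timeouts.items() if seconds < minimum)
```

Calling devices_below(read_disk_timeouts()) on the host lists any disks that might time out before a controller failover completes. It only reads values and changes nothing.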
LAS2001
1 Message
0
February 8th, 2021 15:00
I have the same issue. I have replaced the battery on slot 0 and performed multiple maintenance resets, allowed over 72 hours and slot 0 still reports an end of life warning. Any suggestions appreciated. Thank you in advance.
dwilliam62
1 Rookie
1.5K Posts
0
February 8th, 2021 20:00
Hello,
I would go back to your vendor and get another battery. Once you replace the battery with a good one, there's no other action required. The battery assembly keeps track of manufacture data, number of cycles, etc. It's similar to a SMART failure on a drive. Replacement is the only option.
Regards,
Don
Dell-DylanJ
2.9K Posts
1
May 4th, 2021 07:00
Hello,
While I do not know what resolution the original poster reached, I would expect you would need a different battery. Like Don mentioned above, once the battery is replaced, there really isn't anything more for you to do. Page 26 of the document below goes into detail on battery replacement.
https://downloads.dell.com/manuals/common/ps6210_om_en.pdf
sgt21ct
7 Posts
0
May 4th, 2021 07:00
Thanks very much for the prompt response!
I followed this guide, except that I powered off the storage entirely before removing the controllers and replacing the batteries. I found a lot of information about the charging time of the controller batteries. Do you know how long we need to wait for it? Is it possible that this error is related to an uncharged battery?
Thanks again!
sgt21ct
7 Posts
0
May 4th, 2021 07:00
Hello, I'm facing the same problem. Even after the replacement, both slots keep showing the same error.
Did you find any other solution? Or did you need to ask for another battery?
Thanks!
EDIT: My cache policy is set to write-through, which is impairing the storage performance.
Dell-DylanJ
2.9K Posts
1
May 4th, 2021 08:00
You could give it 24 hours from the time of replacement, because the batteries do go through charging and learn cycles. I'm not confident this will resolve your issue by itself, but there's certainly no harm in observing to see if the battery does come up properly. However, if it doesn't come up after that, I'd be considering a different battery.
sgt21ct
7 Posts
0
May 4th, 2021 09:00
Thank you both for the patience and quick response.
So I'll wait 24 hours after the replacement, and if it does not come back up, we will try to acquire a new battery or a new controller.
dwilliam62
1 Rookie
1.5K Posts
1
May 4th, 2021 09:00
Hello,
That would not be a very good idea. A power loss, or possibly even a controller failover, could result in lost data, including array-side operations to the internal databases, which could leave that member inaccessible.
GrpName>member select MEMBERNAME low-battery-safe ?
enable - Enables low-battery-safe mode. Cache operates in write-back mode.
disable - Disables low-battery-safe mode. Cache operates in write-back mode, regardless of battery charge. Not recommended due to risk of data loss.
The time to flush the cache depends on many factors, RAID type and I/O load foremost among them.
I strongly advise that you do NOT disable this feature.
Regards,
Don
Dell-DylanJ
2.9K Posts
1
May 4th, 2021 09:00
Like Don said, I really don't think it would be a great idea to disable this. I'd really only be repeating his post, but turning this off would be a disservice to yourself.
sgt21ct
7 Posts
0
May 4th, 2021 09:00
Nice, thank you.
About the cache policy, what is the impact of disabling low-battery-safe? It is risky only if the storage powers off unexpectedly, right?
Does changing this option have any impact on the VMs whose disks run on the storage? And in the event of a power failure, if I have time to enable low-battery-safe again before the storage powers off, how long does the storage take to flush the data from cache to disk, so that the data held in cache is not lost?
dwilliam62
1 Rookie
1.5K Posts
1
May 4th, 2021 10:00
Hello,
Correct. I suspect you will have to replace the batteries.
Regards,
Don
sgt21ct
7 Posts
0
May 4th, 2021 12:00
Sorry for the inconvenience.
I was reading the storage logs from the startup process after the replacement, and I am posting them here:
776:229:member1:SP: 3-May-2021 19:44:07.690228:emm.c:2854:INFO:28.2.171:Smart Ba
ttery now has sufficient charge to reliably power the control module through a p
ower failure event.
774:8:member1:netmgtd: 3-May-2021 19:43:40.270007:rca_ocp.c:2113:INFO:25.2.16:GU
I: Account grpadmin from 10.122.7.185 logged in to 10.122.8.210, using local aut
hentication. User privilege is group-admin.
769:3:member1:netmgtd: 3-May-2021 19:41:39.930002:rca_ocpconn.c:754:INFO:25.2.17
:GUI: Account grpadmin from 10.122.7.185 to 10.122.8.210 logged out.
756:1:member1:netmgtd: 3-May-2021 19:33:07.300000:rca_ocp.c:2113:INFO:25.2.16:GU
I: Account grpadmin from 10.122.7.185 logged in to 10.122.8.210, using local aut
hentication. User privilege is group-admin.
667:128:member1:SP [secondary]: 3-May-2021 19:32:38.770126:emm.c:2363:ERROR:28.4
.47:Critical health conditions exist.
Correct immediately before they affect array operation.
The service life of the control module battery is depleted.
There are 1 outstanding health conditions. Correct these conditions before they
affect array operation.
666:224:member1:SP: 3-May-2021 19:32:38.680223:emm.c:2363:WARNING:28.3.51:Warnin
g health conditions currently exist.
Correct these conditions before they affect array operation.
Smart Battery is undercharged.
There are 1 outstanding health conditions. Correct these conditions before they
affect array operation.
665:223:member1:SP: 3-May-2021 19:32:38.680222:emm.c:2854:INFO:28.2.30:Control m
odules have completed initializing, and failover is now operational.
654:118:member1:SP [secondary]: 3-May-2021 19:32:37.020118:cord.c:710:INFO:28.2.
108:Control module in slot 1 with serial number CN-0D85Y9-77921-4BK-005A is desi
gnated as secondary.
526:219:member1:SP: 3-May-2021 19:32:36.320218:emm.c:2363:WARNING:28.3.51:Warnin
g health conditions currently exist.
Correct these conditions before they affect array operation.
Control modules are initializing. Control module failover cannot occur until th
e initialization completes.
Smart Battery is undercharged.
There are 2 outstanding health conditions. Correct these conditions before they
affect array operation.
525:218:member1:SP: 3-May-2021 19:32:36.320217:emm.c:2363:WARNING:28.3.51:Warnin
g health conditions currently exist.
Correct these conditions before they affect array operation.
Smart Battery is undercharged.
There are 1 outstanding health conditions. Correct these conditions before they
affect array operation.
524:217:member1:SP: 3-May-2021 19:32:36.320216:emm.c:2854:INFO:28.2.22:Dual cont
rol modules are now communicating properly.
650:117:member1:SP [secondary]: 3-May-2021 19:32:32.460117:emm.c:2363:ERROR:28.4
.47:Critical health conditions exist.
Correct immediately before they affect array operation.
649:0:member1:emd [secondary]: 3-May-2021 19:32:32.460000:emdSBS.cc:378:ERROR:28
.4.165:The service life of the cache battery on the control module in slot 1 is
depleted.
645:0:member1:QRQ [secondary]: 3-May-2021 19:32:27.020114:qrq.c:910:INFO:9.2.0:P
S Series Array Firmware Version: Storage Array Firmware V8.1.0 (R417284)
640:110:member1:SP [secondary]: 3-May-2021 19:32:27.010110:eqllog_mbuf_Q.c:1100:
ERROR:2.4.0:Panic recovery from CPU0 with reason 'xlp_ddr_ecc_disable_error: DDR
correctable threshold exceeded on channel 2, cause failover'.
635:105:member1:SP [secondary]: 3-May-2021 19:32:27.010105:ppool_nvram.c:200:ERR
OR:15.4.18:Saved function call stack, CPU 0
00000000e05a2244 00000000e05989c4 00000000e0821534 00000000e0821bd8
00000000e0838e3c 00000000e083917c 00000000e0566274 00000000e055fafc
00000000e055c9a4 00000000e05877ec 00000000e057eca4 00000000e05acd80
634:104:member1:SP [secondary]: 3-May-2021 19:32:27.010104:ppool_nvram.c:194:ERR
OR:15.4.17:Saved CP0 registers, CPU 0
badva 0000000078768340 epc 0000000078768354 errorepc 0000000000000000
sr 4080ff81 cause 00000120 errctl 00000000 cacheeri 00000000 cacheerd 000
00000
buserr 0000000000000000 cacheerrdpa 0000000000000000
633:103:member1:SP [secondary]: 3-May-2021 19:32:27.010103:ppool_nvram.c:188:ERR
OR:15.4.5:Saved CPU registers, CPU 0
at 0000000000000000 v0 0000000000000001 v1 0000000000000000
a0 ffffffffe0aaf3d0 a1 ffffffffe0a6ae88 a2 0000000000000002 a3 0000000000200000
t0 ffffffffe0be0000 t1 000000000000005d t2 0000000000000000 t3 0000000000000000
t4 0000000000000040 t5 0000000000000004 t6 0000000000000020 t7 0000000000000000
s0 ffffffffe0aaf3d0 s1 ffffffffe0a70000 s2 ffffffffe16c0000 s3 ffffffffe16c0000
s4 045e7b272f608771 s5 ffffffffe114df10 s6 ffffffffe1140000 s7 98000000bea25560
t8 0000003041a5dc19 t9 0000000000000000 k0 c000000038397d00 k1 98000000bea25560
gp ffffffffe116e2e0 sp c000000038397670 s8 fffffffffffffff7 ra ffffffffe05989c4
632:102:member1:SP [secondary]: 3-May-2021 19:32:27.010102:ppool_nvram.c:396:ERR
OR:15.4.1:NVRAM contains valid data. This is a PANIC RECOVERY due to a panic on
a NetBSD processor.
497:216:member1:SP: 3-May-2021 19:32:21.860215:emm.c:2363:WARNING:28.3.51:Warnin
g health conditions currently exist.
Correct these conditions before they affect array operation.
Active control module cannot communicate with secondary control module. Failove
r cannot occur.
Smart Battery is undercharged.
There are 2 outstanding health conditions. Correct these conditions before they
affect array operation.
496:215:member1:SP: 3-May-2021 19:32:21.860214:emm.c:2858:INFO:28.2.20:Control m
odule has been installed in slot 1.
338:24:member1:psgd: 3-May-2021 19:31:43.330023:psgd_group.cc:18053:INFO:18.2.0:
Group member member1 now active in the group.
213:196:member1:SP: 3-May-2021 19:31:04.480195:emm.c:2363:WARNING:28.3.51:Warnin
g health conditions currently exist.
Correct these conditions before they affect array operation.
Control module was removed from array.
Active control module cannot communicate with secondary control module. Failove
r cannot occur.
Smart Battery is undercharged.
There are 3 outstanding health conditions. Correct these conditions before they
affect array operation.
211:195:member1:SP: 3-May-2021 19:31:04.470194:emm.c:2363:ERROR:28.4.47:Critical
health conditions exist.
Correct immediately before they affect array operation.
The service life of the control module battery is depleted.
There are 1 outstanding health conditions. Correct these conditions before they
affect array operation.
212:2:member1:emd: 3-May-2021 19:31:04.470001:emdSBS.cc:495:WARNING:28.3.171:Sma
rt Battery on control module 0 has insufficient charge to reliably power the con
trol module through a power failure event. The battery should return to normal o
peration within an hour.
210:1:member1:emd: 3-May-2021 19:31:04.470000:emdSBS.cc:378:ERROR:28.4.165:The s
ervice life of the cache battery on the control module in slot 0 is depleted.
209:194:member1:SP: 3-May-2021 19:31:04.440193:emm.c:2806:INFO:28.2.176:The cont
rol module cache battery end of life critical condition has been cleared.
208:193:member1:SP: 3-May-2021 19:31:04.440192:emm.c:2363:WARNING:28.3.51:Warnin
g health conditions currently exist.
Correct these conditions before they affect array operation.
Control module was removed from array.
Active control module cannot communicate with secondary control module. Failove
r cannot occur.
There are 2 outstanding health conditions. Correct these conditions before they
affect array operation.
207:192:member1:SP: 3-May-2021 19:31:04.440191:emm.c:2854:INFO:28.2.171:Smart Ba
ttery now has sufficient charge to reliably power the control module through a p
ower failure event.
202:190:member1:SP: 3-May-2021 19:31:03.870189:b2b.c:2786:INFO:28.2.119:Failover
completed for control module in slot 0 with serial number CN-0D85Y9-77921-4BK-0
01H.
198:187:member1:SP: 3-May-2021 19:31:03.690186:emm.c:1333:INFO:28.2.6:Enclosure
serial number: CN-04FMJV-70821-47P-0FIM-A00.
197:186:member1:SP: 3-May-2021 19:31:03.690185:emm.c:1435:ERROR:28.4.50:Control
module in slot 1 is not functioning or not installed.
141:130:member1:SP: 3-May-2021 19:31:03.010129:emm.c:2363:WARNING:28.3.51:Warnin
g health conditions currently exist.
Correct these conditions before they affect array operation.
Control module was removed from array.
Active control module cannot communicate with secondary control module. Failove
r cannot occur.
Smart Battery is undercharged.
There are 3 outstanding health conditions. Correct these conditions before they
affect array operation.
140:129:member1:SP: 3-May-2021 19:31:03.010128:b2b.c:2537:WARNING:33.3.0:Communi
cation link between control modules has shut down.
139:128:member1:SP: 3-May-2021 19:31:03.010127:cache_driver.cc:1056:WARNING:28.3
.17:Active control module cache is now in write-through mode. Array performance
is degraded.
138:127:member1:SP: 3-May-2021 19:31:03.010126:emm.c:2363:WARNING:28.3.51:Warnin
g health conditions currently exist.
Correct these conditions before they affect array operation.
Active control module cannot communicate with secondary control module. Failove
r cannot occur.
Smart Battery is undercharged.
There are 2 outstanding health conditions. Correct these conditions before they
affect array operation.
591:216:member1:SP: 3-May-2021 19:31:02.940216:hal_corr_handler.c:334:INFO:28.2.
166:EVENT_NEAR_UNEXPECTED_REBOOT:A correctable error has been detected on contro
ller in slot 1 .
501:23:member1:psgd: 3-May-2021 19:31:00.910023:psgd_group.cc:18053:INFO:18.2.0:
EVENT_NEAR_UNEXPECTED_REBOOT:Group member member1 now active in the group.
367:121:member1:SP [secondary]: 3-May-2021 19:30:47.960120:emm.c:2363:ERROR:28.4
.47:EVENT_NEAR_UNEXPECTED_REBOOT:Critical health conditions exist.
Correct immediately before they affect array operation.
The service life of the control module battery is depleted.
There are 1 outstanding health conditions. Correct these conditions before they
affect array operation.
131:121:member1:SP [secondary]: 3-May-2021 19:30:47.960120:emm.c:2363:ERROR:28.4
.47:Critical health conditions exist.
Correct immediately before they affect array operation.
The service life of the control module battery is depleted.
There are 1 outstanding health conditions. Correct these conditions before they
affect array operation.
366:120:member1:SP [secondary]: 3-May-2021 19:30:47.960119:emm.c:2363:WARNING:28
.3.51:EVENT_NEAR_UNEXPECTED_REBOOT:Warning health conditions currently exist.
Correct these conditions before they affect array operation.
Smart Battery is undercharged.
There are 1 outstanding health conditions. Correct these conditions before they
affect array operation.
130:120:member1:SP [secondary]: 3-May-2021 19:30:47.960119:emm.c:2363:WARNING:28
.3.51:Warning health conditions currently exist.
Correct these conditions before they affect array operation.
Smart Battery is undercharged.
There are 1 outstanding health conditions. Correct these conditions before they
affect array operation.
355:114:member1:SP [secondary]: 3-May-2021 19:30:24.100114:cord.c:710:INFO:28.2.
108:EVENT_NEAR_UNEXPECTED_REBOOT:Control module in slot 0 with serial number CN-
0D85Y9-77921-4BK-001H is designated as secondary.
124:114:member1:SP [secondary]: 3-May-2021 19:30:24.100114:cord.c:710:INFO:28.2.
108:Control module in slot 0 with serial number CN-0D85Y9-77921-4BK-001H is desi
gnated as secondary.
350:112:member1:SP [secondary]: 3-May-2021 19:30:21.640112:emm.c:2363:WARNING:28
.3.51:EVENT_NEAR_UNEXPECTED_REBOOT:Warning health conditions currently exist.
Correct these conditions before they affect array operation.
119:112:member1:SP [secondary]: 3-May-2021 19:30:21.640112:emm.c:2363:WARNING:28
.3.51:Warning health conditions currently exist.
Correct these conditions before they affect array operation.
Smart Battery is undercharged.
There are 1 outstanding health conditions. Correct these conditions before they
affect array operation.
348:111:member1:SP [secondary]: 3-May-2021 19:30:21.640111:emm.c:2363:ERROR:28.4
.47:EVENT_NEAR_UNEXPECTED_REBOOT:Critical health conditions exist.
Correct immediately before they affect array operation.
118:2:member1:emd [secondary]: 3-May-2021 19:30:21.640111:emdSBS.cc:495:WARNING:
28.3.171:Smart Battery on control module 0 has insufficient charge to reliably p
ower the control module through a power failure event. The battery should return
to normal operation within an hour.
117:111:member1:SP [secondary]: 3-May-2021 19:30:21.640111:emm.c:2363:ERROR:28.4
.47:Critical health conditions exist.
Correct immediately before they affect array operation.
The service life of the control module battery is depleted.
There are 1 outstanding health conditions. Correct these conditions before they
affect array operation.
347:1:member1:emd [secondary]: 3-May-2021 19:30:21.630001:emdSBS.cc:378:ERROR:28
.4.165:EVENT_NEAR_UNEXPECTED_REBOOT:The service life of the cache battery on the
control module in slot 0 is depleted.
116:1:member1:emd [secondary]: 3-May-2021 19:30:21.630001:emdSBS.cc:378:ERROR:28
.4.165:The service life of the cache battery on the control module in slot 0 is
depleted.
203:190:member1:SP: 3-May-2021 19:30:15.640190:emm.c:1333:INFO:28.2.6:EVENT_NEAR
_UNEXPECTED_REBOOT:Enclosure serial number: CN-04FMJV-70821-47P-0FIM-A00.
341:0:member1:QRQ [secondary]: 3-May-2021 19:30:14.010109:qrq.c:910:INFO:9.2.0:E
VENT_NEAR_UNEXPECTED_REBOOT:PS Series Array Firmware Version: Storage Array Firm
ware V8.1.0 (R417284)
110:0:member1:QRQ [secondary]: 3-May-2021 19:30:14.010109:qrq.c:910:INFO:9.2.0:P
S Series Array Firmware Version: Storage Array Firmware V8.1.0 (R417284)
188:184:member1:SP: 3-May-2021 19:30:11.710184:emm.c:2363:ERROR:28.4.47:EVENT_NE
AR_UNEXPECTED_REBOOT:Critical health conditions exist.
Correct immediately before they affect array operation.
The service life of the control module battery is depleted.
There are 1 outstanding health conditions. Correct these conditions before they
affect array operation.
187:1:member1:emd: 3-May-2021 19:30:11.710001:emdSBS.cc:378:ERROR:28.4.165:EVENT
_NEAR_UNEXPECTED_REBOOT:The service life of the cache battery on the control mod
ule in slot 1 is depleted.
186:183:member1:SP: 3-May-2021 19:30:11.700183:cache_driver.cc:1058:INFO:28.2.39
:EVENT_NEAR_UNEXPECTED_REBOOT:Active control module cache set to write-back mode
.
139:0:member1:QRQ: 3-May-2021 19:30:06.610000:qrq.c:910:INFO:9.2.0:EVENT_NEAR_UN
EXPECTED_REBOOT:PS Series Array Firmware Version: Storage Array Firmware V8.1.0
(R417284)
1891921:0:member1:logevent: 3-May-2021 18:58:52.230000:logevent.cc:238:WARNING:2
5.3.0:User has initiated a clean halt restart.
Just after the reboot, when I connected to the group administration tool, there was a warning that "Smart Battery is undercharged", and after a few minutes it disappeared. I have now found this entry in the log:
Smart Battery now has sufficient charge to reliably power the control module through a power failure event.
Is this Smart Battery the same battery that I replaced, or is it something else?
Thanks again!
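A log this long is easier to read once the battery events are separated from the rest. The sketch below is an illustration only: the field layout (sequence numbers, member, process, timestamp, source file and line, level, event code, message) is inferred from the entries pasted above, not taken from Dell documentation, and the 80-column wrap width is an assumption about how the console output was pasted. All function and field names are hypothetical.

```python
import re

WRAP_WIDTH = 80  # assumption: the pasted console output was hard-wrapped at 80 columns

# Start of an event: "<seq>:<sub>:<member>:<process>: <d-Mon-yyyy> ..."
EVENT_START = re.compile(r"^\d+:\d+:[^:\s]+:[^:]+:\s*\d{1,2}-[A-Za-z]{3}-\d{4} ")

# Full event after the wrapped lines have been joined back together.
EVENT = re.compile(
    r"^(?P<seq>\d+):(?P<sub>\d+):(?P<member>[^:]+):(?P<process>[^:]+):\s*"
    r"(?P<timestamp>\d{1,2}-[A-Za-z]{3}-\d{4} \d{2}:\d{2}:\d{2}\.\d+):"
    r"(?P<source>[^:]+):(?P<line>\d+):(?P<level>[A-Z]+):(?P<code>[\d.]+):"
    r"(?P<message>.*)$",
    re.DOTALL,
)


def unwrap(lines, width=WRAP_WIDTH):
    """Join lines: a full-width line was cut mid-text, a shorter one ended naturally."""
    parts = []
    for i, line in enumerate(lines):
        parts.append(line)
        if i < len(lines) - 1 and len(line) < width:
            parts.append(" ")
    return "".join(parts)


def parse_events(lines):
    """Group raw lines into events and return one dict of named fields per event."""
    groups = []
    for line in lines:
        line = line.rstrip("\n")
        if EVENT_START.match(line):
            groups.append([line])
        elif groups:
            groups[-1].append(line)
    events = []
    for group in groups:
        match = EVENT.match(unwrap(group))
        if match:
            events.append(match.groupdict())
    return events


def battery_events(events):
    """Keep only the events whose message mentions a battery."""
    return [e for e in events if "battery" in e["message"].lower()]
```

Feeding the pasted lines to battery_events(parse_events(lines)) and printing the timestamp, level, and message of each result makes it easier to see that the "undercharged" warnings clear while the "service life ... is depleted" errors remain.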
dwilliam62
1 Rookie
1.5K Posts
1
May 4th, 2021 13:00
Hello,
The smart battery is the control module cache battery. They are the same.
Regards,
Don
sgt21ct
7 Posts
0
May 5th, 2021 02:00
Nice, thanks.
So, with the logs we can't conclude anything about the health of the new battery, right?
The 24-hour period has passed and nothing changed, so we will look for another battery.
Thanks very much!