Avamar Gen4S Hardware: Learn Cycle Timeout
Summary: The battery learn cycle fails on Avamar Gen4S hardware.
Symptoms
The following error is displayed in the Avamar user interface or in the logs:
<MRMON154> Controller ID: 0 Battery relearn timed out
Cause
This is being analyzed by the vendor.
Resolution
1. Log in to the Avamar Server as admin, elevate to root, and load the SSH keys.
For instructions on loading keys, see Avamar: How to Log in to an Avamar Server and Load Various Keys.
2. Using the event information from the user interface or from the Dial Home Service Request:
a. Determine the node that generated the error message.
b. Connect to that node as root:
ssn 0.# --user=root
(Where 0.# is the physical node number)
3. Extract /var/log/messages using the applicable command below:
bunzip2 /var/log/messages*
gunzip /var/log/messages*
xz --decompress /var/log/messages*
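Which of the three commands applies depends on how the rotated logs were compressed. The following is a minimal sketch of that choice, assuming the compression can be inferred from the file suffix; the helper name and the example file names are hypothetical and not part of Avamar.

```python
from pathlib import Path

# Suffix-to-command mapping, taken from the three commands listed in this step.
DECOMPRESSORS = {
    ".bz2": "bunzip2",
    ".gz": "gunzip",
    ".xz": "xz --decompress",
}

def decompress_command(path):
    """Return the command line for a compressed log, or None if the
    suffix is not one of the three handled here (hypothetical helper)."""
    tool = DECOMPRESSORS.get(Path(path).suffix)
    return f"{tool} {path}" if tool else None

if __name__ == "__main__":
    # Example file names are assumptions for illustration only.
    for name in ("/var/log/messages-1.bz2", "/var/log/messages"):
        print(name, "->", decompress_command(name))
```

The sketch only builds the command string; it does not run anything on the node.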
4. Search the messages log (/var/log/messages) for battery relearn events:
grep -i "battery relearn" /var/log/messages
Jul 29 13:37:12 AVATPCKVS41N05 MR_MONITOR[8625]: <MRMON157> Controller ID: 0 Battery relearn will start in 4 days
Jul 31 13:37:48 AVATPCKVS41N05 MR_MONITOR[8625]: <MRMON158> Controller ID: 0 Battery relearn will start in 2 days
Aug 1 13:37:33 AVATPCKVS41N05 MR_MONITOR[8625]: <MRMON159> Controller ID: 0 Battery relearn will start in 1 day
Aug 2 08:37:13 AVATPCKVS41N05 MR_MONITOR[8625]: <MRMON160> Controller ID: 0 Battery relearn will start in 5 hours
Aug 2 13:38:24 AVATPCKVS41N05 MR_MONITOR[8625]: <MRMON155> Controller ID: 0 Battery relearn pending: Battery is under charge
Aug 2 13:39:28 AVATPCKVS41N05 MR_MONITOR[8625]: <MRMON151> Controller ID: 0 Battery relearn started
Aug 2 13:40:36 AVATPCKVS41N05 MR_MONITOR[8625]: <MRMON152> Controller ID: 0 Battery relearn in progress
Aug 2 13:40:36 AVATPCKVS41N05 MR_MONITOR[8625]: <MRMON153> Controller ID: 0 Battery relearn completed
Aug 13 16:32:15 AVATPCKVS41N05 MR_MONITOR[8625]: <MRMON155> Controller ID: 0 Battery relearn pending: Battery is under charge
Aug 13 16:44:10 AVATPCKVS41N05 MR_MONITOR[8625]: <MRMON151> Controller ID: 0 Battery relearn started
Aug 13 16:45:15 AVATPCKVS41N05 MR_MONITOR[8625]: <MRMON152> Controller ID: 0 Battery relearn in progress
Aug 13 16:48:30 AVATPCKVS41N05 MR_MONITOR[8625]: <MRMON154> Controller ID: 0 Battery relearn timed out
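When the grep output is long, the relearn events can be filtered programmatically. The sketch below assumes the MR_MONITOR line layout seen in the output above (timestamp, host name, optional <MRMON...> code, controller ID, event text); the function names are illustrative and not part of any Avamar tool.

```python
import re

# Pattern modelled on the MR_MONITOR lines shown in this article.
# The <MRMON...> code is optional because some log lines omit it.
EVENT_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+\d{2}:\d{2}:\d{2})\s+\S+\s+MR_MONITOR\[\d+\]:\s+"
    r"(?:<(?P<code>MRMON\d+)>\s+)?Controller ID:\s+(?P<ctrl>\d+)\s+"
    r"Battery relearn (?P<event>.+)$"
)

def relearn_events(lines):
    """Return (timestamp, controller, event) tuples for relearn lines."""
    events = []
    for line in lines:
        match = EVENT_RE.match(line.strip())
        if match:
            events.append((match["stamp"], match["ctrl"], match["event"]))
    return events

def timed_out(lines):
    """Return only the relearn events whose text is 'timed out'."""
    return [event for event in relearn_events(lines) if event[2] == "timed out"]

if __name__ == "__main__":
    with open("/var/log/messages", errors="replace") as log:
        for stamp, ctrl, event in timed_out(log):
            print(f"{stamp} controller {ctrl}: relearn {event}")
```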
5. Use CmdTool2 to confirm that the learn cycle failed but that the battery does not report 0 volts:
CmdTool2 -AdpBbuCmd -GetBbuStatus -a0
BBU status for Adapter: 0
BatteryType: CVPM02
Voltage: 9563 mV
Current: 0 mA
Temperature: 30 C
BBU Firmware Status:
Charging Status : None
Voltage : OK
Temperature : OK
Learn Cycle Requested : No
Learn Cycle Active : No
Learn Cycle Status : Failed
Learn Cycle Timeout : Yes
I2c Errors Detected : No
Battery Pack Missing : No
Battery Replacement required : No
Remaining Capacity Low : No
Periodic Learn Required : No
Transparent Learn : No
No space to cache offload : No
Pack is about to fail & should be replaced : No
Cache Offload premium feature required : No
Module microcode update required : No
GasGuageStatus:
Fully Discharged : No
FullyCharged : Yes
Discharging : Yes
Initialized : No
Remaining Time Alarm : No
Remaining Capacity Alarm: No
Discharge Terminated : No
OverTemperature : No
Charging Terminated : Yes
Over Charged : No
Pack energy : 96 J
Capacitance : 100
Remaining reserve space : 93
Exit Code: 0x00
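The check in this step comes down to three fields of the output: Learn Cycle Status, Learn Cycle Timeout, and the measured voltage. The following sketch assumes the "name : value" layout shown above and reads the captured output as text; the function names are hypothetical.

```python
def parse_bbu_status(text):
    """Parse 'name : value' lines from GetBbuStatus output into a dict.

    'Voltage' appears twice (the reading in mV, then the firmware flag).
    setdefault keeps the first occurrence, which is the mV reading.
    """
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep and value.strip():
            fields.setdefault(key.strip(), value.strip())
    return fields

def learn_failed_with_voltage(fields):
    """True when the learn cycle failed with a timeout and the battery
    still reports a non-zero voltage, as described in this step."""
    millivolts = int(fields["Voltage"].split()[0])
    return (
        fields.get("Learn Cycle Status") == "Failed"
        and fields.get("Learn Cycle Timeout") == "Yes"
        and millivolts > 0
    )
```

For the output shown above, `learn_failed_with_voltage(parse_bbu_status(output))` would be true, since the voltage reads 9563 mV while the learn cycle is reported as failed with a timeout.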
6. Start a manual learn cycle:
sudo CmdTool2 -AdpBbuCmd -BbuLearn -a0
7. Review the messages log to see whether the learn cycle started and finished.
Example 1:
Aug 26 12:15:01 AVAMAR-GRID-VAR-LOG-MESSAGE syslog-ng[3170]: Configuration reload request received, reloading configuration;
Aug 26 12:15:01 AVAMAR-GRID-VAR-LOG-MESSAGE syslog-ng[3170]: New configuration initialized;
Aug 26 12:15:11 AVAMAR-GRID-VAR-LOG-MESSAGE sudo: admin : TTY=pts/0 ; PWD=/data01/home/admin ; USER=root ; COMMAND=/opt/MegaRAID/CmdTool2/CmdTool2 -pdlist -a0 -nolog
Aug 26 12:16:10 AVAMAR-GRID-VAR-LOG-MESSAGE sudo: admin : TTY=pts/0 ; PWD=/data01/home/admin ; USER=root ; COMMAND=/opt/MegaRAID/CmdTool2/CmdTool2 -AdpBbuCmd -GetBbuStatus -a0
Aug 26 12:18:28 AVAMAR-GRID-VAR-LOG-MESSAGE sudo: admin : TTY=pts/0 ; PWD=/data01/home/admin ; USER=root ; COMMAND=/opt/MegaRAID/CmdTool2/CmdTool2 -AdpBbuCmd -BbuLearn -a0
Aug 26 12:18:31 AVAMAR-GRID-VAR-LOG-MESSAGE MR_MONITOR[5742]: Controller ID: 0 Battery relearn pending: Battery is under charge
Aug 26 12:19:36 AVAMAR-GRID-VAR-LOG-MESSAGE MR_MONITOR[5742]: Controller ID: 0 Battery relearn started
Aug 26 12:20:02 AVAMAR-GRID-VAR-LOG-MESSAGE sudo: admin : TTY=pts/0 ; PWD=/data01/home/admin ; USER=root ; COMMAND=/opt/MegaRAID/CmdTool2/CmdTool2 -AdpBbuCmd -GetBbuStatus -a0
Aug 26 12:20:44 AVAMAR-GRID-VAR-LOG-MESSAGE MR_MONITOR[5742]: Controller ID: 0 Battery relearn in progress
Aug 26 12:20:44 AVAMAR-GRID-VAR-LOG-MESSAGE MR_MONITOR[5742]: Controller ID: 0 Battery relearn completed
Aug 26 12:22:25 AVAMAR-GRID-VAR-LOG-MESSAGE sudo: admin : TTY=pts/0 ; PWD=/data01/home/admin ; USER=root ; COMMAND=/opt/MegaRAID/CmdTool2/CmdTool2 -AdpBbuCmd -GetBbuStatus -a0
Although the relearn appears to have completed, the time it took to complete is minimal (which is not normal).
Example 2:
Aug 26 01:30:23 AVATPCKVS41N05 sudo: root : TTY=pts/0 ; PWD=/root ; USER=root ; COMMAND=/opt/MegaRAID/CmdTool2/CmdTool2 -AdpBbuCmd -BbuLearn -a0
Aug 26 01:31:12 AVATPCKVS41N05 MR_MONITOR[8625]: <MRMON155> Controller ID: 0 Battery relearn pending: Battery is under charge
Aug 26 16:44:10 AVATPCKVS41N05 MR_MONITOR[8625]: <MRMON151> Controller ID: 0 Battery relearn started
Aug 26 16:45:15 AVATPCKVS41N05 MR_MONITOR[8625]: <MRMON152> Controller ID: 0 Battery relearn in progress
Aug 26 16:48:30 AVATPCKVS41N05 MR_MONITOR[8625]: <MRMON154> Controller ID: 0 Battery relearn timed out
Here, the relearn timed out.
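The elapsed time between two relearn events can be checked by subtracting their syslog timestamps. The sketch below assumes both events fall in the same year, because syslog timestamps in this format carry no year; the `year` parameter and the function name are illustrative.

```python
from datetime import datetime

def elapsed_seconds(start_stamp, end_stamp, year=2000):
    """Seconds between two syslog timestamps such as 'Aug 26 12:19:36'.

    The year is an assumption supplied by the caller (default is arbitrary),
    since the log lines do not include one.
    """
    fmt = "%Y %b %d %H:%M:%S"
    start = datetime.strptime(f"{year} {start_stamp}", fmt)
    end = datetime.strptime(f"{year} {end_stamp}", fmt)
    return (end - start).total_seconds()

if __name__ == "__main__":
    # Example 1: "relearn started" to "relearn completed"
    print(elapsed_seconds("Aug 26 12:19:36", "Aug 26 12:20:44"))
    # Example 2: "relearn started" to "relearn timed out"
    print(elapsed_seconds("Aug 26 16:44:10", "Aug 26 16:48:30"))
```

In Example 1 the relearn is reported as completed 68 seconds after it started, with "in progress" and "completed" logged at the same second. In Example 2 the timeout is logged 260 seconds after the start.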
8. Check the battery status again:
CmdTool2 -AdpBbuCmd -GetBbuStatus -a0
If the status is OK and there is no timeout (as shown below), no further action is required.
BBU status for Adapter: 0
BatteryType: CVPM02
Voltage: 9563 mV
Current: 0 mA
Temperature: 30 C
BBU Firmware Status:
Charging Status : None
Voltage : OK
Temperature : OK
Learn Cycle Requested : No
Learn Cycle Active : No
Learn Cycle Status : OK
Learn Cycle Timeout : No
I2c Errors Detected : No
...
If the problem persists as shown below, open a service request and provide the output above to determine whether a node replacement is required.
BBU status for Adapter: 0
BatteryType: CVPM02
Voltage: 9563 mV
Current: 0 mA
Temperature: 30 C
BBU Firmware Status:
Charging Status : None
Voltage : OK
Temperature : OK
Learn Cycle Requested : No
Learn Cycle Active : No
Learn Cycle Status : Failed
Learn Cycle Timeout : Yes
I2c Errors Detected : No
...
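The decision in step 8 can be summarized as a small function over the two Learn Cycle fields. This is a sketch only, assuming the output layout shown in this article; the function name and the returned strings are hypothetical, and any other combination of values is reported as undetermined rather than guessed.

```python
import re

def next_action(status_text):
    """Map GetBbuStatus output to the outcome described in step 8."""
    def field(name):
        # Read the first word after 'name :' on its own line.
        match = re.search(
            rf"^\s*{re.escape(name)}\s*:\s*(\S+)", status_text, re.MULTILINE
        )
        return match.group(1) if match else None

    status = field("Learn Cycle Status")
    timeout = field("Learn Cycle Timeout")
    if status == "OK" and timeout == "No":
        return "no further action"
    if status == "Failed" or timeout == "Yes":
        return "open a service request"
    return "undetermined"
```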