ECS: OBS: xDoctor: RAP073: Switch Connection Failure Detected
Summary: This knowledge base article explains how to handle the alert "Switch Connection Failure detected".
Symptoms
Timestamp = 2019-03-18_084014
Category = Environment
Source = Switch
Severity = ERROR
Message = Switch Connection Failure detected
Extra = {'turtle.rack': {'Traceback (most recent call last):\n File "
RAP = /usr/local/xdoctor/pacemaker/switch.py"
Solution = PASSWORD'
Pacemaker switch.py reports:
# sudo /usr/local/xdoctor/pacemaker/switch.py
<node data="True" name="169.254.1.1">
<switch name="turtle.rack">
<error msg="Traceback (most recent call last):
File "/usr/local/xdoctor/pacemaker/switch.py", line 265, in collect
swt = xSwitch.switch(ip, user, password=base64.b64decode(self.env['PASSWORD']), sshKey=None, sshOption=sshOption, verbose=self.debug, askPass=True, timeout=60, debug=self.debug)
File "/usr/local/xdoctor/lib/xdoctor/switch.py", line 103, in __init__
self.open()
File "/usr/local/xdoctor/lib/xdoctor/switch.py", line 151, in open
raise Exception("invalid password max retries")
Exception: invalid password max retries;"/>
</switch>
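The traceback shows the stored PASSWORD value being passed through base64.b64decode before it is used for the switch login. The sketch below only illustrates that encode/decode round trip with the Python standard library. The password is a made-up placeholder, and how xDoctor actually writes the setting is not shown in this article, so treat the encoding step as an assumption. Base64 is an encoding, not encryption.

```python
import base64

# Hypothetical placeholder value; not a real switch password.
plain_password = "example-password"

# Assumption: the setting is kept as base64 text (inferred from the decode call in the traceback).
stored_value = base64.b64encode(plain_password.encode("utf-8")).decode("ascii")

# Mirrors the base64.b64decode(...) step that switch.py performs before logging in.
decoded_password = base64.b64decode(stored_value).decode("utf-8")

print(stored_value)
print(decoded_password == plain_password)
```

If the decoded value does not match the password configured on the switch, the login is rejected, which is consistent with the "invalid password max retries" exception above.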
Cause
Resolution
Connection failure: Automation NEW!
IMPORTANT! A new feature was released in xDoctor 4.8-104.0 and later. This knowledge base article (KB) is now automated with the service console so that switch connections can be checked quickly. The script detects password problems, customer-managed switches, and configuration issues in xDoctor, and it fixes known SSH key problems caused by switch replacements. It provides a detailed summary of the results, including whether any SSH key problems were corrected. The automation works on both Gen 2 and Gen 3 switches for detection and SSH key resolution.
For details on setting up the automations, see: ECS: ObjectScale: Running KB Automation Scripts (Autopilot)
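Because the automation depends on a minimum xDoctor release, a quick version comparison can help before running it. The following is a minimal sketch, assuming version strings follow the pattern seen in the log line "xDoctor_4.8-100.0" in the configuration output of this article. Reading the minimum release as 4.8-104.0 is an assumption based on that format, and the function names are hypothetical.

```python
import re

# Assumed spelling of the minimum release, based on the "xDoctor_4.8-100.0" log format.
MINIMUM_VERSION = "4.8-104.0"


def parse_xdoctor_version(text):
    """Return the version as a tuple of ints, e.g. 'xDoctor_4.8-100.0' -> (4, 8, 100, 0)."""
    match = re.search(r"(\d+)\.(\d+)-(\d+)\.(\d+)", text)
    if match is None:
        raise ValueError(f"no version found in {text!r}")
    return tuple(int(part) for part in match.groups())


def supports_autopilot(version_text, minimum=MINIMUM_VERSION):
    """True when the given version is at least the assumed minimum release."""
    return parse_xdoctor_version(version_text) >= parse_xdoctor_version(minimum)


print(supports_autopilot("xDoctor_4.8-100.0"))  # False
print(supports_autopilot("xDoctor_4.8-104.0"))  # True
```

Comparing integer tuples avoids the pitfall of plain string comparison, where "4.8-99.0" would sort after "4.8-104.0".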
To find the master node of the rack:
Command:
ssh master.rack
To find the NAN IP, use the IP address shown in the alert or in the getrackinfo output:
Command:
admin@ecsnode1:~> getrackinfo
Node private    Node          Public                            BMC
Ip Address      Id     Status Mac               Ip Address      Mac               Ip Address      Private.4(NAN)  Node Name
=============== ====== ====== ================= =============== ================= =============== =============== =========
192.168.219.1   1      MA     00:00:00:00:00    0.0.0.0         00:00:00:00:00    192.168.219.101 169.254.1.1     provo-red
192.168.219.2   2      SA     00:00:00:00:00    0.0.0.0         00:00:00:00:00    192.168.219.102 169.254.1.2     sandy-red
192.168.219.3   3      SA     00:00:00:00:00    0.0.0.0         00:00:00:00:00    192.168.219.103 169.254.1.3     orem-red
192.168.219.4   4      SA     00:00:00:00:00    0.0.0.0         00:00:00:00:00    192.168.219.104 169.254.1.4     ogden-red
192.168.219.5   5      SA     00:00:00:00:00    0.0.0.0         00:00:00:00:00    192.168.219.105 169.254.1.5     layton-red
192.168.219.6   6      SA     00:00:00:00:00    0.0.0.0         00:00:00:00:00    192.168.219.106 169.254.1.6     logan-red
192.168.219.7   7      SA     00:00:00:00:00    0.0.0.0         00:00:00:00:00    192.168.219.107 169.254.1.7     lehi-red
192.168.219.8   8      SA     00:00:00:00:00    0.0.0.0         00:00:00:00:00    192.168.219.108 169.254.1.8     murray-red
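The lookup described above can also be done programmatically. The sketch below parses a saved copy of the getrackinfo output and returns the row whose status is MA. It assumes that MA marks the rack master node and that each data row has the nine whitespace-separated fields shown in the sample; the function and key names are hypothetical.

```python
SAMPLE = """\
Ip Address Id Status Mac Ip Address Mac Ip Address Private.4(NAN) Node Name
=============== ====== ====== ================= =============== ================= =============== =============== =========
192.168.219.1 1 MA 00:00:00:00:00 0.0.0.0 00:00:00:00:00 192.168.219.101 169.254.1.1 provo-red
192.168.219.2 2 SA 00:00:00:00:00 0.0.0.0 00:00:00:00:00 192.168.219.102 169.254.1.2 sandy-red
"""


def parse_rack_rows(text):
    """Turn getrackinfo data rows into dictionaries, skipping header and separator lines."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        # Data rows have nine fields and start with a dotted IPv4 address.
        if len(fields) != 9 or fields[0].count(".") != 3:
            continue
        rows.append({
            "private_ip": fields[0],
            "node_id": int(fields[1]),
            "status": fields[2],
            "nan_ip": fields[7],
            "name": fields[8],
        })
    return rows


def find_master(rows):
    """Return the first row with status MA (assumed to be the master), or None."""
    for row in rows:
        if row["status"] == "MA":
            return row
    return None


master = find_master(parse_rack_rows(SAMPLE))
print(master["name"], master["nan_ip"])  # provo-red 169.254.1.1
```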
- Run the automation command from the master node with xDoctor 4.8-104.0 or later.
Note:
--target-rack is supported for this action.
# sudo xdoctor autopilot --kb 39838 --target-rack <rack_colour>
admin@ecsnode1:~> sudo xdoctor autopilot --kb 39838 --target-rack red
Checking for existing screen sessions...
Starting screen session 'autopilot_kb_39838_20250626_112318'...
Screen session 'autopilot_kb_39838_20250626_112318' started successfully.
Attaching to screen session 'autopilot_kb_39838_20250626_112318'...
Using /etc/ansible/ansible.cfg as config file
VERSION: 3.0
Playbook tasks: 47
Role tasks: 97
Total tasks: 144 across 1 host(s)
PLAY [red] ******************************************************************************************************************************************************************
Detected 8 hosts for this play.
TASK [target_check : set_fact] **********************************************************************************************************************************************
ok: [169.254.1.1 -> localhost] => {"ansible_facts": {"allowed_targets": "Please use: --target-rack", "target_node_check": false, "target_rack_check": true, "target_vdc_check": false}, "changed": false}
TASK [target_check : context] ***********************************************************************************************************************************************
skipping: [169.254.1.1] => {"changed": false, "false_condition": "node_script == false and target_node_check == true or rack_script == false and target_rack_check == true or vdc_script == false and target_vdc_check == true", "skip_reason": "Conditional result was False"}
...truncated
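The command line used above can be assembled from the rack colour. The sketch below is an illustration only: it uses just the options shown in this article (--kb and --target-rack) and assumes the rack colour is the suffix of the node name, as in "provo-red". That naming convention is an assumption drawn from the sample output, and the helper names are hypothetical.

```python
import shlex

KB_NUMBER = "39838"  # KB number used in the command shown in this article


def rack_colour_from_node_name(node_name):
    """Assumption: node names end in '-<colour>', as in 'provo-red'."""
    _, separator, colour = node_name.rpartition("-")
    if not separator or not colour.isalpha():
        raise ValueError(f"cannot derive a rack colour from {node_name!r}")
    return colour


def autopilot_command(rack_colour):
    """Build the argument list for the autopilot call shown above."""
    return ["sudo", "xdoctor", "autopilot", "--kb", KB_NUMBER,
            "--target-rack", rack_colour]


command = autopilot_command(rack_colour_from_node_name("provo-red"))
print(shlex.join(command))  # sudo xdoctor autopilot --kb 39838 --target-rack red
```

Keeping the command as an argument list rather than a single string avoids shell quoting problems if it is later passed to a process launcher.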
- Summary of the validation:
Example:
TASK [Print all summaries] **************************************************************************************************************************************************
ok: [169.254.1.1] => {
"msg": [
"*******************************************************************************",
"Switch xDoctor 'RAP073' password and SSH summary:",
"*******************************************************************************",
"Validated Frontend switch(es): FAIL: The passwords for the Dell managed switch(es) are incorrect and need to be configured in the xDoctor settings according to KB 39838.",
"Validated Backend switch(es): FAIL: The passwords for the Dell managed switch(es) are incorrect and need to be configured in the xDoctor settings according to KB 39838.",
"Validated Backend management connections: PASS: Management connections are up and connected to the frontend switches.",
"*******************************************************************************",
"Validated ssh keys to switch(es): PASS: All ssh keys are valid and nothing was corrected.",
"Validated xDoctor alert: PASS: Alert RAP073 was not present in xDoctor.",
"*******************************************************************************"
]
}
TASK [Set fact for context] *************************************************************************************************************************************************
ok: [169.254.1.1 -> localhost] => {"ansible_facts": {"context": " Validated Frontend switch(es): FAIL: The passwords for the Dell managed switch(es) are incorrect and need to be configured in the xDoctor settings according to KB 39838., Validated Backend switch(es): FAIL: The passwords for the Dell managed switch(es) are incorrect and need to be configured in the xDoctor settings according to KB 39838."}, "changed": false}
TASK [Fail if validation fails] *********************************************************************************************************************************************
fatal: [169.254.1.1]: FAILED! => {"changed": false, "msg": "Review the summary above for recommendations."}
NO MORE HOSTS LEFT **********************************************************************************************************************************************************
PLAY RECAP ******************************************************************************************************************************************************************
169.254.1.1 : ok=65 changed=13 unreachable=0 failed=1 skipped=73 rescued=0 ignored=1
169.254.1.2 : ok=4 changed=0 unreachable=0 failed=0 skipped=1 rescued=0 ignored=0
169.254.1.3 : ok=4 changed=0 unreachable=0 failed=0 skipped=1 rescued=0 ignored=0
169.254.1.4 : ok=4 changed=0 unreachable=0 failed=0 skipped=1 rescued=0 ignored=0
169.254.1.5 : ok=4 changed=0 unreachable=0 failed=0 skipped=1 rescued=0 ignored=0
169.254.1.6 : ok=4 changed=0 unreachable=0 failed=0 skipped=1 rescued=0 ignored=0
169.254.1.7 : ok=4 changed=0 unreachable=0 failed=0 skipped=1 rescued=0 ignored=0
169.254.1.8 : ok=4 changed=0 unreachable=0 failed=0 skipped=1 rescued=0 ignored=0
=============================================================================================================================================================================
Status: FAIL
Time Elapsed: 0h 1m 25s
Debug log: /tmp/autopilot/log/autopilot_39838_20250626_113201.log
Message: Validated Frontend switch(es): FAIL: The passwords for the Dell managed switch(es) are incorrect and need to be configured in the xDoctor settings according to KB 39838., Validated Backend switch(es): FAIL: The passwords for the Dell managed switch(es) are incorrect and need to be configured in the xDoctor settings according to KB 39838.
=============================================================================================================================================================================
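When the summary is captured to a file, the PASS and FAIL results can be extracted per check. The following is a minimal sketch that assumes every summary line has the form "Validated <check>: PASS|FAIL: <detail>", as in the example above. The sample lines are copied from that output, and the function names are hypothetical.

```python
import re

SUMMARY_LINES = [
    "Validated Frontend switch(es): FAIL: The passwords for the Dell managed "
    "switch(es) are incorrect and need to be configured in the xDoctor settings "
    "according to KB 39838.",
    "Validated Backend management connections: PASS: Management connections are "
    "up and connected to the frontend switches.",
    "Validated ssh keys to switch(es): PASS: All ssh keys are valid and nothing "
    "was corrected.",
]

# Check name, then PASS or FAIL, then the free-text detail.
PATTERN = re.compile(r"^Validated (?P<check>.+?): (?P<result>PASS|FAIL): (?P<detail>.*)$")


def parse_summary(lines):
    """Map each check name to a (result, detail) tuple; lines that do not match are ignored."""
    results = {}
    for line in lines:
        match = PATTERN.match(line.strip())
        if match:
            results[match.group("check")] = (match.group("result"), match.group("detail"))
    return results


def failed_checks(results):
    """Return the names of all checks that reported FAIL."""
    return [check for check, (result, _) in results.items() if result == "FAIL"]


print(failed_checks(parse_summary(SUMMARY_LINES)))  # ['Frontend switch(es)']
```

A FAIL on the frontend or backend switch passwords, as in this example, points to the password update step that follows.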
- Update the switch password in the xDoctor settings:
admin@ecsnode7:~> sudo xdoctor -c --expert
xDoctor Configuration Menu
--------------------------
[Expert Mode Active]
(1) Overview
(2) Scheduling
(3) Archiving
(5) Repository
(9) Miscellaneous
(0) Exit
Please make a choice: 9
xDoctor Miscellaneous
---------------------
(3) Switches
(4) Remove Hardware Alerting Timestamp
(0) Main menu
Please make a choice: 3
xDoctor Switch Settings
---------------------
Enable Switch Analysis? [Yes]:
Switches [hare,rabbit,fox,hound]:
Username [admin]:
Password [*****]:
[New Switch Settings]
Enabled = Yes
Switches = hare,rabbit,fox,hound
Username = admin
Password = *****
> Issue new settings? [No]: yes
2024-11-20 16:03:53,702: xDoctor_4.8-100.0 - INFO : Settings saved and distributed ...
xDoctor Miscellaneous
---------------------
(3) Switches
(4) Remove Hardware Alerting Timestamp
(0) Main menu
Base KB covered by this automation:
ECS: xDoctor: RAP073: Switch Connection Failure detected
Additional KB consolidated into this automation:
ECS: xDoctor reports a switch connection failure due to the RSA key in known_hosts