ECS: OBS: xDoctor: RAP073/208: Switch Connection Failure Detected
Summary: This knowledge base article describes how to handle the Switch Connection Failure Detected alert.
Symptoms
Starting with ECS xDoctor v4.8-109.0 and ObjectScale xDoctor v5.1-109.0, RAP208 (Switch Connection Failure Detected) is implemented as an auto-healer. When switch connectivity issues exceed the configured threshold for error or critical severity, xDoctor generates a RAP208 alert and automatically starts its built-in remediation orchestration workflow. This workflow performs the required remediation actions, provided that xDoctor auto-healers are enabled.
Note: If your environment runs an xDoctor version earlier than ECS xDoctor v4.8-109.0 or ObjectScale xDoctor v5.1-109.0, the RAP208 auto-healing feature is not available. In those versions, remediation must be performed using the Auto Pilot process described below or by following the manual remediation steps described in the Resolution section.
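The version requirement above can be checked with a small helper. The following is a minimal sketch, not part of xDoctor: the function names and the `ecs`/`objectscale` product keys are illustrative assumptions, and it assumes that versions in the `4.8-109.0` format can be compared component by component as integers.

```python
import re

# Minimum versions taken from this article; the product keys are illustrative labels.
MIN_AUTOHEAL_VERSION = {
    "ecs": (4, 8, 109, 0),
    "objectscale": (5, 1, 109, 0),
}


def parse_xdoctor_version(text):
    """Turn a string such as 'v4.8-109.0' into a tuple of integers."""
    match = re.fullmatch(r"v?(\d+)\.(\d+)-(\d+)\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized xDoctor version: {text!r}")
    return tuple(int(part) for part in match.groups())


def supports_rap208_autoheal(product, version_text):
    """Return True if the version is at or above the minimum for the product."""
    minimum = MIN_AUTOHEAL_VERSION[product.lower()]
    return parse_xdoctor_version(version_text) >= minimum


print(supports_rap208_autoheal("ecs", "4.8-109.0"))
print(supports_rap208_autoheal("ecs", "4.8-100.0"))
```

The version string can be read from the `PSNT = ... @ 4.8-109.0` line of any xDoctor alert.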
Alert that triggers RAP208 auto-healing
The RAP208 auto-healing workflow is triggered when switch connection failures exceed the configured threshold for error or critical severity. When this threshold is exceeded, xDoctor generates a RAP208 alert, which acts as the trigger for the automated remediation process.
Example alert output
Note: In xDoctor versions earlier than ECS xDoctor v4.8-109.0 and ObjectScale xDoctor v5.1-109.0, this condition results in an alert only. No automatic remediation is performed.
--------------------------------------------------------
INFO - Auto Healer for dell_switch_connectivity disabled
--------------------------------------------------------
Extra = Auto Healer for dell_switch_connectivity disabled
Timestamp = 2026-04-01_180132
PSNT = CKMXXXXXXXXXXX @ 4.8-109.0
----------------------------------------------------
ERROR - (Cached) Switch Connection Failure detected.
----------------------------------------------------
Node = 169.254.1.1
Extra = {"169.254.1.1": ["hare"]}
RAP = RAP208
Solution = KB 39838
Timestamp = 2026-04-01_180132
PSNT = CKMXXXXXXXXXXX @ 4.8-109.0
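When many alerts need to be triaged, the report text can be turned into structured records. The sketch below is an illustration only and not an xDoctor interface: it assumes the layout shown in the example above (a `SEVERITY - message` header between dashed rules, followed by `Key = Value` lines), and the function names are hypothetical.

```python
import json
import re

# Assumed layout: a "SEVERITY - message" header between dashed rules,
# followed by "Key = Value" lines, as in the example alert above.
HEADER = re.compile(r"^([A-Z]+) - (.+)$")
FIELD = re.compile(r"^(\w+)\s*=\s*(.*)$")


def parse_events(report_text):
    """Split report text into events with a severity, message, and fields."""
    events = []
    for raw_line in report_text.splitlines():
        line = raw_line.strip()
        if not line or set(line) == {"-"}:
            continue  # skip blank lines and dashed rules
        header = HEADER.match(line)
        if header:
            events.append({"severity": header.group(1),
                           "message": header.group(2),
                           "fields": {}})
            continue
        field = FIELD.match(line)
        if field and events:
            events[-1]["fields"][field.group(1)] = field.group(2)
    return events


def affected_switches(event):
    """Return the node-to-switch mapping from Extra, or {} if it is not JSON."""
    try:
        extra = json.loads(event["fields"].get("Extra", ""))
    except json.JSONDecodeError:
        return {}
    return extra if isinstance(extra, dict) else {}


SAMPLE = """\
--------------------------------------------------------
INFO - Auto Healer for dell_switch_connectivity disabled
--------------------------------------------------------
Extra = Auto Healer for dell_switch_connectivity disabled
Timestamp = 2026-04-01_180132
PSNT = CKMXXXXXXXXXXX @ 4.8-109.0
----------------------------------------------------
ERROR - (Cached) Switch Connection Failure detected.
----------------------------------------------------
Node = 169.254.1.1
Extra = {"169.254.1.1": ["hare"]}
RAP = RAP208
Solution = KB 39838
Timestamp = 2026-04-01_180132
PSNT = CKMXXXXXXXXXXX @ 4.8-109.0
"""

events = parse_events(SAMPLE)
for event in events:
    print(event["severity"], event["fields"].get("RAP"), affected_switches(event))
```

For the RAP208 event, the `Extra` field identifies which switch (here `hare`) could not be reached from which node.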
Auto Healer remediation (example)
When auto-healers are enabled, xDoctor automatically starts remediation of detected switch connectivity issues by applying the common corrective actions described in this knowledge base article.
--------------------------------------------------------
FIXED - Auto Healer fixed Dell switch connectivity issue
--------------------------------------------------------
Node = Nodes
Extra = {"Nodes": ["169.254.1.1"]}
Timestamp = 2026-04-01_180344
PSNT = CKMXXXXXXXXXXX @ 4.8-109.0
Auto Healer requirements
The xDoctor auto-healer feature must be enabled for this remediation to take place. Auto-healers can be enabled either during installation or after installation by following the steps described in:
KB: ECS: xDoctor: How to Enable xDoctor Auto Healer After Tool Installation
Cause
After a switch replacement, the SSH host keys used to authenticate to the switch may change, or the management interface that connects to the switch may be administratively shut down. In some cases, the password configured in xDoctor does not match the current password on the affected switch and must be updated accordingly.
The xDoctor automation and auto-healer workflows do not repair switch passwords. Instead, xDoctor detects authentication-related failures, raises the appropriate alert, and directs the user to the relevant knowledge base article that describes how to configure xDoctor to use the password set on the switches.
Resolution
xDoctor Auto Healer: ObjectScale xDoctor v5.1-109.0/ECS xDoctor v4.8-109.0 or later
- To manually trigger the enabled auto-healer, run the following command on the master.rack node. This starts the rack analyzers, which validate and automatically repair the nodes one at a time.
# sudo xdoctor --rap=RAP208
Example:
admin@ecsnode1:~> sudo xdoctor --rap=RAP208
2026-04-01 18:03:45,441: xDoctor_4.8-109.0 - INFO : Initializing xDoctor v4.8-109.0 ...
[... Truncated Output ...]
2026-04-01 18:05:01,725: xDoctor_4.8-109.0 - INFO : ANALYZER [ac_dell_switch_connectivity]
2026-04-01 18:05:02,063: xDoctor_4.8-109.0 - INFO : Autohealing switch_connectivity on node 169.254.1.1 ...
2026-04-01 18:08:57,494: xDoctor_4.8-109.0 - INFO : All data analyzed in 0:03:55
2026-04-01 18:08:58,529: xDoctor_4.8-109.0 - INFO : --------------------
2026-04-01 18:08:58,529: xDoctor_4.8-109.0 - INFO : Diagnosis Summary
2026-04-01 18:08:58,529: xDoctor_4.8-109.0 - INFO : --------------------
2026-04-01 18:08:58,529: xDoctor_4.8-109.0 - INFO : PSNT: CKMXXXXXXXXXXX
2026-04-01 18:08:58,529: xDoctor_4.8-109.0 - INFO : --------------------
2026-04-01 18:08:58,529: xDoctor_4.8-109.0 - INFO : FIXED = 1
2026-04-01 18:08:58,530: xDoctor_4.8-109.0 - INFO : CRITICAL = 0
2026-04-01 18:08:58,530: xDoctor_4.8-109.0 - INFO : CRITICAL (CACHED) = 0
2026-04-01 18:08:58,530: xDoctor_4.8-109.0 - INFO : ERROR = 0
2026-04-01 18:08:58,530: xDoctor_4.8-109.0 - INFO : ERROR (CACHED) = 0
2026-04-01 18:08:58,530: xDoctor_4.8-109.0 - INFO : WARNING = 0
2026-04-01 18:08:58,530: xDoctor_4.8-109.0 - INFO : INFO = 0
2026-04-01 18:08:58,530: xDoctor_4.8-109.0 - INFO : VERBOSE = 0
2026-04-01 18:08:58,531: xDoctor_4.8-109.0 - INFO : REPORT = 0
2026-04-01 18:08:58,646: xDoctor_4.8-109.0 - INFO : ---------------------
2026-04-01 18:08:58,646: xDoctor_4.8-109.0 - INFO : xDoctor Post Features
2026-04-01 18:08:58,646: xDoctor_4.8-109.0 - INFO : ----------------
2026-04-01 18:08:58,646: xDoctor_4.8-109.0 - INFO : Data Combiner
2026-04-01 18:08:58,646: xDoctor_4.8-109.0 - INFO : -------------
2026-04-01 18:08:58,647: xDoctor_4.8-109.0 - INFO : Created a Data Collection Report (data.xml)
2026-04-01 18:08:58,648: xDoctor_4.8-109.0 - INFO : ------
2026-04-01 18:08:58,648: xDoctor_4.8-109.0 - INFO : SysLog
2026-04-01 18:08:58,648: xDoctor_4.8-109.0 - INFO : ------
2026-04-01 18:08:58,648: xDoctor_4.8-109.0 - INFO : Using Fabric as Syslog Server
2026-04-01 18:08:58,648: xDoctor_4.8-109.0 - INFO : Not triggered ... no WARNING, ERROR, nor CRITICAL
2026-04-01 18:08:58,648: xDoctor_4.8-109.0 - INFO : ----
2026-04-01 18:08:58,648: xDoctor_4.8-109.0 - INFO : SNMP
2026-04-01 18:08:58,648: xDoctor_4.8-109.0 - INFO : ----
2026-04-01 18:08:58,649: xDoctor_4.8-109.0 - INFO : Using 10.118.165.48:162 as SNMP server
2026-04-01 18:08:58,649: xDoctor_4.8-109.0 - INFO : Not triggered .. no WARNING, ERROR nor CRITICAL
2026-04-01 18:08:58,649: xDoctor_4.8-109.0 - INFO : ------------
2026-04-01 18:08:58,649: xDoctor_4.8-109.0 - INFO : ProcComplete
2026-04-01 18:08:58,649: xDoctor_4.8-109.0 - INFO : ------------
2026-04-01 18:08:58,649: xDoctor_4.8-109.0 - WARNING : ProcComplete is disabled, please re-enable it (xdoctor --config)
2026-04-01 18:08:58,767: xDoctor_4.8-109.0 - INFO : ----------------
2026-04-01 18:08:58,767: xDoctor_4.8-109.0 - INFO : Session Archiver
2026-04-01 18:08:58,768: xDoctor_4.8-109.0 - INFO : ----------------
2026-04-01 18:08:58,777: xDoctor_4.8-109.0 - INFO : Session Stored in folder - /usr/local/xdoctor/archive/other/2026-04-01_180344
2026-04-01 18:08:58,777: xDoctor_4.8-109.0 - INFO : Session Archived as tar - /usr/local/xdoctor/archive/other/xDoctor-CKMXXXXXXXXXXX-2026-04-01_180344.tgz
2026-04-01 18:08:58,777: xDoctor_4.8-109.0 - INFO : --------------------------
2026-04-01 18:08:58,777: xDoctor_4.8-109.0 - INFO : Session Report - sudo xdoctor --report --archive=2026-04-01_180344
2026-04-01 18:08:58,777: xDoctor_4.8-109.0 - INFO : ---------------
2026-04-01 18:08:58,777: xDoctor_4.8-109.0 - INFO : Session Cleaner
2026-04-01 18:08:58,777: xDoctor_4.8-109.0 - INFO : ---------------
2026-04-01 18:08:58,789: xDoctor_4.8-109.0 - INFO : Removing folder (count limit) - /usr/local/xdoctor/archive/other/2026-04-01_170120
2026-04-01 18:08:58,790: xDoctor_4.8-109.0 - INFO : Removing archive (count limit) - /usr/local/xdoctor/archive/other/xDoctor-CKMXXXXXXXXXXX-2026-04-01_170120.tgz
2026-04-01 18:08:58,793: xDoctor_4.8-109.0 - INFO : Cleaned 2 archived session(s)
2026-04-01 18:08:58,793: xDoctor_4.8-109.0 - INFO : -------
2026-04-01 18:08:58,794: xDoctor_4.8-109.0 - INFO : Emailer
2026-04-01 18:08:58,794: xDoctor_4.8-109.0 - INFO : -------
2026-04-01 18:08:58,794: xDoctor_4.8-109.0 - INFO : Using Dedicated Server (25:25) as SMTP Server ...
2026-04-01 18:08:58,794: xDoctor_4.8-109.0 - INFO : Email Type = Individual Events
2026-04-01 18:08:58,795: xDoctor_4.8-109.0 - INFO : ------------------------------
2026-04-01 18:08:58,795: xDoctor_4.8-109.0 - INFO : xDoctor session_1775066624.943 finished in 0:05:13
2026-04-01 18:08:58,813: xDoctor_4.8-109.0 - INFO : Successful Job:1775066624 Exit Code:192
- Run the session report to review the result of the manual auto-healing run.
# sudo xdoctor --report --archive=<session report>
Example:
admin@ecsnode1:~> sudo xdoctor --report --archive=2026-04-01_180344
xDoctor 4.8-109.0
CKMXXXXXXXXXXX - ECS 3.8.1.4
Displaying xDoctor Report (2026-04-01_180344) Filter:[] ...
--------------------------------------------------------
FIXED - Auto Healer fixed Dell switch connectivity issue
--------------------------------------------------------
Node = Nodes
Extra = {"Nodes": ["169.254.1.1"]}
Timestamp = 2026-04-01_180344
PSNT = CKMXXXXXXXXXXX @ 4.8-109.0
- If a failure occurs, open an SR for investigation.
Example failure:
----------------------------------------------------
ERROR - (Cached) Auto fix failed - Switch Connection Failure detected.
----------------------------------------------------
Node = 169.254.1.1
Extra = {"169.254.1.1": ["hare"]}
RAP = RAP208
Solution = KB 39838
Timestamp = 2026-04-01_180132
PSNT = CKMXXXXXXXXXXX @ 4.8-109.0
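The manual run prints a Diagnosis Summary with one counter per severity. As a rough sketch, those counters can be extracted from a saved copy of the console output to decide whether the run needs follow-up. The function names are hypothetical, the pattern assumes the log line layout shown in the example run, and treating every severity other than FIXED, INFO, VERBOSE, and REPORT as requiring attention is an assumption made for this illustration.

```python
import re

# Matches summary lines such as "... - INFO : CRITICAL (CACHED) = 0".
COUNTER = re.compile(r" - INFO\s*:\s*([A-Z]+(?: \(CACHED\))?) = (\d+)\s*$")


def diagnosis_counts(log_text):
    """Collect the Diagnosis Summary counters from xDoctor console output."""
    counts = {}
    for line in log_text.splitlines():
        match = COUNTER.search(line)
        if match:
            counts[match.group(1)] = int(match.group(2))
    return counts


def needs_follow_up(counts):
    """Assumption: any non-zero counter outside the benign set needs review."""
    benign = {"FIXED", "INFO", "VERBOSE", "REPORT"}
    return any(value > 0 for name, value in counts.items() if name not in benign)


SAMPLE_LOG = """\
2026-04-01 18:08:58,529: xDoctor_4.8-109.0 - INFO : Diagnosis Summary
2026-04-01 18:08:58,529: xDoctor_4.8-109.0 - INFO : PSNT: CKMXXXXXXXXXXX
2026-04-01 18:08:58,529: xDoctor_4.8-109.0 - INFO : FIXED = 1
2026-04-01 18:08:58,530: xDoctor_4.8-109.0 - INFO : CRITICAL = 0
2026-04-01 18:08:58,530: xDoctor_4.8-109.0 - INFO : CRITICAL (CACHED) = 0
2026-04-01 18:08:58,530: xDoctor_4.8-109.0 - INFO : ERROR = 0
2026-04-01 18:08:58,530: xDoctor_4.8-109.0 - INFO : ERROR (CACHED) = 0
2026-04-01 18:08:58,530: xDoctor_4.8-109.0 - INFO : WARNING = 0
2026-04-01 18:08:58,794: xDoctor_4.8-109.0 - INFO : Email Type = Individual Events
"""

counts = diagnosis_counts(SAMPLE_LOG)
print(counts)
print("follow-up needed:", needs_follow_up(counts))
```

A run like the one above, with `FIXED = 1` and every error counter at zero, corresponds to the successful FIXED report entry. An `Auto fix failed` entry in the session report is the case that calls for an SR.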
xDoctor Auto Pilot:
This knowledge base (KB) article is now automated with xDoctor Auto Pilot, which resolves most issues without involving support.
This feature is built into xDoctor 4.8-104.0 and later. For syntax and usage questions, refer to ECS: ObjectScale: How to Run KB Automation Scripts (Auto Pilot).
To find the master node of the rack:
Command:
ssh master.rack
To find the NAN IP address, use the IP address identified in the alert or look it up in the getrackinfo output:
Command:
admin@ecsnode1:~> getrackinfo
Node private Node Public BMC
Ip Address Id Status Mac Ip Address Mac Ip Address Private.4(NAN) Node Name
=============== ====== ====== ================= =============== ================= =============== =============== =========
192.168.219.1 1 MA 00:00:00:00:00 0.0.0.0 00:00:00:00:00 192.168.219.101 169.254.1.1 provo-red
192.168.219.2 2 SA 00:00:00:00:00 0.0.0.0 00:00:00:00:00 192.168.219.102 169.254.1.2 sandy-red
192.168.219.3 3 SA 00:00:00:00:00 0.0.0.0 00:00:00:00:00 192.168.219.103 169.254.1.3 orem-red
192.168.219.4 4 SA 00:00:00:00:00 0.0.0.0 00:00:00:00:00 192.168.219.104 169.254.1.4 ogden-red
192.168.219.5 5 SA 00:00:00:00:00 0.0.0.0 00:00:00:00:00 192.168.219.105 169.254.1.5 layton-red
192.168.219.6 6 SA 00:00:00:00:00 0.0.0.0 00:00:00:00:00 192.168.219.106 169.254.1.6 logan-red
192.168.219.7 7 SA 00:00:00:00:00 0.0.0.0 00:00:00:00:00 192.168.219.107 169.254.1.7 lehi-red
192.168.219.8 8 SA 00:00:00:00:00 0.0.0.0 00:00:00:00:00 192.168.219.108 169.254.1.8 murray-red
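To relate the `Node` value from an alert to a node name, the getrackinfo rows can be parsed by column position. This is a minimal sketch under the assumption that each data row has the nine whitespace-separated columns shown above, with the Private.4 (NAN) address in the eighth column and the node name in the ninth. The function names are hypothetical.

```python
def parse_getrackinfo(output):
    """Return one dict per data row of getrackinfo output."""
    nodes = []
    for line in output.splitlines():
        parts = line.split()
        # Data rows have nine columns and start with an IPv4-style address;
        # header and separator lines do not meet both conditions.
        if len(parts) == 9 and parts[0].count(".") == 3 and parts[1].isdigit():
            nodes.append({
                "private_ip": parts[0],
                "id": int(parts[1]),
                "status": parts[2],
                "nan_ip": parts[7],
                "name": parts[8],
            })
    return nodes


def node_for_nan_ip(nodes, nan_ip):
    """Return the node name for a NAN IP address, or None if it is not listed."""
    for node in nodes:
        if node["nan_ip"] == nan_ip:
            return node["name"]
    return None


SAMPLE = """\
=============== ====== ====== ================= =============== ================= =============== =============== =========
192.168.219.1 1 MA 00:00:00:00:00 0.0.0.0 00:00:00:00:00 192.168.219.101 169.254.1.1 provo-red
192.168.219.2 2 SA 00:00:00:00:00 0.0.0.0 00:00:00:00:00 192.168.219.102 169.254.1.2 sandy-red
192.168.219.3 3 SA 00:00:00:00:00 0.0.0.0 00:00:00:00:00 192.168.219.103 169.254.1.3 orem-red
"""

nodes = parse_getrackinfo(SAMPLE)
print(node_for_nan_ip(nodes, "169.254.1.1"))
```

In the example alert, `Node = 169.254.1.1` therefore refers to `provo-red`.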
- Run the automation command from the master node with xDoctor 4.8-104.0 or later.
Note: --target-rack is supported for this action.
# sudo xdoctor autopilot --kb 39838 --target-rack <rack_colour>
admin@ecsnode1:~> sudo xdoctor autopilot --kb 39838 --target-rack red
Checking for existing screen sessions...
Starting screen session 'autopilot_kb_39838_20250626_112318'...
Screen session 'autopilot_kb_39838_20250626_112318' started successfully.
Attaching to screen session 'autopilot_kb_39838_20250626_112318'...
Using /etc/ansible/ansible.cfg as config file
VERSION: 3.0
Playbook tasks: 47
Role tasks: 97
Total tasks: 144 across 1 host(s)
PLAY [red] ******************************************************************************************************************************************************************
Detected 8 hosts for this play.
TASK [target_check : set_fact] **********************************************************************************************************************************************
ok: [169.254.1.1 -> localhost] => {"ansible_facts": {"allowed_targets": "Please use: --target-rack", "target_node_check": false, "target_rack_check": true, "target_vdc_check": false}, "changed": false}
TASK [target_check : context] ***********************************************************************************************************************************************
skipping: [169.254.1.1] => {"changed": false, "false_condition": "node_script == false and target_node_check == true or rack_script == false and target_rack_check == true or vdc_script == false and target_vdc_check == true", "skip_reason": "Conditional result was False"}
...truncated
- Review the summary:
Example:
TASK [Print all summaries] **************************************************************************************************************************************************
ok: [169.254.1.1] => {
"msg": [
"*******************************************************************************",
"Switch xDoctor 'RAP073' password and SSH summary:",
"*******************************************************************************",
"Validated Frontend switch(es): FAIL: The passwords for the Dell managed switch(es) are incorrect and need to be configured in the xDoctor settings according to KB 39838.",
"Validated Backend switch(es): FAIL: The passwords for the Dell managed switch(es) are incorrect and need to be configured in the xDoctor settings according to KB 39838.",
"Validated Backend management connections: PASS: Management connections are up and connected to the frontend switches.",
"*******************************************************************************",
"Validated ssh keys to switch(es): PASS: All ssh keys are valid and nothing was corrected.",
"Validated xDoctor alert: PASS: Alert RAP073 was not present in xDoctor.",
"*******************************************************************************"
]
}
TASK [Set fact for context] *************************************************************************************************************************************************
ok: [169.254.1.1 -> localhost] => {"ansible_facts": {"context": " Validated Frontend switch(es): FAIL: The passwords for the Dell managed switch(es) are incorrect and need to be configured in the xDoctor settings according to KB 39838., Validated Backend switch(es): FAIL: The passwords for the Dell managed switch(es) are incorrect and need to be configured in the xDoctor settings according to KB 39838."}, "changed": false}
TASK [Fail if validation fails] *********************************************************************************************************************************************
fatal: [169.254.1.1]: FAILED! => {"changed": false, "msg": "Review the summary above for recommendations."}
NO MORE HOSTS LEFT **********************************************************************************************************************************************************
PLAY RECAP ******************************************************************************************************************************************************************
169.254.1.1 : ok=65 changed=13 unreachable=0 failed=1 skipped=73 rescued=0 ignored=1
169.254.1.2 : ok=4 changed=0 unreachable=0 failed=0 skipped=1 rescued=0 ignored=0
169.254.1.3 : ok=4 changed=0 unreachable=0 failed=0 skipped=1 rescued=0 ignored=0
169.254.1.4 : ok=4 changed=0 unreachable=0 failed=0 skipped=1 rescued=0 ignored=0
169.254.1.5 : ok=4 changed=0 unreachable=0 failed=0 skipped=1 rescued=0 ignored=0
169.254.1.6 : ok=4 changed=0 unreachable=0 failed=0 skipped=1 rescued=0 ignored=0
169.254.1.7 : ok=4 changed=0 unreachable=0 failed=0 skipped=1 rescued=0 ignored=0
169.254.1.8 : ok=4 changed=0 unreachable=0 failed=0 skipped=1 rescued=0 ignored=0
=============================================================================================================================================================================
Status: FAIL
Time Elapsed: 0h 1m 25s
Debug log: /tmp/autopilot/log/autopilot_39838_20250626_113201.log
Message: Validated Frontend switch(es): FAIL: The passwords for the Dell managed switch(es) are incorrect and need to be configured in the xDoctor settings according to KB 39838., Validated Backend switch(es): FAIL: The passwords for the Dell managed switch(es) are incorrect and need to be configured in the xDoctor settings according to KB 39838.
=============================================================================================================================================================================
- Update the xDoctor password:
admin@ecsnode7:~> sudo xdoctor -c --expert
xDoctor Configuration Menu
--------------------------
[Expert Mode Active]
(1) Overview
(2) Scheduling
(3) Archiving
(5) Repository
(9) Miscellaneous
(0) Exit
Please make a choice: 9
xDoctor Miscellaneous
---------------------
(3) Switches
(4) Remove Hardware Alerting Timestamp
(0) Main menu
Please make a choice: 3
xDoctor Switch Settings
---------------------
Enable Switch Analysis? [Yes]:
Switches [hare,rabbit,fox,hound]:
Username [admin]:
Password [*****]:
[New Switch Settings]
Enabled = Yes
Switches = hare,rabbit,fox,hound
Username = admin
Password = *****
> Issue new settings? [No]: yes
2024-11-20 16:03:53,702: xDoctor_4.8-100.0 - INFO : Settings saved and distributed ...
xDoctor Miscellaneous
---------------------
(3) Switches
(4) Remove Hardware Alerting Timestamp
(0) Main menu
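After the password has been updated and Auto Pilot is run again, the PLAY RECAP at the end of the Auto Pilot output shows per-host task counters. The sketch below is an illustration rather than part of Auto Pilot: it parses recap lines in the format shown in the example run and lists hosts with a non-zero `failed` or `unreachable` counter. The function names are hypothetical.

```python
import re

# Matches lines such as "169.254.1.1 : ok=65 changed=13 ... ignored=1".
RECAP_LINE = re.compile(r"^(\S+)\s*:\s*((?:\w+=\d+\s*)+)$")


def parse_play_recap(text):
    """Map each host in a PLAY RECAP to its integer counters."""
    recap = {}
    for line in text.splitlines():
        match = RECAP_LINE.match(line.strip())
        if match:
            pairs = (pair.split("=") for pair in match.group(2).split())
            recap[match.group(1)] = {key: int(value) for key, value in pairs}
    return recap


def failed_hosts(recap):
    """Return hosts that report failed or unreachable tasks."""
    return [host for host, counters in recap.items()
            if counters.get("failed", 0) > 0 or counters.get("unreachable", 0) > 0]


SAMPLE = """\
PLAY RECAP ******************************************************************
169.254.1.1 : ok=65 changed=13 unreachable=0 failed=1 skipped=73 rescued=0 ignored=1
169.254.1.2 : ok=4 changed=0 unreachable=0 failed=0 skipped=1 rescued=0 ignored=0
Status: FAIL
Time Elapsed: 0h 1m 25s
"""

recap = parse_play_recap(SAMPLE)
print(failed_hosts(recap))
```

In the example run, only 169.254.1.1 reports a failed task, and the `Message` line identifies the switch password validation as the cause.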
Base KB automation:
ECS: xDoctor: RAP073: Switch Connection Failure Detected
Additional KBs consolidated in this automation:
ECS: xDoctor reports switch connection failure due to RSA key in known_hosts