Start a Conversation

Unsolved

This post is more than 5 years old

447

April 17th, 2011 12:00

Autostart agent doesn't start when join a Domain

Hi,

I have an issue with Autostart. I already install Autostart 5.4 in two Solaris 10 machines.

In the first machine "SERVER1", when run the ./ft_setup, chose that machine like a primary agent and create a Domain "TEST", next start the backbone and agent and all be OK. Then, in the second one "SERVER2", in the setup, chose the Domain "Test", recently created and when ask for the primary agent node, and put "server_1", so the setup finish without problem.

The issue is when we tried to startup the second agent, never get up. The only error that show is the next one:

Agent startup failed.
Unexplained fatal error. No $FT_DIR/log/agent/SERVER2_fatal.out file found.
FULLTIME_SITES_TID 00000002
+ 1:8042,8042,8043 SERVER1    Test #FT_Agent_Port=8045
+ 2:8042,8042,8043 SERVER2 Test

And, in the ftAgent.log:

> 0x7d3a94 Sun Apr 17 13:34:33 2011 apm_localRunCommand
ERROR: 0x7d3a94 (04/17/11 13:34:33) Cannot get working directory, continuing  anyway
> 0x7d46c4 Sun Apr 17 13:34:33 2011 UPM_userConnectHandler
> 0x7d46c4 Sun Apr 17 13:34:33 2011 UPM_acceptSocketConnection
< 0x7d3a94 Sun Apr 17 13:34:33 2011
INFO: 0x7d3a94 (04/17/11 13:34:33) ID00004746 Successfully connected to 130.0.20.68
INFO: 0x7d3a94 (04/17/11 13:34:33) A primary agent is alive, so let the secondary connect
> 0x7d3a94 Sun Apr 17 13:34:33 2011 nm_CacheNodeIPAddrs
< 0x7d3a94 Sun Apr 17 13:34:33 2011 nm_CacheNodeIPAddrs ret: 0
> 0x7d3a94 Sun Apr 17 13:34:33 2011 em_AgentInit
< 0x7d3a94 Sun Apr 17 13:34:33 2011 em_AgentInit ret: 0
EVENT: 0x7d3a94 (04/17/11 13:34:33) ID00000806 CM - binding agent Sock Addr - 127.0.0.1 - port - 0
EVENT: 0x7d3a94 (04/17/11 13:34:33) ID00000807 CM - agent Sock Addr assigned to addr - 127.0.0.1 - port - 49612
> 0x7d3a94 Sun Apr 17 13:34:33 2011 nm_RamdInit
< 0x7d3a94 Sun Apr 17 13:34:33 2011 nm_RamdInit ret: 0
> 0x7d3a94 Sun Apr 17 13:34:33 2011 ipm_RamdInit
EVENT: 0x7d3a94 (04/17/11 13:34:33) ipm_ramd.c: 339: Initialized ipm_GlobalMutex
EVENT: 0x7d3a94 (04/17/11 13:34:33) ipm_ramd.c: 341: Initialized ipm_AssignMutex
< 0x7d3a94 Sun Apr 17 13:34:33 2011 ipm_RamdInit ret: 0
> 0x7d3a94 Sun Apr 17 13:34:33 2011 fdm_RamdInit
< 0x7d3a94 Sun Apr 17 13:34:33 2011 fdm_RamdInit ret: 0
> 0x7d3a94 Sun Apr 17 13:34:33 2011 atm_Init
< 0x7d3a94 Sun Apr 17 13:34:33 2011 atm_Init ret: 0
> 0x7d3a94 Sun Apr 17 13:34:33 2011 apm_RamdInit
< 0x7d3a94 Sun Apr 17 13:34:33 2011 apm_RamdInit ret: 0
> 0x7d3a94 Sun Apr 17 13:34:33 2011 svr_rum_Init
< 0x7d3a94 Sun Apr 17 13:34:33 2011 svr_rum_Init ret: 0
> 0x7d3a94 Sun Apr 17 13:34:33 2011 sec_Init
< 0x7d3a94 Sun Apr 17 13:34:33 2011 sec_Init ret: 0
> 0x7d3a94 Sun Apr 17 13:34:33 2011 sam_Init
< 0x7d3a94 Sun Apr 17 13:34:33 2011 sam_Init ret: 0
> 0x7d3a94 Sun Apr 17 13:34:33 2011 dm_RamdInit
> 0x7d3a94 Sun Apr 17 13:34:33 2011 dm_UxFsInit
< 0x7d3a94 Sun Apr 17 13:34:33 2011
< 0x7d3a94 Sun Apr 17 13:34:33 2011 dm_RamdInit ret: 0
> 0x7d3a94 Sun Apr 17 13:34:33 2011 cam_RamdInit
< 0x7d3a94 Sun Apr 17 13:34:33 2011 cam_RamdInit ret: 0
> 0x7d3a94 Sun Apr 17 13:34:33 2011 rg_RamdInit
< 0x7d3a94 Sun Apr 17 13:34:33 2011 rg_RamdInit ret: 0
INFO: 0x7d3a94 (04/17/11 13:34:33) Entering the isis_start_done call
INFO: 0x7d3a94 (04/17/11 13:34:33) Entering the initalization stage for Ramd call
> 0x7d3a94 Sun Apr 17 13:34:33 2011 im_RamdInit
EVENT: 0x7d3a94 (04/17/11 13:34:33) ID00001096 IM - AgentGroupName is /ram/NAFIN_A/agent
EVENT: 0x7d3a94 (04/17/11 13:34:33) ID00001097 IM - LocalGroupName is /ram/NAFIN_A/node/nafin10
> 0x7d3a94 Sun Apr 17 13:34:33 2011 waitForDomainReady
> 0x7d3a94 Sun Apr 17 13:34:33 2011 isDomainReady
< 0x7d3a94 Sun Apr 17 13:34:33 2011 isDomainReady ret: 1
< 0x7d3a94 Sun Apr 17 13:34:33 2011 waitForDomainReady ret: 8283100
> 0x7d3a94 Sun Apr 17 13:34:33 2011 sec_GenCredentials
> 0x7d3a94 Sun Apr 17 13:34:33 2011 sec_GetBypassCredentialsFlag
< 0x7d3a94 Sun Apr 17 13:34:33 2011 sec_GetBypassCredentialsFlag ret: 0
< 0x7d3a94 Sun Apr 17 13:34:33 2011 sec_GenCredentials ret: 8198048
> 0x7d3a94 Sun Apr 17 13:34:33 2011 sec_AddExistingMembers
> 0x7d3a94 Sun Apr 17 13:34:33 2011 sec_AddAddress
> 0x7d3a94 Sun Apr 17 13:34:33 2011 sec_getAddrEntry
< 0x7d3a94 Sun Apr 17 13:34:33 2011 sec_getAddrEntry ret: 0
< 0x7d3a94 Sun Apr 17 13:34:33 2011 sec_AddAddress ret: 0
< 0x7d3a94 Sun Apr 17 13:34:33 2011 sec_AddExistingMembers ret: 0
< 0x7d46c4 Sun Apr 17 13:34:40 2011 UPM_acceptSocketConnection ret: 0
> 0x7d46c4 Sun Apr 17 13:34:40 2011 UPM_checkAuthorization
< 0x7d46c4 Sun Apr 17 13:34:40 2011 UPM_checkAuthorization ret: 1
> 0x7d46c4 Sun Apr 17 13:34:40 2011 UPM_acceptSocketConnection
FATAL: 0x8182d4 (04/17/11 13:36:35) Primary Agent process group has failed.
> 0x8182d4 Sun Apr 17 13:36:35 2011 adm_dbTerminate
< 0x8182d4 Sun Apr 17 13:36:35 2011 adm_dbTerminate ret: 0
> 0x8182d4 Sun Apr 17 13:36:35 2011 trace_DumpRing
< 0x8182d4 Sun Apr 17 13:36:35 2011

In the console, when the SERVER2 tried to connect to the domain, we can see that the private interface of the SERVER1 go down and up, like two times in the process, until the SERVER2 fail with the error that I share first.

I hope some can help us.

Best Regards

No Responses!
No Events found!

Top