Hi,
I have a search head cluster (SHC) with three search heads.
I see that one SHC member goes for a restart but never comes back up. This is the log line:
INFO SHCSlave - event=SHPSlave::handleHeartbeatDone master has instructed peer to restart
The SHC has three members with a dynamic captain.
What could be going wrong?
Please help.
So many things could be going wrong...so very many.
What happens when you log on to that search head and just do splunk start?
Hi immortalraghavan,
Can you please tell me the output of the "/splunk show shcluster-status" command?
http://answers.splunk.com/answers/82275/why-is-my-windows-cluster-peer-node-continually-restarting
Essentially, the directory permissions on /slave-apps/ on the search peer had been lost (why?) and the directory was set to read-only. As per the link above, resetting the permissions allowed the Cluster Master to once again populate the directory with the required apps.
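
If you want to check for the same read-only symptom before touching anything, a rough sketch like the one below may help. It is not a Splunk tool: the function names are made up for illustration, it assumes a POSIX host where the owner write bit is what matters (Windows ACLs work differently), and you would pass in whatever the full path to the apps directory is on your own install.

```python
import os
import stat


def find_readonly_dirs(root):
    """Return directories under root (root included) whose owner write bit is cleared."""
    readonly = []
    for dirpath, _dirnames, _filenames in os.walk(root):
        mode = os.stat(dirpath).st_mode
        if not mode & stat.S_IWUSR:
            readonly.append(dirpath)
    return sorted(readonly)


def restore_owner_write(paths):
    """Add the owner write bit back to each path, leaving the other permission bits alone."""
    for path in paths:
        mode = stat.S_IMODE(os.stat(path).st_mode)
        os.chmod(path, mode | stat.S_IWUSR)
```

Run find_readonly_dirs first and look at the list; only call restore_owner_write once you are sure those directories are meant to be writable by the account that runs Splunk. This only looks at the owner write bit, so a directory owned by the wrong user would still need to be checked by hand.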