Why does a Cluster Peer (Indexer) take a long time to start splunkweb when the Cluster Master is down?

Masa
Splunk Employee

In my test environment, I was cleaning up all data on the indexer cluster peers.
Because the current version (v6.2.2) has no Splunk utility to clean all index databases in an indexer clustering environment, I took the following steps.

Version: 6.2.2
CM: Cluster Master
CP: Cluster Peer

  1. Stop CM
  2. Stop CPs
  3. Run "splunk clean eventdata -f" on all CPs
  4. Start CPs


    • Each CP took over five minutes to return a command prompt.

    • The time was spent at "Waiting for web server at http://127.0.0.1:55110 to be available...."

    • Sometimes it ended with a splunkweb timeout: "WARNING: web interface does not seem to be available!"

  5. Start CM

    • CM shows the CPs are okay and I can log in to splunkweb on the CPs

If the CM is up, starting a CP does not stall at "Waiting for web server at http://127.0.0.1:55110 to be available...." and completes within five seconds.
It seems that splunkweb is stuck until the CP connects to the CM.

esix_splunk
Splunk Employee

The best practice here is not to stop the cluster master, but to put it into maintenance mode. This prevents bucket fixup and cluster rebalancing processes from running.

After this, you can stop the individual peers, clean the indexes, and restart them. Once this is completed across the cluster, take the master out of maintenance mode.
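
As a rough illustration, the sequence above could be sketched in shell as follows. This is a minimal sketch, not a tested procedure: the /opt/splunk default for SPLUNK_HOME, the DRY_RUN switch, and the function names are assumptions made for this example. Each function is meant to be run on the host named in its comment.

```shell
#!/bin/sh
# Sketch only. The SPLUNK_HOME default, DRY_RUN, and the function names
# below are hypothetical and exist only for this illustration.
SPLUNK_HOME="${SPLUNK_HOME:-/opt/splunk}"
DRY_RUN="${DRY_RUN:-1}"   # 1 = print each command instead of running it

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# On the cluster master, before touching any peer.
master_begin() {
    run "$SPLUNK_HOME/bin/splunk" enable maintenance-mode
}

# On each peer: stop, clean all indexed event data, start again.
peer_clean() {
    run "$SPLUNK_HOME/bin/splunk" stop &&
    run "$SPLUNK_HOME/bin/splunk" clean eventdata -f &&
    run "$SPLUNK_HOME/bin/splunk" start
}

# On the cluster master, after every peer is back up.
master_end() {
    run "$SPLUNK_HOME/bin/splunk" disable maintenance-mode
}

# Optional dispatch when the file is executed with an argument.
case "${1:-}" in
    master-begin) master_begin ;;
    peer-clean)   peer_clean ;;
    master-end)   master_end ;;
esac
```

With the default DRY_RUN=1 the script only prints the commands it would run, which makes it easy to review the order before setting DRY_RUN=0 on a real host.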

When the peers start, they attempt to reach the cluster master, register, and get peer information in order to meet the cluster's search and replication factors. So yes, without the cluster master up and running, the peers will take time to start.

During the start process, you can monitor the splunkd.log file (in $SPLUNK_HOME/var/log/splunk/splunkd.log) and see the peer's connection attempts to the cluster master.
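
One way to pick those attempts out of the log is a simple case-insensitive filter. This is a sketch under assumptions: the function name cm_log_lines is made up for this example, and the default search term "cluster" is only a guess at what the relevant messages contain, so adjust it to whatever your splunkd.log actually shows.

```shell
#!/bin/sh
# Sketch only. cm_log_lines is a hypothetical helper, and the default
# pattern "cluster" is an assumption, not a documented log keyword.
SPLUNK_HOME="${SPLUNK_HOME:-/opt/splunk}"

# Usage: cm_log_lines [logfile] [pattern]
# Prints the lines of the log that match the pattern, ignoring case.
cm_log_lines() {
    cm_log="${1:-$SPLUNK_HOME/var/log/splunk/splunkd.log}"
    cm_pattern="${2:-cluster}"
    grep -i -- "$cm_pattern" "$cm_log"
}
```

To watch the attempts live while a peer is starting, the same filter can be applied to a followed log, for example by piping tail -f on the log file into grep -i with the same pattern.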

Masa
Splunk Employee

Thanks for your advice.
I wanted to confirm that this slow start-up is expected when the CM is down.
