All Apps and Add-ons

Why does the Splunk process die during AWS cloud formation? "helper process seems to have died (child killed by signal 15: Terminated"

a263534
New Member

I've got a CF template that does the Splunk configuration for a license/master server and the last step starts the splunk service. I've logged the output and see Splunk starting up fine. Once I log into the EC2 instance, I see Splunk has died. The splunkd.log shows this error:

08-03-2016 12:32:40.130 -0400 FATAL ProcessRunner - Unexpected EOF from process runner child!
08-03-2016 12:32:40.130 -0400 ERROR ProcessRunner - helper process seems to have died (child killed by signal 15: Terminated
0 Karma

jkat54
SplunkTrust
SplunkTrust

try creating another "job" / "task" to run the restart. My guess is that the script is being killed because the cloud formation user is exiting which then orphans your process. You can try nohup for example:

nohup /opt/splunk/bin/splunk restart &

Or maybe just backgrounding it will do

/opt/splunk/bin/splunk restart &

You might even have to disown it before ending your script.

Maybe even a sleep command would work

/opt/splunk/bin/splunk restart && sleep 5

That would wait for the restart to complete and if successful sleep 5 seconds.

0 Karma

Jeremiah
Motivator

I think we'll need some more details about how you're deploying Splunk to help. Are there any more relevant lines in the log?

In the meantime, there are a few posts about the same error:

https://answers.splunk.com/answers/83718/splunkd-died-every-day-with-the-same-error.html
https://answers.splunk.com/answers/187283/why-did-our-splunk-forwarder-die-getting-error-pro.html
https://answers.splunk.com/answers/212205/splunk-forwarder-did-die-getting-error-processrunn.html

0 Karma

a263534
New Member

I'm deploying by using the rpm package and then running commands from the cloud formation template to configure it:
/opt/splunk/bin/splunk enable boot-start -user root --accept-license
/opt/splunk/bin/splunk start --accept-license
/opt/splunk/bin/splunk edit cluster-config -mode master -replication_factor 3 -search_factor 3 -cluster_label splunkmaster
/opt/splunk/bin/splunk restart

Once it starts up it dies right away after the reboot.
Manually starting the service it stays up which is very confusing.

0 Karma

Jeremiah
Motivator

Sounds like something is wrong with your init script. If you are running as root, you don't need to specify the -user. Does starting manually reveal any errors or prompts? If you re-run the enable boot-start, does that fix it? Have you run this cfn template multiple times and had this problem with each cluster you've started?

0 Karma

a263534
New Member

No, manually works fine - no errors or anything.
I did find the issue with the reboot however that was a cloud configuration where it was re-running the cloudformation each reboot.

Now the issue only remains on new deployments when the cloudformation runs initially, right when it's done with the script the splunk process is killed.
And yes, the CFT has been run many times and it's the same result each time.
thanks for your help

0 Karma

Jeremiah
Motivator

Gotcha. You might try doing that last restart by executing /etc/init.d/splunk restart instead. Its a longshot but maybe when your user-data script completes that causes the splunk processes to exit as well.

0 Karma

a263534
New Member

no luck 😞

0 Karma
Get Updates on the Splunk Community!

Webinar Recap | Revolutionizing IT Operations: The Transformative Power of AI and ML ...

The Transformative Power of AI and ML in Enhancing Observability   In the realm of IT operations, the ...

.conf24 | Registration Open!

Hello, hello! I come bearing good news: Registration for .conf24 is now open!   conf is Splunk’s rad annual ...

ICYMI - Check out the latest releases of Splunk Edge Processor

Splunk is pleased to announce the latest enhancements to Splunk Edge Processor.  HEC Receiver authorization ...