All Apps and Add-ons

Splunk Analytics for Hadoop: Why is Hadoop returning "Permission denied"?

Harishma
Communicator

hI @Ledion Bitincka ,

I'm also getting similar error. I'm using trial version of Splunk Analytics for Hadoop.

[myhadoopprovider2] Error while running external process, return_code=255. See search.log for more info

[myhadoopprovider2] Exception - com.splunk.mr.JobStartException: Failed to start MapReduce job. Please consult search.log for more information. Message: [ Failed to start MapReduce job, name=SPLK_ABC.com_1481188820.161_0 ] and [ Permission denied: user=splunkd1, access=WRITE, inode="/tmp/hadoop-yarn/staging/splunkd1/.staging":hdpuser:supergroup:drwxr-xr-x

hdpuser --> hadoop cluster
splunkd1 --> splunk search head

I did this command and I'm still seeing above error:
hadoop fs -chown splunkd1 hdfs://nnhost:port/path/to/working/dir

hadoop fs -ls /data/input/splunk/linux/
16/12/08 04:07:25 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
Found 3 items
-rwxrwxrwx 1 splunkd1 supergroup 15429 2016-12-08 02:25 /data/input/splunk/linux/LICENSE.txt
-rwxrwxrwx 1 splunkd1 supergroup 101 2016-12-08 02:25 /data/input/splunk/linux/NOTICE.txt
-rwxrwxrwx 1 splunkd1 supergroup 1366 2016-12-08 02:25 /data/input/splunk/linux/README.txt

In fact, I have provided 777 permissions also to the entire HDFS directory

Kindly help

0 Karma

Ledion_Bitincka
Splunk Employee
Splunk Employee

The error message contains the directory and its permissions of where Splunk is failing to write. A simple fix here would be to have Splunk run as user hdpuser instead of splunkd1

 Permission denied: user=splunkd1, access=WRITE, inode="/tmp/hadoop-yarn/staging/splunkd1/.staging":hdpuser:supergroup:drwxr-xr-x
0 Karma

Harishma
Communicator

Hi @Ledion Bitincka ,

In a previous post (link below) you had mentioned to execute the following command as hdpuser (ie change the ownership of that dir to splunkd1)

I did that :
hadoop fs -chown splunkd1 hdfs://data/input/splunk/linux/

and it didn't seem to fix the issue.

Kindly guide

[link text][2]

https://answers.splunk.com/answers/177832/permission-denied-for-hunk-job.html?utm_source=typeahead&u...

0 Karma

rdagan_splunk
Splunk Employee
Splunk Employee

Can you share the Splunk configurations?
What are your values for the Name Node or the Yarn Resource Manager?

0 Karma

Harishma
Communicator

Hi @rdagan,

here are the configs:

[provider:myhadoopprovider2]
vix.command.arg.3 = $SPLUNK_HOME/bin/jars/SplunkMR-hy2.jar
vix.env.HADOOP_HOME = /home/splunkd1/hadoop-2.7.2
vix.env.JAVA_HOME = /opt/common/jdk1.7.0_21
vix.family = hadoop
vix.fs.default.name = hdfs://XXX:9000
vix.mapreduce.framework.name = yarn
vix.output.buckets.max.network.bandwidth = 0
vix.splunk.home.hdfs = /user/home/working/
vix.yarn.resourcemanager.address = XXX:8032
vix.yarn.resourcemanager.scheduler.address = XXX:8030

[vihadooptest2]
vix.input.1.path = /data/input/splunk/linux/...
vix.provider = myhadoopprovider2

Could you please help? I still face this issue..

0 Karma

richgalloway
SplunkTrust
SplunkTrust

I moved this question from a comment on a two-year-old thread.

---
If this reply helps you, Karma would be appreciated.
0 Karma

Harishma
Communicator

Oh Ok..No problem as long as I get some help 🙂

0 Karma

richgalloway
SplunkTrust
SplunkTrust

You're more likely to get help from a new question.

---
If this reply helps you, Karma would be appreciated.
0 Karma
Get Updates on the Splunk Community!

Extending Observability Content to Splunk Cloud

Watch Now!   In this Extending Observability Content to Splunk Cloud Tech Talk, you'll see how to leverage ...

More Control Over Your Monitoring Costs with Archived Metrics!

What if there was a way you could keep all the metrics data you need while saving on storage costs?This is now ...

New in Observability Cloud - Explicit Bucket Histograms

Splunk introduces native support for histograms as a metric data type within Observability Cloud with Explicit ...