All Apps and Add-ons

Splunk on Hadoop

riteshbansal
New Member

Hello Team,

I would like to know what kind of connectivity Splunk has with Hadoop and HDFS?

I noticed that index creation part of splunk takes a good amount of time, so I would like to know following:

  1. Is it possible to install splunk over HDFS? So if we have weblog data over HDFS, can Splunk index creation done using MR jobs?
  2. How splunk stores the data? So if I have connected it to multiple servers to fetch web logs data, will it pull all data to local server, create index and store index as well in local?

Thanks in advance,
Ritesh

Tags (1)
0 Karma

araitz
Splunk Employee
Splunk Employee

See http://www.splunk.com/view/hadoop-connect/SP-CAAAHA3

Splunk itself does not run on HDFS, but Hadoop Connect facilitates interaction with it.

We also have Hadoop Ops for monitoring and troubleshooting Hadoop deployments: http://splunk-base.splunk.com/apps/57004/splunk-app-for-hadoopops

Splunk stores data in a distributed fashion on machines called 'indexers'. Generally indexers are seperate machines than where the data is created. You can use a 'forwarder' to get data from production machines to indexers. Many indexers can be searched at the same time from a machine configured as a 'search head'.

0 Karma
Get Updates on the Splunk Community!

Introducing Splunk Enterprise 9.2

WATCH HERE! Watch this Tech Talk to learn about the latest features and enhancements shipped in the new Splunk ...

Adoption of RUM and APM at Splunk

    Unleash the power of Splunk Observability   Watch Now In this can't miss Tech Talk! The Splunk Growth ...

Routing logs with Splunk OTel Collector for Kubernetes

The Splunk Distribution of the OpenTelemetry (OTel) Collector is a product that provides a way to ingest ...