Monitoring Splunk

Any limits on the number of files that Splunk can monitor?

daniel_splunk
Splunk Employee
Splunk Employee

Are there any limit on how many files can be monitored by one indexer at the same time provided that it doesn't hit the OS file descriptor limits and H/W capacity?

1 Solution

mchang_splunk
Splunk Employee
Splunk Employee

There is no any limitation about how many files Splunk can monitor at the same time.
On local disk, you may see delays when the tailing processor is exposed to several hundred of thousands/millions of files, but eventually they will be indexed.
This issue occurring input-side, so you won't see any queues blocked.

View solution in original post

ahattrell_splun
Splunk Employee
Splunk Employee

At the end of the day, this does really depend on the hardware resources available. A 16 cpu machine with 24 Gb of memory will be able to process far more files in a minute than a 1 core 386 with 512 Mb.

To put it another way, you'll hit the limits of your hardware before you hit the limits of the software. I would always recommend benchmarking any application (Splunk or otherwise) on as close to the hardware you are planning to use as you can get.

mchang_splunk
Splunk Employee
Splunk Employee

There is no any limitation about how many files Splunk can monitor at the same time.
On local disk, you may see delays when the tailing processor is exposed to several hundred of thousands/millions of files, but eventually they will be indexed.
This issue occurring input-side, so you won't see any queues blocked.

hexx
Splunk Employee
Splunk Employee

Note that if the monitored files are hosted on distributed filesystems such as NFS, which incur high latency for file access, you start to incur latency in the acquisition of data well before 100,000 files exposed to the tailing processor.

Using inputs.conf parameters such as ignoreOlderThan can help to reduce the tailing processor's scope and keep it up to date with the important files.

Get Updates on the Splunk Community!

Routing logs with Splunk OTel Collector for Kubernetes

The Splunk Distribution of the OpenTelemetry (OTel) Collector is a product that provides a way to ingest ...

Welcome to the Splunk Community!

(view in My Videos) We're so glad you're here! The Splunk Community is place to connect, learn, give back, and ...

Tech Talk | Elevating Digital Service Excellence: The Synergy of Splunk RUM & APM

Elevating Digital Service Excellence: The Synergy of Real User Monitoring and Application Performance ...