Knowledge Management

Why do we see large variations of the data age for a certain index?

danielbb
Motivator

We see the following for one index in the cluster master -

alt text

Why do we see these fluctuations for the data age among the eight indexers?

Tags (1)
0 Karma

solarboyz1
Builder

Buckets get rolled based on the youngest event in the bucket.

Buckets can span minutes, hours, days, etc.. depending on how much data is received into the index and your index configuration.
Additionally, if you turn up a new datasource and its starts reading in old logs. Those old events will now be mixed with your new events.
Finally, if the clock settings on the systems are incorrect...this can lead to events getting old or future dates.

Long story short, when old events get mixed in a bucket with newer events, the old events have to wait till ALL events in the bucket age out.

danielbb
Motivator

Makes sense - how can I check these buckets?

0 Karma

solarboyz1
Builder
|  dbinspect index=yourindex 

It will show the startEpoch and endEpoch for each bucket, among other things.
https://docs.splunk.com/Documentation/Splunk/7.3.1/SearchReference/Dbinspect

If you want to drill into the events in a specific bucket:

index=yourindex | where _bkt="summary~22~4F582768-7B38-4768-95EA-EC3D491A8A23"

Where _bkt matches the bucketId from dbinspect results.

https://docs.splunk.com/Documentation/Splunk/7.3.1/Knowledge/Usedefaultfields

Get Updates on the Splunk Community!

Announcing Scheduled Export GA for Dashboard Studio

We're excited to announce the general availability of Scheduled Export for Dashboard Studio. Starting in ...

Extending Observability Content to Splunk Cloud

Watch Now!   In this Extending Observability Content to Splunk Cloud Tech Talk, you'll see how to leverage ...

More Control Over Your Monitoring Costs with Archived Metrics GA in US-AWS!

What if there was a way you could keep all the metrics data you need while saving on storage costs?This is now ...