Deployment Architecture

Splunk Cluster Buckets

adrianathome
Communicator

I understand that the hot/warm/cold buckets need the same storage characteristics for performance. However, is there an in-between bucket for data that has aged out but that I would like to retain on slower, cheaper disks?

0 Karma

mikelanghorst
Motivator

In 6.0, replicated buckets are now kept in the same "level" on both sides. Hot buckets are replicated into the same location as the hot buckets on the peer.

0 Karma

stevenpoitras
New Member

From my perspective it all depends on your configuration and requirements. However, I put the cold bucket on a lower-performance tier, knowing that search times "could" be impacted since disk IO wouldn't be the same as that serving the hot/warm buckets.

I optimize the hot bucket for read and write IO, and the warm and cold buckets for read (array-side read caches can help dramatically here). In my opinion it isn't necessarily economical to keep cold buckets on high-performance storage, especially as the data grows to TBs+.
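
As a rough illustration of the tiering described above, here is a minimal indexes.conf sketch that keeps hot/warm buckets on one volume and cold buckets on another. The index name, mount points, and size limits are hypothetical placeholders, not values from this thread; adjust them for your environment and check the indexes.conf spec for your Splunk version.

```ini
# indexes.conf (sketch; index name, paths, and sizes are hypothetical)

# Faster tier for hot and warm buckets
[volume:fast]
path = /mnt/fast_disk
maxVolumeDataSizeMB = 500000

# Slower, cheaper tier for cold buckets
[volume:slow]
path = /mnt/slow_disk
maxVolumeDataSizeMB = 4000000

[my_index]
# homePath holds hot and warm buckets
homePath = volume:fast/my_index/db
# coldPath holds cold buckets
coldPath = volume:slow/my_index/colddb
# thawedPath cannot reference a volume
thawedPath = $SPLUNK_DB/my_index/thaweddb
# replicate this index across cluster peers
repFactor = auto
```

In a cluster, the same index definition would normally be pushed to every peer so the paths resolve identically on each node.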

0 Karma

stevenpoitras
New Member

Correct, replication will go from the hot bucket on node 1 to the cold bucket on node 2.

If the cold bucket is "optimized for reads", there will be some penalty on write performance, which increases replication times. But for cold, spindle count plus read caching is optimal for the sequential replication traffic.

In reality what you're really focusing on is the rate at which data can be read from that replica in the case that the originating node fails.

As always, it's about finding the correct balance to fit the IO requirements, but replication IO is essentially a secondary operation.

0 Karma

adrianathome
Communicator

So if your lower-tier storage is optimized for reads, how does the replication affect the performance of the cluster? From what I understand, replication is write-intensive and it happens on the cold bucket.

0 Karma