Deployment Architecture

Syslog, data storage, buckets

ihingos
Engager

I'm looking to index and store a ton of data (syslog). My question is: once Splunk has indexed the data and moved it to the various buckets, is there any dedup or compression that happens? Is there a document someplace that explains the process in more detail?

Thanks


bmacias84
Champion

Hello ihingos,

To answer your question: Splunk does not dedup raw events at index time, but it does compress them. You can still dedup events at search time in the search query language ( yoursearch | dedup _raw … ). Depending on the cardinality of your data, you can get fairly high compression ratios. Compression will also vary depending on bucket and index sizes.

As a general rule of thumb, the storage estimate is: ( Daily average indexing rate ) x ( retention policy ) x 1/2
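To make the arithmetic concrete, here is a minimal sketch of that rule of thumb in Python. The function name, the units (GB per day, days of retention), and the 100 GB/day example figures are assumptions for illustration only; the 1/2 factor is just the general estimate above, and your real ratio will depend on your data.

```python
def estimate_storage_gb(daily_gb: float, retention_days: int, factor: float = 0.5) -> float:
    """Rule-of-thumb disk estimate: daily indexing rate x retention x factor.

    factor defaults to 1/2, the general estimate from the answer above.
    It is not a measured compression ratio for any particular data set.
    """
    if daily_gb < 0 or retention_days < 0:
        raise ValueError("daily_gb and retention_days must be non-negative")
    return daily_gb * retention_days * factor


# Hypothetical example: 100 GB/day of syslog kept for 90 days.
print(estimate_storage_gb(100, 90))  # 4500.0 (GB)
```

Treat the result as a starting point for sizing, then compare it against the actual on-disk size of your buckets once you have indexed a representative sample.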

Additional Reading:

Estimate your storage requirements

How Splunk calculates disk storage

