Splunk Search

Report Dedup

TucoRameriz
Explorer

Is there a way to eliminate duplicates by reports? Specifically what I'm looking to do is run a report every 24hrs for X range | stat count by shost. I don't want the report to show any hosts that showed up on the previous report. I know how to eliminate duplicates from a single report but don't know if it's possible to "dedup" on a previously run report.

Thanks

Tags (1)
0 Karma

kristian_kolb
Ultra Champion

martin_muller is essentially right, the solution below might be slightly more efficient, as the stats is performed on only the smaller set of data.

sourcetype=xxx index=yyy earliest=@d NOT [sourcetype=xxx index=yyy earliest=-1d@d latest=@d | dedup shost | fields + shost] | stats c by shost

The subsearch will execute first and return the distinct set of shost for the previous day, so that the outer search will effectively be;

sourcetype=xxx index=yyy earliest=@d NOT ((shost=host1) OR (shost=host2) OR (shost=host3)) | stats c by shost

Hope this helps,

Kristian

0 Karma

martin_mueller
SplunkTrust
SplunkTrust

You could do something like this:

your  report with stat count for the current day | search NOT [query for shost values in the previous day]

That will remove rows if the shost value appeared in the previous day... once translated into proper splunk commands of course.

0 Karma
Get Updates on the Splunk Community!

Introducing the 2024 SplunkTrust!

Hello, Splunk Community! We are beyond thrilled to announce our newest group of SplunkTrust members!  The ...

Introducing the 2024 Splunk MVPs!

We are excited to announce the 2024 cohort of the Splunk MVP program. Splunk MVPs are passionate members of ...

Splunk Custom Visualizations App End of Life

The Splunk Custom Visualizations apps End of Life for SimpleXML will reach end of support on Dec 21, 2024, ...