Splunk Search

How to edit my regex to filter out images in Apache logs from being indexed?

dmccabe2
New Member

Hi,

We have a large amount of data in the Apache log files, and we do not want images to be indexed.

How do I match GET /pictures and filter this out from being indexed?

//SAMPLE LOG

54.13.26.10 - - [07/Oct/2015:08:42:06 +0000] "GET /pictures/cco/4194417.jpg?1444207325625 HTTP/1.1" 200 1472 "https://www.mydomain.com/osites/t/ion/Nom?action=ActionNomOpen&client=cco&browserId=wn-1444206953462-0.3289887811175155" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10.10; rv:41.0) Gecko/20100101 Firefox/41.0" **4739**

//REGEX (Displays IP part)

(?:(?<!\d)(\d{1,3}\.\d{1,3}\.\d{1,3}\.\d{1,3})(?!\d))

Thank you,
Darren.

0 Karma
1 Solution

krish3
Contributor

You can do this by editing props and transforms.conf

In props.conf set the TRANSFORMS-null attribute:

[source::/path/to/your/access.log*]
TRANSFORMS-null= setnull

And in transforms.conf add these lines

[setnull]
REGEX = \"GET\s\/pictures
DEST_KEY = queue
FORMAT = nullQueue

Thanks,

And let me know how it goes.

View solution in original post

0 Karma

krish3
Contributor

You can do this by editing props and transforms.conf

In props.conf set the TRANSFORMS-null attribute:

[source::/path/to/your/access.log*]
TRANSFORMS-null= setnull

And in transforms.conf add these lines

[setnull]
REGEX = \"GET\s\/pictures
DEST_KEY = queue
FORMAT = nullQueue

Thanks,

And let me know how it goes.

0 Karma

dmccabe2
New Member

Many thanks,

I will try it

0 Karma

somesoni2
Revered Legend
0 Karma
Get Updates on the Splunk Community!

Announcing Scheduled Export GA for Dashboard Studio

We're excited to announce the general availability of Scheduled Export for Dashboard Studio. Starting in ...

Extending Observability Content to Splunk Cloud

Watch Now!   In this Extending Observability Content to Splunk Cloud Tech Talk, you'll see how to leverage ...

More Control Over Your Monitoring Costs with Archived Metrics GA in US-AWS!

What if there was a way you could keep all the metrics data you need while saving on storage costs?This is now ...