Splunk Search

Search for value in field after stripping characters

akhanVG
Path Finder

Not sure how best to word the question but below is what I am trying to do - feel free to edit the question header.

We have a list of URLs that are referrals:

e.g.

www.example.com/this-file/doe?a=a
www.example.com/this-file/dane
www.example.com/this-file/doe

URL 1 and URL 3 are in actuality the same, there is just URL params in the first. Is there a method I Can use to strip the URL params before running the search an doing a count? The outcome would be ideally

www.example.com/this-file/doe - 2
www.example.com/this-file/dane - 1

We cannot pre-filter the data using props or inputs.conf. This would have to be done at search run time.

currently our search string is index="test" regex referrer="^http://www.example.com/these-files/*" | stats count by referrer | sort -count

Tags (2)
0 Karma
1 Solution

lguinn2
Legend

Try this

index="test"  referrer="http://www.example.com/these-files/*" 
| rex field=referrer "(?<new_referrer>.*?)\?"
| stats count by new_referrer | sort -count

View solution in original post

aalanisr26
Path Finder

Alternative would be to use regular expression:

index="test" regex referrer="^http://www.example.com/these-files/*" | rex field=referrer "(?.+)\?"
stats count by url | sort -count

0 Karma

lguinn2
Legend

Try this

index="test"  referrer="http://www.example.com/these-files/*" 
| rex field=referrer "(?<new_referrer>.*?)\?"
| stats count by new_referrer | sort -count

aalanisr26
Path Finder

have you try
faup app:
https://splunkbase.splunk.com/app/1545/

this my help you handling urls

akhanVG
Path Finder

Will definitely look into this. @lguinn answer did it for me so far though - thanks!

0 Karma
Get Updates on the Splunk Community!

Stay Connected: Your Guide to May Tech Talks, Office Hours, and Webinars!

Take a look below to explore our upcoming Community Office Hours, Tech Talks, and Webinars this month. This ...

They're back! Join the SplunkTrust and MVP at .conf24

With our highly anticipated annual conference, .conf, comes the fez-wearers you can trust! The SplunkTrust, as ...

Enterprise Security Content Update (ESCU) | New Releases

Last month, the Splunk Threat Research Team had two releases of new security content via the Enterprise ...