Getting Data In

What is the best way to monitor a web page containing links to xml to index RSS feeds in Splunk?

landen99
Motivator

I want to monitor a web page containing links to xml for rss feeds to be indexed into Splunk in real-time. Let's use the following website as an example: https://spotcrime.com/rss.php

What is (are) the best way(s) to monitor that site for indexing? I want to test the method on my Windows machine first. I am interested in learning more about Powershell or Python scripted inputs if that is the best approach.

LukeMurphey
Champion

If you want to get all of those XML feeds into Splunk without entering them manually, writing a custom script might be the best approach. I have been considering writing a search command that would let you scrape web pages recursively. I'll take a look at doing that soon (perhaps in the next couple of days); you can monitor progress on that here.
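
To illustrate the custom-script route, here is a minimal Python sketch of what such a script could look like. It is not the search command mentioned above, and it rests on several assumptions: that the feed links on the page end in `.xml` or `.rss`, that the feeds are plain RSS 2.0 with `<item>` elements, and that one JSON object per line on stdout is an acceptable event format. The names `LinkCollector`, `extract_feed_links`, `parse_rss_items`, `fetch`, and `main` are made up for this example.

```python
import json
import sys
import urllib.request
import xml.etree.ElementTree as ET
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect the href attribute of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def extract_feed_links(html, base_url, suffixes=(".xml", ".rss")):
    """Return absolute, de-duplicated URLs whose path ends in one of `suffixes`.

    The suffix filter is an assumption; adjust it to match the real page.
    """
    parser = LinkCollector()
    parser.feed(html)
    links = []
    for href in parser.hrefs:
        url = urljoin(base_url, href)
        path = url.lower().split("?")[0]  # ignore any query string
        if path.endswith(tuple(suffixes)) and url not in links:
            links.append(url)
    return links


def parse_rss_items(xml_data, feed_url):
    """Yield one dict per <item> element of an RSS 2.0 document (str or bytes)."""
    root = ET.fromstring(xml_data)
    for item in root.iter("item"):
        yield {
            "feed": feed_url,
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
            "pubDate": item.findtext("pubDate", default=""),
            "description": item.findtext("description", default=""),
        }


def fetch(url):
    """Download a URL and return the raw response body as bytes."""
    req = urllib.request.Request(url, headers={"User-Agent": "rss-sketch/0.1"})
    with urllib.request.urlopen(req, timeout=30) as resp:
        return resp.read()


def main(page_url):
    """Print every item of every feed linked from page_url as one JSON line."""
    html = fetch(page_url).decode("utf-8", errors="replace")
    for feed_url in extract_feed_links(html, page_url):
        for event in parse_rss_items(fetch(feed_url), feed_url):
            print(json.dumps(event))


if __name__ == "__main__" and len(sys.argv) > 1:
    main(sys.argv[1])
```

Run by hand with the page URL as the only argument (for example, `https://spotcrime.com/rss.php`), it prints one JSON event per feed item. Splunk can index that output if the script is registered as a scripted input with a polling interval. The sketch does not remember which items it has already printed, so repeated runs would index duplicates unless you track seen links or deduplicate at search time.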

For reference, there are a couple of apps that may be useful to you:

landen99
Motivator

I found a couple of links on the subject, but I don't understand specifically what I need to do to make them work: https://splunkbase.splunk.com/app/278/ and http://blogs.splunk.com/2012/03/14/indexing-feeds/ Also, some of these articles are a bit old (2012, etc.), so I am unsure how much has changed since then and whether there are now better ways to do it.
