About rotten

rotten · ‎03-10-2011

I recently set up a SuSE box with 8 4.1.7 splunk light forwarders (running on different ports, configured in different directories). 'splunk list monitor' results appear to be mangled. I can see which files each splunk forwarder has open by looking /proc/splunk pid/fd. However, when I run 'splunk list monitor' (with the correct PATH, LD_LIBRARY_PATH, SPLUNK_HOME settings for a specific forwarder) I get a mix of files from all 8 forwarders. Is that command talking to the indexer to get the information?

rotten · ‎02-21-2011

Once the data is indexed it is written in stone. Re-reading the props.conf applies to future events.

rotten · ‎02-18-2011

Also I have another one that is either True or False. I'd like to be able to plot a square wave for the data toggling back and forth, and I'd like it to be obvious that a "0" is 'true' and a "1" is 'false'.

rotten · ‎02-18-2011

The way I was doing it with eval, green = 0, yellow = 1, amber = 2, red = 3. Then I was timecharting max(color) for the timespan bins. However the y-axis was labeled with numbers and I'd rather have the color names there. So I was hoping I could use a category chart and skip the eval step, and graph every event (rather than bucketing the events in windows). The events aren't very frequent, so my data density is low.

rotten · ‎02-18-2011

You could try using port forwarding (use something like ipchains if your indexer is on a linux box). The process I have in mind: Bring up the new splunk indexer. Shut down the old splunk indexer. (requests start to queue on the forwarders) Start a (logging) port forwarder on the old splunk indexer server which forwards the splunk port to the new indexer. (requests all start going to the new indexer) Do the IP address change. After the port forwarder stops forwarding any packets, shut down your old indexer server.

rotten · ‎02-18-2011

You can pull in changes to props.conf with the not-so-intuitive search command (as admin): * | extract reload=true I think you only need to search a short time-window (like 5 minutes) for this to cause props.conf to be reloaded.

rotten · ‎02-18-2011

The only time we've used host_segment, the segment name is in the monitor stanza (rather than after it somewhere), and it has worked well for us. ie: Instead of - C:/Program Files/Splunk/etc/apps/ntt_tougou/tougou_logs Try something like - C:/Program Files/Splunk/etc/apps/ntt_tougou/tougou_logs/.../.../* I have no idea if that will help, but it is worth a try until someone with a more definitive answer can chime in...

rotten · ‎02-18-2011

I have a field with a small set of possible text values. I'd like to plot the value of that field over time. As a first pass I used eval to convert the text values to numerical values. Then I was messing around with changing the labels when I saw on this documentation page: http://www.splunk.com/base/Documentation/4.1.7/Developer/CustomChartingConfig-AxisGrid#categoryaxes That there was a "category" chart type. I'm curious how that sort of chart works and if I can use it to graph a string-state value. eg: Y-Axis values: red amber yellow green X-Axis values: timestamps I gave it a try, but it didn't seem to work. Does it work this way? Am I reading too much into the option?

rotten · ‎01-20-2011

followup - I gave up and wrote a script.

rotten · ‎01-10-2011

This doesn't work. I've tried a few variations on it with no luck. I always get "No Results Found", even though I can see lots of unique field-pairs in the logs.

rotten · ‎01-10-2011

Suppose my log entries resembled: Rick ate a cheeseburger Tony ate a grape Rick ate a frenchfry Tony ate a grape Rick ate a cheeseburger Sally ate a salad ... So I have two fields of interest "name" and "food". Now, I'd like to know which user eats the most different kinds of food. I believe the associate command can be used to tell me which users are most likely to eat which foods. What I'd rather find is almost the opposite. Given a key, how many different (unpredictable) values is it paired with? Then I could do things like send an alert "RicksDog ate 7 different kinds of food in the past 24 hours - he's going to be sick!". I can easily do this with a script. I'd like to do it in Splunk, if possible.

rotten · ‎12-29-2010

Don't forget all the etc/users directories too. Do you have any custom searchscripts? (We always have to update sendemail.py after each upgrade. Back when we ran the Unix app we customized several of those scripts as well.) Can he copy the saved reports in ${SPLUNK_HOME}/var too, or are those os/install specific?

rotten · ‎12-28-2010

It isn't so obvious how to force a sourcetype using props.conf. Here is how we do it: In props.conf on the indexer you need to identify the files you want to map to a sourcetype using some sort of expression that looks similar to the inputs.conf expressions: [source::/my/appserver/path/.../logs/access*] TRANSFORMS-appserver_access = fix_appserver_access_sourcetype Then you need to put this in transforms.conf: [fix_appserver_access_sourcetype] REGEX = . FORMAT = sourcetype::appserver_access DEST_KEY = MetaData:Sourcetype Then you need to restart the indexer. All new files that are identified will get the new sourcetype. The files it has already seen will still get mapped with the old sourcetype. -- Another note: If you want to force the sourcetype on the inputs.conf you do that on the forwarder. -- You can also tag sources that have already been indexed to give them an sourcetype alias. This can get really tedious if you have hundreds and hundreds of files that were indexed with the wrong sourcetype.

rotten · ‎12-17-2010

I may, or may not understand your question... You are looking at a webserver access log and you want to report stats, but need to filter out bots. You think that useragent string is the way to identify the bots from the people. Unfortunately useragent strings are wild creatures and very hard to process consistently. There is too much manual intervention required. Could you start by gathering everything that looks for a robots.txt ? Perhaps get a list of IP's or useragents that requested robots.txt and then use those as your filter. It won't get the virus probes and other black hat hits, but should align your stats closer to reality than counting up everything.

rotten · ‎11-16-2010

Can you get Splunk to divide your license into multiple smaller licenses? Then you could start a separate indexer for each customer (on the same server). You could configure them all as search-peers for your own searching purposes. You could also customize apps and look and feel for each of the customers if you had this configuration.

rotten · ‎11-05-2010

Are you talking about minimizing the forwarder footprint, or maximizing the Indexer performance?

rotten · ‎10-25-2010

I wouldn't use the qualifier "all" in (1). Not all machines will necessarily have the capacity or network architecture that allows you to do it, however, you may still want to install a forwarder on as many machines as you can. Even if you have a standard build, and include a forwarder as part of your standard build, you'll still have systems which end up as "exceptions" for whatever reason. RE: (3) splunk can listen for syslog inputs directly. You don't need to do this manually. Missing: You can scp the data back to the indexer with a scripted input, or a job scheduler, or cron. Then pick the data up with the local file system. This can be handy when you want to grab some data off a production server without having to submit a change control to install something new, or change anything on that server. 😉 Also, sometimes the log files are created that way - the result of batch jobs, and it hardly seems worthwhile running a full time splunk forwarder when you only need the output from a batch job once per day. Another architecture option: Sometimes you might want to forward the splunk data to another forwarder, and then send it to the indexer from there. I can think of two reasons off the top of my head: You want to minimize the number of holes in a firewall that you have to open, so you designate one relay box to pull everything from the other side of the firewall and send it through. You want to run lightweight forwarders on your production systems, but need a "full fledged" forwarder to do filtering before the data ends up on the indexer. A Note on NFS: It is slow. If you have a lot of log files, you will quickly run into latency issues. On the other hand, it is usually quick and easy to set up, and can dramatically shorten the deployment time for many low volume sources. I would argue which solution is "best" depends on what you are trying to do and what kind of data you are working with. That is why there are so many possible solutions.

rotten · ‎10-07-2010

All other things being equal, would we see any performance gains with Splunk if we switch our file system from ext3 to xfs? Another thread on Splunk answers recommended moving $SPLUNK_HOME/var/run to its own ext3 file system because lots of little files end up there. However I never saw any real confirmation in that thread as to whether the possible performance gains were worth the effort of switching the file system out on an already running Splunk indexer. We are always looking for ways to get Splunk to perform a little better....

rotten · ‎09-24-2010

It would be both useful and interesting to be able to graph the indexing latency for various data sources or hosts over time. Is there a way to compare "insert time" (for the splunk database) with "event time" (from the source logfile) and build such a set of charts?

rotten · ‎09-13-2010

I'm getting this with one of our saved searches in a custom app as well. (Splunk 4.1.4) I don't have the option to so easily restart Splunk. I saved the search again with a new name and was able to edit that. Then I deleted the original search.

rotten · ‎08-18-2010

Splunk support showed us how to do it using an approach like this: inputs.conf on the lightweight forwarder: [monitor:///foo/bar/logs/] disabled = false host = myServer_myApplication crcSalt = <SOURCE> blacklist = \.(tar|gz|bz2) props.conf on the indexer: [source::/foo/bar/logs/.../*] TRANSFORMS-foobarlogs = fix_foo_bar_logs_sourcetype transforms.conf on the indexer: [fix_foo_bar_logs_sourcetype] REGEX=. FORMAT=sourcetype::foo_bar_log DEST_KEY=MetaData:Sourcetype This approach is the only way we have found to reliably set the sourcetype for the vast majority of our logs. If we pick up the logs on the indexer, we can simplify this by setting the sourcetype in the inputs.conf directly: [monitor:///foo/bar/logs/] disabled = false sourcetype = foo_bar_log host = myServer_myApplication crcSalt = <SOURCE> blacklist = \.(tar|gz|bz2)

rotten · ‎08-18-2010

Did you get a Flash upgrade? Are both IE and FF using the same version of Flash?

rotten · ‎08-16-2010

What authentication method are you using? Did your LDAP server go down? (Splunk doesn't automatically failover, you have to connect as the admin user and flip it if you have a preconfigured backup.) We actually use an F5 between Splunk and our LDAP server, and we have a backup configuration preconfigured as well. If you can't get into the Splunk UI at all, check out $SPLUNK_HOME/etc/system/local/authentication.conf to see how you are authenticating. You may need to temporarily move it out of the way, and bounce splunk, to revert to the default authentication method (local file). If you are using a local file based authentication, is it possible you trashed it somehow? I believe you can hack $SPLUNK_HOME/etc/passwd to try to recover from such a disaster. Search around for "I forgot my admin password" for hints on how to recover it.

rotten · ‎08-13-2010

Have you seen the documentation notes on timezones? http://www.splunk.com/base/Documentation/4.1.4/admin/ApplyTimezoneOffsetstotimestamps Everything I do is all in the same timezone so I haven't run into this yet. However it reads like all of the dates are automatically converted when they are indexed so that they can be matched up when you search later. In other words, I think as long as the splunk datetime parser can figure out the timezone associated with the event, splunk takes care of the conversions for you.

rotten · ‎08-13-2010

I have not used JackRabbit myself. I started to look closely at it for a project that dematerialized before it even got rolling. It seems to be the tool of choice if you want to index a lot of documents. (which is different than events) I'd suggest looking for a python equivalent, since you are a python guy, but the project at http://www.pycr.org doesn't seem to have a lot of traction. (any, actually)

Posts	45
Solutions	4
Karma Given	11
Karma Received	30
Member Since	‎05-25-2010

Online Status	Offline
Date Last Visited	‎06-05-2020 02:02 AM

multiple forwarders list monitor

category chart for discrete data set

keys with many values

switching ext3 for xfs

Indexing Latency Chart

maxDist value varies (in forwarders)

multiple forwarders list monitor

Re: Installation of Splunk again and again.

Re: category chart for discrete data set

Re: category chart for discrete data set

Re: Does a light forwarder constantly resolve outp...

Re: Installation of Splunk again and again.

Re: Host name can not be change

category chart for discrete data set

Re: keys with many values

Re: keys with many values

keys with many values

Re: Migration of splunk 3.x on Solaris to RHEL 5

Re: Sourcetype would increment?

Re: access_combined hide certain useragents

Re: Limit incoming data?

Re: Looking for "high performance Splunk"

Re: Best practice for getting data into Splunk wit...

switching ext3 for xfs

Indexing Latency Chart

Re: Error while modifying Splunk for BlueCoat dash...

Re: Can't get sourcetype right

Re: Internet Explorer Login Fails

Re: Internet Explorer Login Fails

Re: Date / Time stamp translation in search

Re: I am very, very new to Splunk