My search currently gives me some statistics on response times, including total count, average, min, max, and the 99th percentile value (which I assume is computed over the ordered data), based on one of the fields from the query.
I need to find the average response time for the first 99% of my data count per query name, so my logic was to subtract the minimum value from the 99th percentile value and divide that by 99% of the total count. But when I run this, I often get values outside the range between minimum and maximum for certain queries, e.g. Average: 15533.5, Min: 9076, Max: 24737, *99% Average: 479.0*, which clearly wouldn't make sense even if the numbers are heavily skewed.
This was my initial search attempt:
index=* ...
| stats count AS "Query Count", avg(ResponseTime) AS "AvgRespTime", min(ResponseTime) AS "MinRespTime", max(ResponseTime) AS "MaxRespTime", p99(ResponseTime) AS "99PercRespTime" by queryName
| eval AvgRespTime=round(AvgRespTime,1), "99PercAvgRespTime"=('99PercRespTime'-'MinRespTime')/(0.99*'Query Count'), "99PercAvgRespTime"=round('99PercAvgRespTime',1)
| sort - 99PercAvgRespTime
*EDIT: I've spotted a flaw in my initial idea: I'm taking single values rather than sums, which are what averages are actually built from (oops). What I'd ideally want is to subtract the sum of the value(s) at and above the 99th percentile from the total sum, and divide that by the total count minus the count of queries at or above the 99th percentile.*
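In SPL, I think that sum-based logic would look something like this (untested sketch; it assumes eventstats can attach the per-query p99 to each event, and the field names are my own):

```
index=* ...
| eventstats p99(ResponseTime) AS p99 by queryName
| stats count AS total_count, sum(ResponseTime) AS total_sum, sum(eval(if(ResponseTime>=p99, ResponseTime, null()))) AS tail_sum, count(eval(if(ResponseTime>=p99, ResponseTime, null()))) AS tail_count by queryName
| eval trimmed_avg=round((total_sum-tail_sum)/(total_count-tail_count), 1)
```

Here `sum(eval(...))` and `count(eval(...))` should only accumulate the events at or above the cutoff, since null results are skipped by the aggregations.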
(1) Is it even possible to accurately get the average of the first 99% of the data? My logic was that by removing the last 1% of queries (so if there were 1000, the 10 with the highest response times), I could get this average, but what should the query be? I considered taking a 99% trimmed average, but I assume that would remove 0.5% from the lower end as well as the upper end, which is what I've been asked to avoid.
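The closest I've come up with so far is a two-pass sketch (untested; `p99Cutoff` is my own field name): compute the per-query 99th percentile with eventstats, discard only the events at or above it, then run the original stats on what remains, so nothing is trimmed from the lower end:

```
index=* ...
| eventstats p99(ResponseTime) AS p99Cutoff by queryName
| where ResponseTime < p99Cutoff
| stats count AS "Query Count", avg(ResponseTime) AS "99PercAvgRespTime" by queryName
| eval "99PercAvgRespTime"=round('99PercAvgRespTime', 1)
| sort - "99PercAvgRespTime"
```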
(2) Is there a way to show the skewness of the data based on statistics in Splunk? Ideally I'd want a chart, or at least a table, showing this.
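As far as I know, stats has no built-in skewness function, but Pearson's second skewness coefficient, 3*(mean - median)/stdev, can be assembled from aggregations it does have (untested sketch; field names are mine):

```
index=* ...
| stats avg(ResponseTime) AS mean, median(ResponseTime) AS med, stdev(ResponseTime) AS sd by queryName
| eval pearsonSkew=round(3*(mean-med)/sd, 2)
| sort - pearsonSkew
```

For a visual check, binning the response times and charting counts should give a per-query histogram, e.g. `| bin ResponseTime span=1000 | chart count over ResponseTime by queryName`.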