About swapnaliphepale

swapnaliphepale · ‎06-04-2018

Hi All, I am just reading the glass table, but I am confuse that how to import the glass table in normal application? How to create the ITSI service and how to use them? How to build a glass table visualization? How to Configure KPI widgets and is it any tool? Thanks, Swapnali

swapnaliphepale · ‎04-26-2018

Thanks You SukiSen, Please find below Answers to your questions :- 1- How big is your sample data size and what is the split you have done between test and sample data - I have around 51 k events and split is of 80-20 2-Without knowing how many distinct values your JobGroup has, is it possible to convert them to numeric like you said? It might be a bit too much if you have too many jobgroups but I would still suggest trying. I have converted JobGroups into numeric fields and it has more than 90 distinct values. 3- About RMSE. Now, having a high RMSE in general is not good. What does this mean? It means that the cases where there is a variance between predicted values and actual values, the variance is high. So even if say for 95% of your predictions you are reasonably accurate, it could still mean the rest 5% predictions are so huge that your total RMSE is getting too big. Yes, for some result variance is too big. 4- This does not invalidate your model, remember this is an IT scenario. For example say at around 11 AM on 5-6 days someone executed other unplanned jobs (or the jobs normally taking 2 minutes took 20 mins/1 hour / even got hung and had to be killed off). It would mean a very high RMSE , does not mean your model is invalid. Typically jobs by nature will have this kind of scenarios in any IT environment. True. 5- Have you considered applying pre-processing? Just choose standard scaler and apply it to both the dependent and independent variables, it should improve your prediction. I tried this, my R square value increased up to 0.98 which is really good, but RMSE value is still high up to 198. I am not getting what to do,but RMSE value was good previously. 6- How does your adjusted r square look like? It's between 90-98 now. So what I am suggesting as next steps is: 1-Apply a split of 80-20 or 90-10 between your sample and test data and see how accurate your predictions are. If you apply a split of 80-20, the last 20% of your predicted values can be accurately compared with the actual values and you can see how well the model really behaves. Done.Still same result as above. 2-Quantify your jobgroup and run random forest again, let us see how long it takes to run 🙂 I didn't get this. 3-Lastly and most importantly verify the number of occasions your predicted values are different from actuals. Say, for example you have 10000 data points but only 10-20 cases of really high RMSE is present, your model is good. It simply means that on some rare occasions something happened (bad code/system outage) which took the jobs more time to execute than normal. What would be of concern is if your RMSE is evenly spread like 10-20% of your prediction is skewed. Okay, Still can i consider this model as Better than previous

swapnaliphepale · ‎04-16-2018

Thanks for your valuable suggestion Sukisen!! We have applied Random Forest Regressor Algorithm, and found that this algorithm is giving much better results than the previous one. We are getting R square in the range of "0.80-0.85", but RMSE is coming in the range of "150-180". So can we consider it as good result? And also i doubt like if we use the JobGroup as a String as a value "use for predicting", then it will work ? Or else do we need to give only numeric value to form relationship for prediction? Thanks in advance

swapnaliphepale · ‎04-10-2018

Hi John, Yes,I tried to predict duration by giving start time in epoch, as it require the fields in numeric for Predict numeric field of Machine learning Toolkit. But still m not getting good result, the gap between original duration and predicted duration is coming too differ or we can say wast. Suppose the actual duration is 1 sec then it's coming to be around 2000. I know that the start time won't be helpful for calculating duration as if two jobs are starting at same start time it's not necessary that will end at the same time. But when i just giving job group as field to use to predict , then it's saying that " Dropping field(s) with too many distinct values: JobGroup" How to approach now for getting end time correcting ? Thanks in advance!

swapnaliphepale · ‎04-10-2018

I want to predict end time and start time of some jobs and I am currently using "Linear Regression" algorithm Predict numeric field of Machine learning Toolkit. I am having some personal work of prediction, but I am not sure if this approach is right or wrong. I need more details on this and I don't have additional details. Fields that can be used for prediction can - Job Group (Under which group it's coming). But I don't have many fields for predicting start time and end time. If anybody knows about a different approach, please let me know. Thanks in Advance.

Posts	6
Solutions	0
Karma Given	0
Karma Received	0
Member Since	‎04-10-2018

Online Status	Offline
Date Last Visited	‎06-05-2020 02:04 AM

how to use ITSI Glass table in splunk. How to impo...

How to predict end and start time of jobs when usi...

how to use ITSI Glass table in splunk. How to impo...

Re: How to predict end and start time of jobs when...

Re: How to predict end and start time of jobs when...

Re: How to predict end and start time of jobs when...

How to predict end and start time of jobs when usi...