Say I have the following log, with separate input and output parts that are processed as a batch in between:
input id=1 input_id=1
input id=2 input_id=2
input id=3 input_id=3
process id=4 input_ids=1,2,3
output id=5 input_id=1
output id=6 input_id=2
output id=7 input_id=3
I'd like to be able to trace the above in one transaction, such that when I search for input_id=1, I get this:
input id=1 input_id=1
process id=4 input_ids=1,2,3
output id=5 input_id=1
Is that possible (I'm open to modifying the log format to fit Splunk searches)? I'd like to avoid spreading the logging across one line per input, i.e. doing something like this:
input id=1 input_id=1
input id=2 input_id=2
input id=3 input_id=3
process id=4 input_id=1
process id=4 input_id=2
process id=4 input_id=3
output id=5 input_id=1
output id=6 input_id=2
output id=7 input_id=3
as the many extra lines would make the log unreadable for human consumption outside of Splunk. Keeping each batch on one line is also useful for non-Splunk scripts that can rely on all input_ids being on a single line.
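To illustrate what I mean by a non-Splunk script: with the compact format, a few lines of Python are enough to pull out the full transaction for one input_id (this is just a sketch against the sample log above, not production code):

```python
LOG = """\
input id=1 input_id=1
input id=2 input_id=2
input id=3 input_id=3
process id=4 input_ids=1,2,3
output id=5 input_id=1
output id=6 input_id=2
output id=7 input_id=3
"""

def trace(log: str, input_id: str) -> list[str]:
    """Return the input/process/output lines belonging to one input_id."""
    matched = []
    for line in log.splitlines():
        # Parse "key=value" pairs after the leading event type word.
        fields = dict(
            pair.split("=", 1) for pair in line.split()[1:] if "=" in pair
        )
        # Single-input lines carry input_id=N; batch lines carry input_ids=N,M,...
        ids = fields.get("input_ids", fields.get("input_id", "")).split(",")
        if input_id in ids:
            matched.append(line)
    return matched

print("\n".join(trace(LOG, "1")))
```

Searching for input_id=1 returns exactly the three lines I showed above: the input line, the batch process line, and the output line.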
Another thing: batches are not demarcated; they are time-based. Think of the "process" part as something executed every 15 seconds or so. In that window, the number of input lines can be anywhere from 1 to 1000, and likewise for outputs, which depend on what the process emits.
Processing is also not ordered: if 1000 inputs arrive, the process might pick inputs 1 and 1000 in the next batch and then 2-999 in the following batch, for example due to priorities or other settings chosen by the user who pushed the inputs. The only way to know what was picked up in a batch is to look at input_ids.
I'm fine with changing the format of individual lines. If I should go ahead and change:
process id=4 input_ids=1,2,3
to this:
process id=4 input_ids=1_2_3
that is doable and keeps the same information.
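For what it's worth, a non-Splunk script can handle either separator the same way, so the change costs nothing on that side; a minimal sketch (separator choice is just an assumption here, not a recommendation):

```python
import re

line = "process id=4 input_ids=1_2_3"

# Grab the value after "input_ids=" and split on either "," or "_",
# so the script works with both the old and the proposed format.
ids = re.split(r"[,_]", line.split("input_ids=", 1)[1].split()[0])
print(ids)  # ['1', '2', '3']
```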