About jasonstanek

jasonstanek · ‎01-04-2019

Thanks for the proposed answer. Was looking for an index-time solution. If I understand your proposal correctly, it appears this is a search-time solution. Please correct if not the case.

jasonstanek · ‎01-04-2019

The number of rows is consistent. The number of items separated by commas is not consistent.

jasonstanek · ‎01-04-2019

Yes, the RowName's are all fixed text. The only other pieces that are consistently found from entry to entry are the line breaks and comma separators.

jasonstanek · ‎01-03-2019

What I am hearing is that "How do I extract multiple multi-value fields from a multi-line event at index time via regex" is not possible.

jasonstanek · ‎01-03-2019

I need all of the comma separated values on a line. I do not want to skip lines. How does one use a regex instead of tokenization to pull all the comma separated values on a line?

jasonstanek · ‎01-03-2019

Because the number of tokens on a line is unbounded, it is not apparent that a RegEx can accommodate all variants. For unbounded tokenization, the current understanding is that is for what the tokenization feature is intended - however the tokenization feature appears to work by columns only, not rows. If you think a regex can accommodate all variants, please provide specifics.

jasonstanek · ‎01-03-2019

Thank you for the reply to the question - would like to understand the proposed answer better. If by "record format is consistent" you mean the number of comma separated items (tokens) on each row, the number of items on rows A,B,&C varies. Row D is the only row that is predictably the same "format". If you mean something else, can you please elaborate?

jasonstanek · ‎12-13-2018

Needing help with multiple multi-value field extraction from a multiline event. Expecting the result of the following extraction to index each of rowA values with each of rowC identifiers, and index each of rowB values with each of rowC identifiers, and extract the endtime into the record timestamp(s). An acceptable alternative to these associations is a record timestamped with EndTime with multivalue field rowA, multivalue field rowB, and multivalue field rowC. RowNameA,1432,4363,6223,7543,19182,... RowNameB,8383,2727,3221,... RowNameC,NumericalIdentifierA,NumericalIdentifierB,... RowNameD,TheDate,StartTime,EndTime,OtherNumbers,... I am stuck at (,(?\d+)[^\S]+) for the regex to pull out rowA values, which unfortunately cuts across all lines. Apparently adding wildcard to the beginning of the regex misses values. Apparently the tokenizer-based approach requires named columns. Can someone demonstrate to me that Splunk is expressive enough at index time to extract the information in the manner I'm requesting? I am working with Splunk Cloud, with data files sourced via a Heavy Forwarder. I've been unable to get the MV_ADD feature to work in transforms.conf, but have been able to get a single multi-value field to extract via the transform+field extraction console.

Posts	8
Solutions	0
Karma Given	0
Karma Received	0
Member Since	‎12-13-2018

Online Status	Offline
Date Last Visited	‎06-05-2020 02:04 AM

How do I extract multiple multi-value fields from ...

Re: How do I extract multiple multi-value fields f...

Re: How do I extract multiple multi-value fields f...

Re: How do I extract multiple multi-value fields f...

Re: How do I extract multiple multi-value fields f...

Re: How do I extract multiple multi-value fields f...

Re: How do I extract multiple multi-value fields f...

Re: How do I extract multiple multi-value fields f...

How do I extract multiple multi-value fields from ...