Getting Data In

How to modify CSV raw event data before fields extraction stage of INDEXED_EXTRACTIONS=CSV

Maite35
Explorer

Hello,

I am using FIELD_DELIMITER=; and am working on data that use commas instead of decimals. I want to use a SED to replace those with dots when indexing (s /,/./ g) I tried this in props.conf:

SEDCMD-coma = s/,/./g

I also tried this in props. Conf :

TRANSFORMS-toto = toto

And in transforms.conf :

[toto]
REGEX = s/,/./g

And in all cases the behavior is the same : on my raw events ( _raw ) it works fine:

18/03/2015;23:50:00;XXX;XXX;XXX;16;6.52;41740109;0.03;46987.89;193790;0;12885230;0;25215.5;0;15;87;0;0;40008787;0;37.97;0;667;563.19;47255.63;525.22;369.59

But it never effects the fields that are exracted:

10 premières valeurs,          Nombre,     %
0          3832     6,415 %
0,07        108     0,181 %
0,76        103     0,172 %
0,02        97      0,162 %
0,77        96      0,161 %

Ideas to do this?

Thank you in advance. Best Regards.

Tags (3)
1 Solution

Maite35
Explorer

Finaly I used Date Model :

rex mode=sed field=FIELD "s/,/./g"

View solution in original post

0 Karma

woodcock
Esteemed Legend

OK, your solution was to post-modify the fields one-by-one at search time. You don't have to use a Data Model, you can just do it like this whenever you need it (search bar, dashboard), like this:

... | rex mode=sed field=<SomeFieldName> "s/,/./g"
0 Karma

Maite35
Explorer

Finaly I used Date Model :

rex mode=sed field=FIELD "s/,/./g"

0 Karma

woodcock
Esteemed Legend

You should "Accept" the answer from the person who gives you the answer.

0 Karma

woodcock
Esteemed Legend

It looks like you will probably have to pre-process the file outside of Splunk. I wish there was more detail here:

http://docs.splunk.com/Documentation/Splunk/latest/admin/Configurationparametersandthedatapipeline

0 Karma

Maite35
Explorer

Thanks for your help !
finaly, I used Data-model to sed my coma with point ...

0 Karma

rapmancz
Explorer

please what did you do exactly?

0 Karma

woodcock
Esteemed Legend

OK, post exactly what you did as an Answer and then Accept your answer so that we can all learn.

0 Karma

woodcock
Esteemed Legend

How are you creating your fields? Are you using INDEXED_EXTRACTIONS as described here?

http://docs.splunk.com/Documentation/Splunk/latest/Data/Extractfieldsfromfileheadersatindextime

0 Karma

Maite35
Explorer

Hi woodcock,

Yes I am using INDEXED_EXTRACTIONS=CSV

0 Karma

Maite35
Explorer

hello somesoni2 and thank you for your answer and help.
The behavior with what you offer is the same as quoted above: dot is present in _raw but not passed on to the fields extracted from csv file.

0 Karma

somesoni2
Revered Legend

Give this a try

In props.conf:

    SEDCMD-coma = s/(\d*),(\d*)/\1.\2/g
0 Karma
Get Updates on the Splunk Community!

More Ways To Control Your Costs With Archived Metrics | Register for Tech Talk

Tuesday, May 14, 2024  |  11AM PT / 2PM ET Register to Attend Join us for this Tech Talk and learn how to ...

.conf24 | Personalize your .conf experience with Learning Paths!

Personalize your .conf24 Experience Learning paths allow you to level up your skill sets and dive deeper ...

Threat Hunting Unlocked: How to Uplevel Your Threat Hunting With the PEAK Framework ...

WATCH NOWAs AI starts tackling low level alerts, it's more critical than ever to uplevel your threat hunting ...