Find a portion of a value in a list

camiller · ‎01-12-2017

Hello!

I have two CSV files:

in the first file, there is a list of machines hostnames (ex: ABCZER12).
and in the second file, I have a list of the first letters of the machines hostnames (ex: ABC), and each expression correspond to a location (ex: Europe, Brazil...).

An example of the first file:

ABCZER12
ABCSDF56
ABCFHG76
OPQYGT65
XYTGHY41
XYZRFV78

And an example of the second file:

ABC => Europe
OPQR => Brazil
XY => USA

I need to associate each hostname with its corresponding location. To do that, I need to search for each hostname if its begining match with one of the expressions in the second list. Example: ABCZER12, ABCSDF56 and ABCFHG76 will match with ABC, means that those hostnames correspond to the location Europe.
Some hostnames can not match any of the expressions in the second list (ex: OPQYGT65).
In the second file, expressions does not always have the same amount of letters.
In the two files, there is no field that are the same, so I think that I can't use lookups to resolve my problem here.

If anyone have an idea to resolve my problem, it would be great.
Thanks 🙂

somesoni2 · ‎01-22-2017

Have you looked at the Wild card option with lookups? If you're able to convert data from 2nd csv to a lookup with following format, you should be able to use the wildcard feature.

hostprefix,country
ABC*,Europe
OPQR*,Brazil
XY*,USA

See an example here. https://answers.splunk.com/answers/52580/can-we-use-wildcard-characters-in-a-lookup-table.html

camiller · ‎01-23-2017

Yes, thank you. It works.

DalJeanis · ‎01-18-2017

okay, so you are going to have to reformat your second file in order to make this work. Here's one way.

[your search here ]
| fields hostname _time Otherstuff
| eval MatchField=mvappend(
    "Left02=".substr(hostname,1,2),
    "Left03=".substr(hostname,1,3),
    "Left04=".substr(hostname,1,4),
    "None99=((None))")
| mvexpand MatchField
| join max=0 MatchField 
   [| inputlookup prefixfile.csv 
    | fields prefix region
    | eval MatchField="Left0".len(prefix)."=".prefix
    | fields MatchField region
    | append [makeresults | eval MatchField="None99=((None))" | eval region="((none))" ]
    | dedup MatchField region
    ]

| sort 0  _time hostname MatchField
| dedup _time hostname
| fields hostname _time Otherstuff MatchField region

The above code assumes there won't be any events on the same hostname with the same _time timestamp. If your first file is only the host name, just remove _time and Otherstuff from the above code and it should work.

If you have any prefixes longer than 9 digits, you'll have to adjust the code somewhat. (Format the len(prefix) as 2-digits.)

edited to use sort 0 instead of sort in case there were more than 100 values to be sorted.

DalJeanis · ‎01-18-2017

Obviously the code "Left04=".substr(hostname,1,4), needs to be repeated for any length up to the longest one in your second file.

Find a portion of a value in a list

Introducing the 2024 SplunkTrust!

Introducing the 2024 Splunk MVPs!

Splunk Custom Visualizations App End of Life