Getting Data In

How to index unstructured text file?

mukundd
New Member

I'm facing issue with indexing unstructured text file. Is there any config setting?

Tags (2)
0 Karma

ashutoshab
Communicator

Can you please mention some more details, such as the file type, contents of the file and what is the issue you are facing.

Splunk does not face any problem with unstructured data. The only requirement is your data should be in ASCII format. Splunk does not worry about whether the data is structured or not, it just needs to be in ASCII format.

There are some possibilities,

  1. Either the data is not a text file / ASCII data.
  2. The data is indexed but you are not able to search them.
  3. You are able to search the data but the fields are not extracted.

Please write details about your issue and we can help to fix them.

0 Karma

mukundd
New Member

Hi Ashutosh,
I've converted pdf file (unstructured) into text file and indexed same.
Issue I'm facing with extracting fields, I've extracted Patient Name, Provider, Date of Birth, Visit Date, however facing issue with extracting columnar data (table) as below (table may having variable no. of rows).

Dx Code
Diagnosis Code Comment
Other fatigue R53.83

Pruritus, unspecified L29.9

0 Karma

DavidHourani
Super Champion

Hi @mukundd,

There are settings in props.conf for your line breakers and other index time actions. Share your text file with us and we can help you with the indexing issue.

You can find all the settings here :
https://docs.splunk.com/Documentation/Splunk/latest/admin/Propsconf

Cheers,
David

0 Karma
Get Updates on the Splunk Community!

Announcing Scheduled Export GA for Dashboard Studio

We're excited to announce the general availability of Scheduled Export for Dashboard Studio. Starting in ...

Extending Observability Content to Splunk Cloud

Watch Now!   In this Extending Observability Content to Splunk Cloud Tech Talk, you'll see how to leverage ...

More Control Over Your Monitoring Costs with Archived Metrics GA in US-AWS!

What if there was a way you could keep all the metrics data you need while saving on storage costs?This is now ...