Splunk Search

How to convert binary ms word doc into text so I can splunk it?

maverick
Splunk Employee
Splunk Employee

I currently have some medical records in doc form that are binary text created in ms office word.

I want to create dates, times, doctor, patient visits into charts for the doctors within a dashboard in Splunk.

Is there a way that I can convert my binary document into a text based document so I can splunk it?

Is this possible?

0 Karma

lguinn2
Legend

It would be trivial to write a Microsoft VBA macro that simply re-saved a file with the same name, but in Unicode text format with a .txt extension. In fact, here it is

Public Sub SaveAsUnicodeFile()
    Dim newName As String
    Dim pos As Integer  
    pos = InStrRev(ActiveDocument.Name, ".")
    If pos > 0 Then
        newName = Left$(ActiveDocument.Name, pos - 1)
    Else
        newName = ActiveDocument.Name
    End If
    newName = ActiveDocument.Path & "\" & newName & ".txt"  
    Call ActiveDocument.SaveAs(FileName:=newName, FileFormat:=wdFormatUnicodeText)
End Sub

This version of the macro re-saves the active document, but you could change it so that the name of the file to save was supplied as an argument...

0 Karma

yannK
Splunk Employee
Splunk Employee

What about exporting to RTF ?

0 Karma

maverick
Splunk Employee
Splunk Employee

Not sure if this is what you are asking for, but it appears to me to be worth trying:

http://www.cometdocs.com/

0 Karma
Get Updates on the Splunk Community!

.conf24 | Registration Open!

Hello, hello! I come bearing good news: Registration for .conf24 is now open!   conf is Splunk’s rad annual ...

ICYMI - Check out the latest releases of Splunk Edge Processor

Splunk is pleased to announce the latest enhancements to Splunk Edge Processor.  HEC Receiver authorization ...

Introducing the 2024 SplunkTrust!

Hello, Splunk Community! We are beyond thrilled to announce our newest group of SplunkTrust members!  The ...