I have a CSV file with the following fields:
Updated Date, SMSMessage, Sender, SMS Date, userID
The SMSMessage field contains various text messages. I want to group similar messages together into clusters.
I have already tried the "cluster" command in Splunk. It groups the messages to some extent, but I am not satisfied with the results.
Is there a better, more sophisticated method (perhaps an ML algorithm) for clustering text messages?
Well, you could try the Splunk Machine Learning Toolkit, but explaining it in detail is a little beyond the scope of a single answer here.
Start with the toolkit: its description links to a YouTube playlist and an algorithm cheat sheet, so that's a good place to get started. However, machine learning is a little more involved than the "cluster" command, so your results may vary 😉
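To give a rough idea of what text clustering does under the hood, here is a minimal sketch in plain Python (standard library only; it is not SPL and does not use the toolkit's API). It turns each message into a TF-IDF vector and greedily groups messages whose cosine similarity to a cluster's first message meets a threshold. The function names, the 0.5 threshold, and the sample messages are my own assumptions for illustration, not something taken from your data.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the message and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(messages):
    """Return one L2-normalised sparse TF-IDF vector (a dict) per message."""
    docs = [Counter(tokenize(m)) for m in messages]
    n = len(docs)
    # Document frequency: in how many messages each token appears.
    df = Counter()
    for d in docs:
        df.update(d.keys())
    vectors = []
    for d in docs:
        # Smoothed IDF, so a token present in every message keeps a non-zero weight.
        vec = {t: tf * (math.log((1 + n) / (1 + df[t])) + 1) for t, tf in d.items()}
        norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
        vectors.append({t: w / norm for t, w in vec.items()})
    return vectors


def cosine(a, b):
    """Cosine similarity of two already-normalised sparse vectors."""
    if len(a) > len(b):
        a, b = b, a
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def cluster_messages(messages, threshold=0.5):
    """Greedy single-pass clustering (hypothetical helper, assumed threshold).

    Each message joins the first cluster whose seed (first member) has a
    cosine similarity >= threshold; otherwise it starts a new cluster.
    Returns one integer cluster label per message.
    """
    vectors = tfidf_vectors(messages)
    seeds = []   # index of the first message of each cluster
    labels = []
    for i, v in enumerate(vectors):
        for label, s in enumerate(seeds):
            if cosine(v, vectors[s]) >= threshold:
                labels.append(label)
                break
        else:
            seeds.append(i)
            labels.append(len(seeds) - 1)
    return labels


if __name__ == "__main__":
    # Made-up sample messages, standing in for the SMSMessage field.
    sample = [
        "Your OTP is 1234",
        "Your OTP is 9876",
        "Flat 50% off on shoes today",
        "Flat 30% off on shoes today",
    ]
    print(cluster_messages(sample))  # [0, 0, 1, 1]
```

Raising the threshold gives more, tighter clusters and lowering it merges more messages together, so expect to tune it on your own data. Because the grouping is greedy, the result also depends on the order of the messages.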
Hope that helps. If it does, I'd be happy if you would upvote/accept this answer so others can benefit from it. 🙂