Hi risgupta,
Have you determined where is the bottleneck in data pipeline?
In Monitoring Console, go to indexing performance - instance/deployment, and the panels there can give you a good understanding of the indexing performance across all the components in the indexing pipeline set. Median Fill Ratio of Data Processing Queues will be very helpful in determining the bottleneck.
You can also take a closer look at metrics.log, which periodically samples Splunk activity every 30 seconds and reports top 10 items in each category to reveal the whole picture across the toplogy, including forwarding thruput and indexing thruput.
index=_internal source=metrics.log host=xyz
The log has a variety of inspection information:
group – indicates the data type: pipeline, queue, thruput, tcpout_connections, udpin_connections, and mpool
group=pipeline – plots the frequency and the duration of the pipeline process machinery
group=queue – displays the data to be processed
* current_size can identify which are the bottlenecks
09-07-2016 17:07:21.416 +0000 INFO Metrics - group=pipeline, name=parsing, processor=utf8,
cpu_seconds=0.000000, executes=23, cumulative_hits=691835
09-07-2016 17:07:21.416 +0000 INFO Metrics - group=queue, name=parsingqueue, blocked!!=true,
max_size=1000, filled_count=0, empty_count=8, current_size=0, largest_size=2, smallest_size=0
Hope this helps. Thanks!
Hunter
... View more