I have an Elasticsearch 2.2.0 cluster: 1 node, 4 GB heap, 7 GB RAM, 2 CPU cores, 401 indices, 1,873 shards, 107,780,287 docs, and 70.19 GB of data in total.
I have also configured indices.fielddata.cache.size: 40%.
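For reference, here is a rough sketch of what that setting amounts to on this node. It assumes the "4g" heap means 4 GiB and that the percentage is taken relative to the configured heap size; the function name is just illustrative.

```python
GIB = 1024 ** 3


def fielddata_cache_limit_bytes(heap_bytes: int, cache_percent: float) -> int:
    """Return the fielddata cache ceiling implied by a percentage of the heap."""
    return int(heap_bytes * cache_percent / 100)


heap = 4 * GIB  # assumption: "4g" heap is 4 GiB
limit = fielddata_cache_limit_bytes(heap, 40)
print(f"fielddata cache limit: {limit / GIB:.2f} GiB")  # prints 1.60 GiB
```

So fielddata alone is allowed to grow to roughly 1.6 GiB of the 4 GiB heap before evictions start.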
The problem appears when I use Kibana to run queries (very simple ones). A single query works fine, but if I keep querying, Elasticsearch becomes very slow and eventually gets stuck because the JVM heap usage (as reported by Marvel) climbs to 87-95%. The same thing happens when I try to load some Kibana dashboards. The only way out of this situation is to restart the Elasticsearch service or clear all caches.
Why is the heap stuck like that?
EDIT:
_nodes/stats when the heap is stuck
_nodes/stats when the cluster is in a normal state
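To compare the two outputs, a small helper like the one below could pull out the heap-related fields. This is only a sketch: it assumes the JSON from GET /_nodes/stats has already been loaded into a dict, `summarize_node_stats` is a hypothetical name, and the sample values are made up for illustration (they are not taken from my cluster).

```python
def summarize_node_stats(stats: dict) -> list:
    """Extract heap, fielddata and segment memory figures for each node."""
    rows = []
    for node_id, node in stats.get("nodes", {}).items():
        mem = node["jvm"]["mem"]
        indices = node["indices"]
        rows.append({
            "node": node.get("name", node_id),
            "heap_used_percent": mem["heap_used_percent"],
            "fielddata_bytes": indices["fielddata"]["memory_size_in_bytes"],
            "fielddata_evictions": indices["fielddata"]["evictions"],
            "segments_bytes": indices["segments"]["memory_in_bytes"],
        })
    return rows


# Made-up sample shaped like a _nodes/stats response (illustrative values only).
sample = {
    "nodes": {
        "abc123": {
            "name": "node-1",
            "jvm": {"mem": {"heap_used_percent": 91}},
            "indices": {
                "fielddata": {"memory_size_in_bytes": 1500000000, "evictions": 0},
                "segments": {"memory_in_bytes": 900000000},
            },
        }
    }
}

summary = summarize_node_stats(sample)
for row in summary:
    print(row)
```

Running it on both the "stuck" and the "normal" output makes it easier to see whether fielddata or segment memory accounts for the difference.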
EDIT 2:
To better understand the problem, I went as far as analyzing a memory dump. The analysis was performed after the cluster got stuck while running some Kibana queries:
In some indices I do have a _ttl setting that is NOT working (the _ttl is set to 4 weeks, but the documents are still there...). I have changed the default mappings since then but have not deleted the indices with the non-working _ttl.
Could this be the main problem?




GET /_nodes/stats? - Andrei Stefan