I wanted to use Titan Db (Storage Back end: HBase) for performing aggregations on the graph Data we have maintained. We aim to store our Data Warehouse data in the form of a graph in Titan DB. However , aggregations take a lot of time , i am using one instance of titan. Steps followed: 1. Creation of graph -4.5lac vertices , 4 lac edges. 2. Creation of Indexes -vertex & edge. 3. Enable database caching.
when i traverse the graph over a depth of 4 to find the sum to find sum of approx 8000 vertices on a property, i see that the first time it takes approx 30 secs to respond the query. Subsequent queries till 3 mins(database cache is flushed at 3 ins) served in under one sec. But after 3 mins again it takes 30 secs to rebuild the cache and provide the reponse. Has anyone has a similar use-case and any suggestions as to how i can make titan perform faster for aggregations? I am expecting a real time performance from titan.