I use my API logs to extract information like:
- In this period of time how many are the users of my API ?
- Or in this period of time, what type of services are called the most ?
Almost all the information I extract depend on the timestamp. Actually I use MongoDB and I added the time-stamp as an index(for 80GB, indexes size is 12GB).
A migration to cassandra or Hbase was recommended for me. And I want to know which is better for my use case:
- Analysis for timeseries data.
- Both good write and read performance are required.
- Possibility of using hadoop to do my data analysis.
Thanks for sharing your point of view or your experience.