I would like to aggregate our IIS logs and be able to quickly perform simple queries against them (e.g. how many times as page x hit in the past month etc.)
I'd like aggregrate this data (maybe down to 10 minutes or so granularity on the time dimension) into an SSAS cube.
We already import the logs via SSIS into tables. The tables are becoming very large, and I'd like to start deleting older data (say more than 3 months ago), whilst keeping the history in the cube (so I can still query over longer than 3 months). Is this a reasonable thing to do - clearly I'd not be able to rebuild my cube easily should I wish to change it... and I guess I'd need to start backing up the cube as if it was a database?
Currently I don't have a PK on the data - Would it be better to add an identity column or create a composite PK out of date,time and url?
Any feedback from someone that has successfully implemented this would be fantastic :)
Thanks!