I am quite new to machine learning, so I need some help.
I have spark streaming job which ingests data about user electricity consumption into Cassandra. I fill multiple tables with that data, out of which is most important "hourly_data", which specifies how much electricity each user spent within specific hour.
What I want to do, is some forecasting about how much electricity user will spend until the end of the day, month or year.
Which libraries and models I should use for that? Is the regression what I actually need?
I guess I cannot do forecasting in streaming job, but I need to start a batch process for that?
Also, it would be nice if I could for a specific day, plot the expected user behaviour until the end of the day (same for the month or a year...) Which libraries in Spark can help me do that? Any tutorials?
Thanks a lot