I have a Hive table. If I have a requirement that the data will be coming into the Hive table daily. If the data which is coming in is a new record(inserts) then the record should be inserted into hive table or if the data which is coming in is already existing(updates) in hive then the record should be updated.
Can anyone explain how this is achieved in Hive.
I was checking online i found this article. http://hortonworks.com/blog/four-step-strategy-incremental-updates-hive/