Currently, we are using Hadoop and Snowflake for storing our data.
The process is to copy the Hadoop ORC files to the Snowflake S3 location using DistCp and then run COPY INTO to load the Snowflake table from S3. This copies everything in the Hadoop ORC table into the Snowflake table.
Now I have a new requirement: my Hadoop table is a transactional table, and existing entries are updated every hour. If I copy the ORC files to S3 and run the COPY command, it appends more entries to the existing table instead of updating the existing ones.
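To make the difference concrete, here is a minimal Python sketch of the behavior I am seeing versus the behavior I need. It is a toy model only, not Snowflake code: the `id` and `status` columns, the sample rows, and the function names are all made up for illustration, and I am assuming each row can be identified by a single key column.

```python
# Toy model: a "table" is a list of rows, each row a dict.
# The key column "id" and the sample data are hypothetical.

def copy_append(table, incoming):
    """Mimics what happens today: every incoming row is added as a new row."""
    return table + list(incoming)


def upsert(table, incoming, key="id"):
    """What I need: overwrite rows whose key already exists, insert the rest."""
    merged = {row[key]: row for row in table}
    for row in incoming:
        merged[row[key]] = row  # replaces an existing row or adds a new one
    return list(merged.values())


existing = [{"id": 1, "status": "NEW"}, {"id": 2, "status": "NEW"}]
hourly = [{"id": 2, "status": "SHIPPED"}, {"id": 3, "status": "NEW"}]

appended = copy_append(existing, hourly)  # id 2 now appears twice
upserted = upsert(existing, hourly)       # id 2 is updated in place

print(len(appended), len(upserted))
```

With the append behavior the table ends up with two rows for `id` 2, whereas I want a single row per key holding the latest values.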
How can I solve this problem in Snowflake?