I have a pseudo-distributed Hadoop environment and am using Sqoop and Hive for data import and analysis.
I used Sqoop to import data from MySQL into Hive and it worked: the data landed in my Hive location, which is /user/hive/warehouse/, and I can see it in the Hive table.
My questions are:
- Is the data stored in Hadoop when I import from MySQL into Hive (which is built on top of Hadoop)?
- Why is it creating /user/hive/warehouse in my HDFS?
- How does Hive work on top of that data?
- If I update data in MySQL, I can use a Sqoop incremental import to apply the same update in HDFS. But if I use a Hive import directly, the change will be reflected in the Hive table but not in HDFS. Is my understanding correct?
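
To make the last question concrete, here is a rough sketch of the two kinds of Sqoop import I am comparing. The database URL, user name, table name, check column, last value, and target directory are all placeholders I made up for illustration, not my real setup. The script only builds and prints the commands (a dry run), since actually running them needs a live MySQL server and Hadoop cluster.

```shell
# Placeholder connection details (hypothetical, adjust to your environment).
DB_URL="jdbc:mysql://localhost/testdb"
TABLE="employees"

# 1) Direct Hive import: Sqoop loads the table into Hive's warehouse.
HIVE_IMPORT_CMD="sqoop import --connect $DB_URL --username myuser -P --table $TABLE --hive-import"

# 2) Incremental import into an HDFS directory: only rows whose check
#    column is greater than the last imported value are appended.
INCR_IMPORT_CMD="sqoop import --connect $DB_URL --username myuser -P --table $TABLE --target-dir /user/hadoop/$TABLE --incremental append --check-column id --last-value 100"

# Print the commands instead of executing them.
echo "$HIVE_IMPORT_CMD"
echo "$INCR_IMPORT_CMD"
```

After a real run, something like `hdfs dfs -ls /user/hive/warehouse` should show whether the Hive table's files are present in HDFS.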
I am confused about how data ingestion with Sqoop relates to HDFS.
Can somebody help me with this?