During Hbase data migration I have encountered ajava.lang.IllegalArgumentException: KeyValue size too large
In long term :
I need to increase the properties hbase.client.keyvalue.maxsize
(from 1048576 to 10485760) in the /etc/hbase/conf/hbase-site.xml
but I can't change this file now (I need validation).
In short term :
I have success to import data using command :
hbase org.apache.hadoop.hbase.mapreduce.Import \
-Dhbase.client.keyvalue.maxsize=10485760 \
myTable \
myBackupFile
Now I need to run a Spark Job using spark-submit
What is the better way :
- Prefix the HBase properties with 'spark.' (I'm not sure it's possible and if it's works)
spark-submit \
--conf spark.hbase.client.keyvalue.maxsize=10485760
- Using 'spark.executor.extraJavaOptions' and 'spark.driver.extraJavaOptions' to explicitly transmit HBase properties
spark-submit \
--conf spark.executor.extraJavaOptions=-Dhbase.client.keyvalue.maxsize=10485760 \
--conf spark.driver.extraJavaOptions=-Dhbase.client.keyvalue.maxsize=10485760
spark
prefix will work, you' need to useextraJavaOptions
. – Ben Watson