I'm using Hive/Hue/MapReduce with a JSON SerDe. To get this working, I copied json_serde.jar to several lib directories on every cluster node:
- /opt/cloudera/parcels/CDH/lib/hive/lib
- /opt/cloudera/parcels/CDH/lib/hadoop-mapreduce/lib
- /opt/cloudera/parcels/CDH/lib/hadoop/lib
- /opt/cloudera/parcels/CDH/lib/hadoop-0.20-mapreduce/lib
- ...
After every CDH update of the cluster, I have to repeat this on every node. Is there a more elegant way to distribute the SerDe across the cluster automatically, so that it survives updates?