0
votes

I am using Hortonworks environment for big data processing.

I observed that as part of Map Reduce jobs from Hive/Oozie/Pig etc my below local cache directory is getting filled up.It creates one folder for every map reduce execution with the jar files.

/hadoop/yarn/local/usercache/root/filecache

Is it fine if I delete these folders directly? is there any configuration that I can set to remove it automatically?

1

1 Answers

1
votes

These directories should be cleaned up automatically every 600000ms (10 minutes).

Or you can reduce this timeperiod by setting up this property in yarn-site.xml, yarn.nodemanager.localizer.cache.cleanup.interval-ms.

You can also change the maximum size this cache directory should occupy, yarn.nodemanager.localizer.cache.target-size-mb. The default target size is 10240MB.

If no jobs are running, manual deletion of cache is fine. Restart nodemanager after deleteing the cache.