I have referred below links and did the same changes
- https://aws.amazon.com/premiumsupport/knowledge-center/core-node-emr-cluster-disk-space/
- Cleaning up Spark history logs
And restarted history-server and resource-manager but It is not deleting containers logs after defined time. it is causing issue of unhealthy node.
I have configuration like below
- /etc/hadoop/conf/yarn-site.xml
<property>
<name>yarn.log-aggregation.retain-seconds</name>
<value>600</value>
</property>
- /etc/logpusher/hadoop.config
"/var/log/hadoop-yarn/containers" : {
"includes" : [ "(.*)" ],
"s3Path" : "containers/$0",
"retentionPeriod" : "1h",
"deleteEmptyDirectories": true,
"logType" : [ "USER_LOG", "SYSTEM_LOG" ]
}
3 . /etc/spark/spark-defaults.conf
spark.history.fs.cleaner.enabled true
spark.history.fs.cleaner.maxAge 1h
spark.history.fs.cleaner.interval 1h
Could you please suggest what I am missing?
spark.eventLog.enabled, see issues.apache.org/jira/browse/SPARK-5210 - ollik1