I have a spark streaming job running in production with 1 sec batches. I use CDH 5.5 Spark 1.5. We use Kafka Create Directstream. We have enabled Back Pressure. We dont want to sue dynamic allocation So the job executed with fix number of executor.
From the below image i can see that these is sudden increase in the scheduling delay from 13.50 But during the same time i dont see any dealy in the processing time.
- What would be possible reasons for increase in the scheduling time whne processing times are same.
- Does other job loads in the cluster effect the current streaming job. In my understanding it should not be the case because the executors for streaming are pre-allocated and are already running
Any thoughts?
