I am currently using a Dataproc cluster with a fixed number of workers. Each worker has a non-trivial initialization action needed to install some specific libraries on the workers.
Recently, we decided to try to use some preemptible workers, but our Spark jobs are failing because some libraries are missing. The reason seems to be that there is no initialization actions on the preemptible workers. In fact, I have connected using ssh to these workers and I am completely sure that the initialization script is not executed on these preemptible workers, because the expected libraries are not there and our initialization script leaves a log of its execution, which is missing.
Is this a normal situation? How can I ensure that my preemptible workers have run my custom initialization action script?