I train an object object detection model, based on pre-trained model from TF2 Object Detection efficientdet _d2_coco17_tpu-32. https://tensorflow-object-detection-api-tutorial.readthedocs.io/en/latest/training.html https://github.com/tensorflow/models/blob/master/research/object_detection/g3doc/tf2_detection_zoo.md
I changed pipeline.config as needed to this process (I did it many time before, on eff_d1 or ssd models from tf2 object detection zoo).
I succseful trained this model on batch size 2 and 10K steps. But when tried to train on 100K steps/ 50K steps/ 20K steps I'm getting a 00M error.
I cant understant why it might happen.
Training on GPU - Nvidia GeForce RTX 3070 Ubuntu 20.04 TF 2.4.1
Any ideas, Thank you