I wrote a Theano machine learning program. But I got two absolutely different results between on CPU and on GPU.
Below is the log.(only a tiny part of log)
The loss function will quickly decrease and then converg to 0.2 on CPU.
However, the loss function will increase and finally become NaN on GPU.
What mistakes may be in my program? Or what should I take into attention? Thank you!