Loss Scaling and Step Size in Deep Learning Optimizatio