Machine Learning Insight – Adapt to a “fake” loss reduction. The model will adapt to the loss reduction to maintain clearity. // Per
Just try loss = F.mean_squared_error(y,yt) then loss = loss/(j+10) so its a reduction that came from nowhere. So maybe the reduction should be supervised aswell.