Hello,
Thanks for sharing the code. After downloading the required dataset, the code performs well. However, when I try to modify the optimizer, for example, replace adam by sgd, the training process cannot converge even with smaller learning rate.
Could you give me some suggestions or some hints I shall take care of ? In any case thanks a lot!