Hello, I'm trying to train the model, but it takes 3 days for one epoch!!!! Is that normal? I'm using dateset that contain about 100,000 records of short text. Can you tell me your device configuration ?!