This repository was archived by the owner on Mar 30, 2025. It is now read-only.

Description
When i do a training on the s3dis dataset, after same epochs the output just stops.
I start the training with:
CUDA_VISIBLE_DEVICES=0 nohup python examples/segmentation/main.py --cfg cfgs/s3dis/pointnext-s.yaml > outputlog 2>&1 &
Sometimes it already stops in the first epoch, sometimes it runs for five-sixes epochs but eventually it stops.
The python process for the training is then still alive (status “sleeping”) and the hardware resources are also still allocated.
Can someone help?
output.log