Hi Vu, Thanks for your great work. I wonder it is possible for training object detection only? When I read your code it seems that both are required. Best regards.