How to use a trained model to perform inference and generate answers with only PNG images and without NPY label input information