Dear authors, thanks for your inspiring work. Could you please add an implementation of inference using the base model instead of the distilled ones? As far as I can tell, the provided code does not specify how to use the pretrained base model weights provided on Hugging Face.
Thanks a lot in advance!