Hello!
I noticed your ONNX export script (also referenced in the README), but I can't manage to produce a smaller, quantized ONNX file with it.
I tried multiple scripts, including the one from the transformers.js repo, but without success.
From what I understand, such a script must already exist somewhere, because a quantized model is published here: https://huggingface.co/minishlab/M2V_base_output/blob/main/onnx/model_quantized.onnx .
Please consider adding support for quantized ONNX files.
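
For illustration, one generic route to a quantized file is ONNX Runtime's dynamic quantization (`onnxruntime.quantization.quantize_dynamic`). The sketch below is not the project's script and is not known to be how `model_quantized.onnx` was produced. The helper names `quantized_path` and `quantize_onnx` are hypothetical, the `QUInt8` weight type is an assumption, and whether this step handles the exported model correctly is untested.

```python
from pathlib import Path


def quantized_path(model_path: Path) -> Path:
    """Hypothetical helper: map model.onnx to model_quantized.onnx in the same folder."""
    return model_path.with_name(f"{model_path.stem}_quantized{model_path.suffix}")


def quantize_onnx(model_path: Path) -> Path:
    """Write a dynamically quantized (8-bit weights) copy next to the input file."""
    # Imported lazily so the path helper works without onnxruntime installed.
    from onnxruntime.quantization import QuantType, quantize_dynamic

    output_path = quantized_path(model_path)
    quantize_dynamic(
        model_input=str(model_path),
        model_output=str(output_path),
        weight_type=QuantType.QUInt8,  # assumption: unsigned 8-bit weights
    )
    return output_path
```

Under these assumptions, calling `quantize_onnx(Path("onnx/model.onnx"))` on an already exported float model would write `onnx/model_quantized.onnx` beside it; the input path here is a placeholder, not a path confirmed by the repository.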