GPTQ is currently not available in llama-fast.

@HDCharles, can you please bring the GPTQ support from gpt-fast over to llama-fast?

x-ref: https://github.com/pytorch-labs/gpt-fast/pull/148

Thanks so much!

@supriyar, please re-assign if appropriate.