https://github.com/zai-org/GLM-4.5
https://huggingface.co/zai-org/GLM-4.5
https://github.com/vllm-project/llm-compressor
See recipe.yaml for the quantization recipe.
recipe.yaml