bobchenyx
/

GLM-4.6-GGUF

Text Generation

Model card Files Files and versions

bobchenyx commited on Oct 5

Commit

f223ed8

·

verified ·

1 Parent(s): 307a739

Update README.md

Files changed (1) hide show

README.md +41 -3

README.md CHANGED Viewed

@@ -1,3 +1,41 @@
----
-license: mit
----

+---
+quantized_by: bobchenyx
+base_model:
+- zai-org/GLM-4.6
+base_model_relation: quantized
+license: mit
+tags:
+- GLM
+- GLM-4.6
+- transformers
+- GGUF
+pipeline_tag: text-generation
+---
+## Llamacpp Quantizations of zai-org/GLM-4.6
+Adopting **BF16** & **Imatrix** from [unsloth/GLM-4.6-GGUF](https://huggingface.co/unsloth/GLM-4.6-GGUF). (Huge fan of unsloth)
+Personalized Replication of Low-Bit Mixed Precision Quant using `--tensor-type` option in [llama.cpp](https://github.com/ggml-org/llama.cpp)
+```
+- IQ1_M : 83.63 GiB (2.01 BPW)
+```
+---
+## Download Guide
+```
+# !pip install huggingface_hub hf_transfer
+import os
+os.environ["HF_HUB_ENABLE_HF_TRANSFER"] = "1"
+from huggingface_hub import snapshot_download
+snapshot_download(
+    repo_id = "bobchenyx/GLM-4.6-GGUF",
+    local_dir = "bobchenyx/GLM-4.6-GGUF",
+    allow_patterns = ["*IQ1_M*"], # Q2_K_L,Q4_K_M
+)
+```