Commit f26846f (verified), committed by tzervas
1 Parent(s): d0342f9

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +6 -4
README.md CHANGED
@@ -49,7 +49,9 @@ This is an **experimental** BitNet 1.58-bit quantization of the Qwen2.5-Coder-32
  | Format | File | Description |
  |--------|------|-------------|
  | SafeTensors | `model-*.safetensors` | Sharded quantized weights + scales |
- | GGUF | `qwen-coder-32b-tq2.gguf` | llama.cpp compatible |
+ | GGUF | `qwen2.5-coder-32b-TQ2_0.gguf` | llama.cpp TQ2_0 format (experimental) |
+
+ > **GGUF Compatibility Note**: The GGUF conversion is experimental. Our BitNet quantization uses group size 64, while TQ2_0 uses 256-element blocks. This may cause compatibility issues with some inference engines. The SafeTensors format is the primary supported format.
 
  ## Quantization Method
 
@@ -71,10 +73,10 @@ This is an **experimental** BitNet 1.58-bit quantization of the Qwen2.5-Coder-32
 
  ## Usage
 
- ### With Ollama/llama.cpp
+ ### With Ollama/llama.cpp (experimental)
  ```bash
- # llama.cpp
- ./llama-cli -m qwen-coder-32b-tq2.gguf -p "Write a Python function:"
+ # llama.cpp (GGUF format - experimental, may have issues)
+ ./llama-cli -m qwen2.5-coder-32b-TQ2_0.gguf -p "Write a Python function:"
  ```
 
  ### Unpacking Weights (Python)
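
The compatibility note added in this commit contrasts a group size of 64 with TQ2_0's 256-element blocks. The sketch below illustrates why that mismatch can matter. It rests on two assumptions that this diff does not confirm: that the weights are quantized with an absmean ternary scheme (one scale per 64-element group), and that a 256-element block keeps only a single scale. All function names are hypothetical, and averaging the group scales is just one possible repacking, not necessarily what the actual conversion does.

```python
import random

GROUP_SIZE = 64    # group size stated in the compatibility note
BLOCK_SIZE = 256   # TQ2_0 block size stated in the compatibility note


def group_quantize(weights, group_size=GROUP_SIZE):
    """Return ternary codes and one absmean scale per group (assumed scheme)."""
    codes, scales = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        # Mean absolute value of the group; fall back to 1.0 for an all-zero group.
        scale = sum(abs(w) for w in group) / len(group) or 1.0
        codes.extend(max(-1, min(1, round(w / scale))) for w in group)
        scales.append(scale)
    return codes, scales


def dequantize(codes, scales, group_size):
    """Rebuild weights, using scales[i] for the i-th run of group_size codes."""
    return [c * scales[i // group_size] for i, c in enumerate(codes)]


def collapse_scales(scales, groups_per_block=BLOCK_SIZE // GROUP_SIZE):
    """Hypothetical repacking: average each run of group scales into one block scale."""
    collapsed = []
    for i in range(0, len(scales), groups_per_block):
        chunk = scales[i:i + groups_per_block]
        collapsed.append(sum(chunk) / len(chunk))
    return collapsed


rng = random.Random(0)
# One 256-element block whose four 64-element groups differ in magnitude.
weights = [
    rng.gauss(0.0, sigma)
    for sigma in (0.01, 0.05, 0.2, 1.0)
    for _ in range(GROUP_SIZE)
]

codes, scales = group_quantize(weights)
per_group = dequantize(codes, scales, GROUP_SIZE)
per_block = dequantize(codes, collapse_scales(scales), BLOCK_SIZE)

worst = max(abs(a - b) for a, b in zip(per_group, per_block))
print(f"group scales: {[round(s, 4) for s in scales]}")
print(f"largest change after collapsing to one block scale: {worst:.4f}")
```

When the four groups in a block have similar magnitudes, the collapsed scale is close to each group scale and little is lost. When they differ, as in this synthetic input, the reconstructed weights shift, which is consistent with the note's advice to treat SafeTensors as the primary format.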