Commit 0a92804 (verified) by akera · 1 parent: fc44b32

Add GGUF quantized Sunflower models

.gitattributes CHANGED
@@ -33,3 +33,10 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ sunflower-imatrix.dat filter=lfs diff=lfs merge=lfs -text
+ sunflower-merged-f16.gguf filter=lfs diff=lfs merge=lfs -text
+ sunflower-q4_k_m.gguf filter=lfs diff=lfs merge=lfs -text
+ sunflower-q5_k_m.gguf filter=lfs diff=lfs merge=lfs -text
+ sunflower-q5_k_s.gguf filter=lfs diff=lfs merge=lfs -text
+ sunflower-q6_k.gguf filter=lfs diff=lfs merge=lfs -text
+ sunflower-q8_0.gguf filter=lfs diff=lfs merge=lfs -text
Modelfile ADDED
@@ -0,0 +1,22 @@
+ FROM sunflower-q4_k_m.gguf
+
+ # System message
+ SYSTEM """You are a linguist and translator specialising in Ugandan languages, made by Sunbird AI."""
+
+ TEMPLATE """<|im_start|>system
+ {{ .System }}<|im_end|>
+ <|im_start|>user
+ {{ .Prompt }}<|im_end|>
+ <|im_start|>assistant
+ {{ .Response }}<|im_end|>"""
+
+ # Stop tokens
+ PARAMETER stop "<|im_start|>"
+ PARAMETER stop "<|im_end|>"
+
+ # Quality parameters
+ PARAMETER temperature 0.3
+ PARAMETER top_p 0.95
+ PARAMETER top_k 40
+ PARAMETER repeat_penalty 1.1
+ PARAMETER num_ctx 4096
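
Reviewer note: the TEMPLATE in this Modelfile follows the ChatML layout, where each turn is wrapped in `<|im_start|>role` and `<|im_end|>`. The sketch below reproduces that layout with plain Python string formatting so the resulting prompt can be inspected. `render_chatml` is a hypothetical helper, not part of Ollama, and the sketch assumes the response slot is left empty at generation time so that the model itself emits the closing `<|im_end|>`.

```python
# System message copied from the Modelfile in this commit.
SYSTEM = (
    "You are a linguist and translator specialising in Ugandan "
    "languages, made by Sunbird AI."
)


def render_chatml(prompt: str, system: str = SYSTEM, response: str = "") -> str:
    """Mimic the Modelfile TEMPLATE for a single user turn (illustrative only)."""
    text = (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{prompt}<|im_end|>\n"
        f"<|im_start|>assistant\n{response}"
    )
    # Assumption: the closing tag is only present once a response exists.
    # During generation the model produces it, and the stop parameters
    # ("<|im_start|>", "<|im_end|>") end the output there.
    if response:
        text += "<|im_end|>"
    return text


if __name__ == "__main__":
    print(render_chatml("Translate to Luganda: Good morning"))
```

Printing the rendered string makes it easy to compare against the prompt another runtime builds for the same GGUF file.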
README.md ADDED
@@ -0,0 +1,91 @@
+ ---
+ license: apache-2.0
+ base_model: Sunbird/qwen3-14b-sunflower-merged
+ tags:
+ - quantized
+ - gguf
+ - llama.cpp
+ - ollama
+ - ugandan-languages
+ - translation
+ - qwen
+ library_name: transformers
+ pipeline_tag: text-generation
+ language:
+ - en
+ - lg
+ ---
+
+ # Sunflower 14B - GGUF
+
+ GGUF quantized versions of the Sunflower model for Ugandan language translation tasks.
+
+ ## Model Details
+
+ - **Base Model**: [Sunbird/qwen3-14b-sunflower-merged](https://huggingface.co/Sunbird/qwen3-14b-sunflower-merged)
+ - **Model Size**: 14B parameters
+ - **Architecture**: Qwen3
+ - **Quantization**: llama.cpp K-quants (plus Q8_0) with importance matrix calibration
+ - **Languages**: English, Luganda, and other Ugandan languages
+
+ ## Available Files
+
+ | Filename | Quant type | File Size | Description |
+ | -------- | ---------- | --------- | ----------- |
+ | sunflower-merged-f16.gguf | F16 | 27.5 GiB | Original precision |
+ | sunflower-q8_0.gguf | Q8_0 | 14.6 GiB | Highest quality quantized |
+ | sunflower-q6_k.gguf | Q6_K | 11.3 GiB | High quality |
+ | sunflower-q5_k_m.gguf | Q5_K_M | 9.8 GiB | Balanced quality/size |
+ | sunflower-q5_k_s.gguf | Q5_K_S | 9.6 GiB | Smaller Q5 variant |
+ | sunflower-q4_k_m.gguf | Q4_K_M | 8.4 GiB | Recommended for most users |
+ | sunflower-imatrix.dat | - | 7.4 MiB | Importance matrix data |
+
+ ## Usage
+
+ ### llama.cpp
+
+ ```bash
+ # Download model
+ huggingface-cli download Sunbird/Sunflower-14B-GGUF sunflower-q4_k_m.gguf --local-dir .
+
+ # Run inference
+ ./llama-cli -m sunflower-q4_k_m.gguf -p "Translate to Luganda: Hello, how are you today?"
+ ```
+
+ ### Ollama
+
+ ```bash
+ # Create a minimal Modelfile (the Modelfile in this repository also sets the system message, chat template, and sampling parameters)
+ echo 'FROM ./sunflower-q4_k_m.gguf' > Modelfile
+
+ # Create model
+ ollama create sunflower-14b -f Modelfile
+
+ # Use model
+ ollama run sunflower-14b "Translate to Luganda: Good morning"
+ ```
+
+ ### Python (llama-cpp-python)
+
+ ```python
+ from llama_cpp import Llama
+
+ llm = Llama(model_path="sunflower-q4_k_m.gguf", n_ctx=4096)  # context size as in the Modelfile
+ result = llm("Translate to Luganda: How are you?", max_tokens=128)  # the default of 16 tokens truncates most translations
+ print(result['choices'][0]['text'])
+ ```
+
+ ## Performance Notes
+
+ - **Q4_K_M**: Smallest file; recommended for most use cases
+ - **Q5_K_M**: Better quality for a moderate size increase
+ - **Q6_K**: Higher quality at a larger file size
+ - **Q8_0**: Near-lossless quality; largest quantized file
+
+ ## Technical Details
+
+ Quantized using llama.cpp with importance matrix calibration (`sunflower-imatrix.dat`) to limit quality loss.
+
+ ## License
+
+ Apache 2.0
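
Reviewer note: the README's Python example sends a raw completion prompt, while the Modelfile in this commit defines a system message, stop tokens, and sampling values. The sketch below shows one way to reuse those settings through llama-cpp-python's chat API. `build_request` and `translate` are hypothetical helper names, and `chat_format="chatml"` is an assumption based on the `<|im_start|>`/`<|im_end|>` template rather than something the commit states.

```python
# Values copied from the Modelfile in this commit.
SYSTEM_PROMPT = (
    "You are a linguist and translator specialising in Ugandan "
    "languages, made by Sunbird AI."
)
SAMPLING = {
    "temperature": 0.3,
    "top_p": 0.95,
    "top_k": 40,
    "repeat_penalty": 1.1,
}
STOP_TOKENS = ["<|im_start|>", "<|im_end|>"]


def build_request(user_text: str, max_tokens: int = 256) -> dict:
    """Assemble keyword arguments for Llama.create_chat_completion."""
    return {
        "messages": [
            {"role": "system", "content": SYSTEM_PROMPT},
            {"role": "user", "content": user_text},
        ],
        "max_tokens": max_tokens,
        "stop": list(STOP_TOKENS),
        **SAMPLING,
    }


def translate(model_path: str, user_text: str) -> str:
    """Load a GGUF file and run a single chat turn against it."""
    # Imported lazily so build_request works without llama-cpp-python installed.
    from llama_cpp import Llama

    # Assumption: the ChatML chat format matches the Modelfile template.
    llm = Llama(model_path=model_path, n_ctx=4096, chat_format="chatml")
    result = llm.create_chat_completion(**build_request(user_text))
    return result["choices"][0]["message"]["content"]
```

Calling `translate("sunflower-q4_k_m.gguf", "Translate to Luganda: How are you?")` requires the downloaded model file. Keeping the request in a separate function makes the settings easy to inspect or adjust without loading the model.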
sunflower-imatrix.dat ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9717ce1e0643a19b3686e86162ce04db8bd01635954eb85284826a3834ebc0f8
+ size 7743552
sunflower-merged-f16.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:eded257f1acad597e1ddda3f6e69cd387b58bfe8f6ef45dbcae724b68f0b1ba5
+ size 29543423264
sunflower-q4_k_m.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9d548245e1b7d4d52e9dad363e3f0076dbf8ee8a4e9eaf66332c8b856cfd9bbc
+ size 9001753088
sunflower-q5_k_m.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:23c419c7560a205cecc991ca95bd0f7d55b641d1d250efe92172b58c39fb3ee6
+ size 10514569728
sunflower-q5_k_s.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:2d1aa7f358e345bf0bad2ce21f82fb7e830e9650ab5ce6ad2d54dd6362bd9cb6
+ size 10263894528
sunflower-q6_k.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5b2074f4414b47b05c7d468cebc9c9ed2a21443cdd8e58cb489d20be4693b942
+ size 12121937408
sunflower-q8_0.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:e50d50e7cfd15af8ad6ae0accdf13a8fe740d7daca95d38a36c78034ee8d4f9b
+ size 15698533888
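
Reviewer note: each Git LFS pointer above records the SHA-256 digest and exact byte count of the file it stands for, so a download can be checked against it. The sketch below parses a pointer (with the leading `+ ` diff markers removed) and compares a local file to it. `parse_lfs_pointer`, `matches_pointer`, and `to_gib` are hypothetical helper names introduced for illustration.

```python
import hashlib
from pathlib import Path

# Pointer for sunflower-imatrix.dat, copied from this commit without "+ " markers.
IMATRIX_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:9717ce1e0643a19b3686e86162ce04db8bd01635954eb85284826a3834ebc0f8
size 7743552
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer into its version, hex digest, and byte size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algorithm, digest = fields["oid"].split(":", 1)
    if algorithm != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algorithm}")
    return {"version": fields["version"], "oid": digest, "size": int(fields["size"])}


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the pointer's size and SHA-256."""
    path = Path(path)
    # Cheap check first: a size mismatch avoids hashing a multi-gigabyte file.
    if path.stat().st_size != pointer["size"]:
        return False
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == pointer["oid"]


def to_gib(size_bytes: int) -> float:
    """Convert a byte count to binary gigabytes (GiB)."""
    return size_bytes / 2**30
```

For example, `matches_pointer("sunflower-imatrix.dat", parse_lfs_pointer(IMATRIX_POINTER))` checks a downloaded importance matrix, and `to_gib` relates the exact pointer sizes to the rounded figures in the README table.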