---
license: apache-2.0
language:
- en
base_model:
- prithivMLmods/Blitzar-Coder-4B-F.1
pipeline_tag: text-generation
library_name: transformers
tags:
- text-generation-inference
- code
- coder
---

# **Blitzar-Coder-4B-F.1-GGUF**

> **Blitzar-Coder-4B-F.1** is a high-efficiency, multi-language coding model fine-tuned on **Qwen3-4B** using **larger coding traces datasets** spanning **10+ programming languages** including Python, Java, C#, C++, C, Go, JavaScript, TypeScript, Rust, and more. This model delivers exceptional code generation, debugging, and reasoning capabilities—making it an ideal tool for developers seeking advanced programming assistance under constrained compute.

## Model Files

| Filename | Size | Format | Description |
|----------|------|--------|-------------|
| Blitzar-Coder-4B-F.1.BF16.gguf | 8.05 GB | BF16 | Brain Float 16-bit quantization |
| Blitzar-Coder-4B-F.1.F16.gguf | 8.05 GB | F16 | Half precision (16-bit) floating point |
| Blitzar-Coder-4B-F.1.F32.gguf | 16.1 GB | F32 | Full precision (32-bit) floating point |
| Blitzar-Coder-4B-F.1.Q2_K.gguf | 1.67 GB | Q2_K | 2-bit quantization with K-quant |
| Blitzar-Coder-4B-F.1.Q3_K_L.gguf | 2.24 GB | Q3_K_L | 3-bit quantization (Large) with K-quant |
| Blitzar-Coder-4B-F.1.Q3_K_M.gguf | 2.08 GB | Q3_K_M | 3-bit quantization (Medium) with K-quant |
| Blitzar-Coder-4B-F.1.Q3_K_S.gguf | 1.89 GB | Q3_K_S | 3-bit quantization (Small) with K-quant |
| Blitzar-Coder-4B-F.1.Q4_K_M.gguf | 2.5 GB | Q4_K_M | 4-bit quantization (Medium) with K-quant |
| Blitzar-Coder-4B-F.1.Q4_K_S.gguf | 2.38 GB | Q4_K_S | 4-bit quantization (Small) with K-quant |
| Blitzar-Coder-4B-F.1.Q5_K_M.gguf | 2.89 GB | Q5_K_M | 5-bit quantization (Medium) with K-quant |
| Blitzar-Coder-4B-F.1.Q5_K_S.gguf | 2.82 GB | Q5_K_S | 5-bit quantization (Small) with K-quant |
| Blitzar-Coder-4B-F.1.Q6_K.gguf | 3.31 GB | Q6_K | 6-bit quantization with K-quant |
| Blitzar-Coder-4B-F.1.Q8_0.gguf | 4.28 GB | Q8_0 | 8-bit quantization |

### Recommended Usage

- **Q4_K_M** or **Q5_K_M**: Best balance of quality and performance for most users
- **Q6_K** or **Q8_0**: Higher quality, larger file sizes
- **Q2_K** or **Q3_K_S**: Fastest inference, lower quality
- **F16** or **BF16**: High quality, requires more VRAM
- **F32**: Highest quality, requires significant VRAM

## Quants Usage 

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):

![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)