GPT-OSS-20B STEM Reasoning (GGUF Q8_0)

GGUF quantized version of GPT-OSS-20B fine-tuned for STEM reasoning tasks. Ready for use with Ollama, llama.cpp, LM Studio, and other GGUF-compatible tools.

Author: Khadim Hussain

Available Versions

Model Size Description
gpt-oss-20b-stem-reasoning ~61MB LoRA adapter only
gpt-oss-20b-stem-reasoning-merged ~41GB Full merged model (bf16)
gpt-oss-20b-stem-reasoning-GGUF ~21GB GGUF Q8_0 (this repo)

Files

Filename Quant Size Description
gpt-oss-20b-finetuned-f16.gguf F16 39GB Full 16-bit precision, best quality
gpt-oss-20b-finetuned-q8_0.gguf Q8_0 21GB 8-bit quantization, excellent quality
gpt-oss-20b-finetuned-q4_k_m.gguf Q4_K_M 15GB 4-bit quantization, good balance of size/quality

Usage with Ollama

# Download the GGUF file and Modelfile
# Then create and run:
ollama create gpt-oss-20b-stem -f Modelfile
ollama run gpt-oss-20b-stem

Modelfile

FROM ./gpt-oss-20b-finetuned-q8_0.gguf

TEMPLATE """<|start|>system<|message|>You are a helpful assistant trained by OpenAI.

Reasoning: medium

# Valid channels: analysis, commentary, final. Channel must be included for every message.<|end|><|start|>user<|message|>{{ .Prompt }}<|end|><|start|>assistant"""

PARAMETER stop "<|return|>"
PARAMETER temperature 0.7
PARAMETER top_p 0.9
PARAMETER num_ctx 4096
PARAMETER num_predict 2048

Usage with llama.cpp

./llama-cli -m gpt-oss-20b-finetuned-q8_0.gguf \
  -p "<|start|>system<|message|>You are a helpful assistant.<|end|><|start|>user<|message|>What is DNA?<|end|><|start|>assistant" \
  -n 512

Usage with LM Studio

  1. Download gpt-oss-20b-finetuned-q8_0.gguf
  2. Import into LM Studio
  3. Use the Harmony chat template

Harmony Response Format

GPT-OSS uses the OpenAI Harmony format. The model outputs:

  • <|channel|>analysis - Thinking/reasoning (displayed as "Thinking..." in Ollama)
  • <|channel|>final - Final answer

Example:

Thinking...
DNA stands for deoxyribonucleic acid. It is a polymer made from nucleotides...
...done thinking.

DNA is a molecule that carries genetic instructions used in growth, development, and reproduction.

Training Details

Metric Value
Base Model openai/gpt-oss-20b
Train Loss 1.087
Eval Loss 0.837
Training Examples 4,260
Quantization Q8_0 (8-bit)

Acknowledgments

Citation

If you use this model, please cite:

@misc{hussain2026gptoss-stem,
  author = {Hussain, Khadim},
  title = {GPT-OSS-20B STEM Reasoning: Fine-tuned for Science Q&A with Chain-of-Thought},
  year = {2026},
  publisher = {Hugging Face},
  url = {https://huggingface.co/khadim-hussain/gpt-oss-20b-stem-reasoning-GGUF}
}

Also cite the original GPT-OSS model:

@misc{openai2025gptoss,
  author = {OpenAI},
  title = {GPT-OSS: Open-Weight Language Models},
  year = {2025},
  url = {https://github.com/openai/gpt-oss}
}

License

Apache 2.0 (inherited from GPT-OSS)

Downloads last month
19
GGUF
Model size
21B params
Architecture
gpt-oss
Hardware compatibility
Log In to add your hardware

4-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for khadim-hussain/gpt-oss-20b-stem-reasoning-GGUF

Quantized
(203)
this model