TinyLlama 1.1B - BonfyreFPQ v12 Native

  • Format: .fpq v12 (rANS entropy coded E8 + 6-bit tiles + FP16 scales)
  • Base model: TinyLlama/TinyLlama-1.1B-Chat-v1.0 (1.10B params)
  • Size: 1.1 GB (vs 4.1 GB FP32 = 3.8× compression)
  • Bits per weight: 8.43 bpw
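
The size, compression ratio, and bits-per-weight figures above are related by simple arithmetic, sketched below. This is only a consistency check, not part of the BonfyreFPQ tooling: the helper names are hypothetical, the parameter count is the rounded 1.10B from the list, and the sizes are assumed to be reported in binary gigabytes (GiB). Real files also carry headers and metadata that this ignores.

```python
# Hypothetical helpers for sanity-checking the figures listed above.
# Assumptions: 1.10e9 parameters (rounded), sizes expressed in GiB (2**30 bytes).

GIB = 2 ** 30


def size_gib(num_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GiB for a given bits-per-weight budget."""
    return num_params * bits_per_weight / 8 / GIB


def compression_ratio(baseline_bits: float, bits_per_weight: float) -> float:
    """Compression relative to a baseline precision (e.g. 32 for FP32)."""
    return baseline_bits / bits_per_weight


if __name__ == "__main__":
    params = 1.10e9   # rounded parameter count from the list above
    bpw = 8.43        # bits per weight from the list above

    print(f"FP32 size:   {size_gib(params, 32):.2f} GiB")
    print(f"FPQ size:    {size_gib(params, bpw):.2f} GiB")
    print(f"Compression: {compression_ratio(32, bpw):.2f}x vs FP32")
```

Under these assumptions, dividing 32 bits by 8.43 bpw gives roughly the 3.8× ratio listed, and the computed sizes round to the listed 4.1 GB and 1.1 GB.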
