output002

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the Tri-Ring merge method using /mnt/e/text-generation-webui-1.14/user_data/models/k2-nano-zero as a base.

Models Merged

The following models were included in the merge:

  • /mnt/f/mergekit-my-test-gemma4/output000
  • /mnt/f/mergekit-my-test-gemma4/output001
  • /mnt/e/text-generation-webui-1.14/user_data/models/k2-nano-karcher-100000

Configuration

The following YAML configuration was used to produce this model:

# Tri-Ring: exactly three non-base models are used as the three ring centres.
merge_method: tri_ring
base_model: /mnt/e/text-generation-webui-1.14/user_data/models/k2-nano-zero
models:
  - model: /mnt/f/mergekit-my-test-gemma4/output000
    parameters:
      weight: 0.34
  - model: /mnt/f/mergekit-my-test-gemma4/output001
    parameters:
      weight: 0.33
  - model: /mnt/e/text-generation-webui-1.14/user_data/models/k2-nano-karcher-100000
    parameters:
      weight: 0.33
parameters:
  normalize_weights: true
  overlap_alpha: 0.35
  ring_temperature: 4.0
  max_iter: 8
  tol: 1e-6
  eps: 1e-8
dtype: bfloat16
Downloads last month
27
Safetensors
Model size
26B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for win10/K2-nano-v1

Quantizations
2 models

Paper for win10/K2-nano-v1