How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="DrRiceIO7/HereticFT-Aggressive")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)
# Load model directly
from transformers import AutoProcessor, AutoModelForMultimodalLM

processor = AutoProcessor.from_pretrained("DrRiceIO7/HereticFT-Aggressive")
model = AutoModelForMultimodalLM.from_pretrained("DrRiceIO7/HereticFT-Aggressive")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
Quick Links

Originally, I wanted to try fine tuning my model with DPO but I couldn't figure out how to get Unsloth to do it using Gemma based models, so this is based on regular old SFT. It still got that abrasive edge though, so I'm calling it a partial success, on account of it seeming a little bit unstable. Next plan: try out a new architecture.

Uploaded finetuned model

  • Developed by: DrRiceIO7
  • License: apache-2.0
  • Finetuned from model : DrRiceIO7/HereticFT

This gemma3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month
4
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
Input a message to start chatting with DrRiceIO7/HereticFT-Aggressive.

Model tree for DrRiceIO7/HereticFT-Aggressive

Finetuned
(2)
this model
Finetunes
1 model

Dataset used to train DrRiceIO7/HereticFT-Aggressive