[pinned] Apr 11: Updated with Google chat template fixes + more
🤗❤️ 14 · 7 replies · #24 opened 1 day ago by danielhanchen
[pinned] Gemma 4 Tool Calling is amazing in Unsloth Studio!
🔥 4 · 4 replies · #4 opened 10 days ago by danielhanchen
Apr 11 chat template causes ~7.5s template rendering overhead per request in llama.cpp
#27 opened about 7 hours ago by btdeviant
llama.cpp flags / visual token budget
#26 opened about 11 hours ago by 234r89r23u89023rui90
`D:\a\llama.cpp\llama.cpp\ggml\src\ggml-cuda\ggml-cuda.cu:911: GGML_ASSERT(tensor->view_src == nullptr) failed`
#25 opened 1 day ago by osabc
Do NOT use CUDA 13.2
❤️ 8 · #22 opened 4 days ago by danielhanchen
Gemma 4 seems to work best with high temperature for coding
👍 1 · 8 replies · #21 opened 4 days ago by Reverger
Apr 8 - New GGUF Updates
👍❤️ 14 · 10 replies · #20 opened 4 days ago by danielhanchen
GGUF updates
👍 5 · 1 reply · #17 opened 6 days ago by tstello
Ollama Error
👍 1 · 3 replies · #16 opened 6 days ago by edm-research
Inference speed on 12GB VRAM
6 replies · #15 opened 6 days ago by drakexp
Fails to run on vLLM
1 reply · #14 opened 7 days ago by Skodra
Only 2nd <13GB model to one-shot the Heptagon-Tumbler
❤️🔥 3 · 1 reply · #12 opened 8 days ago by BingoBird
New uploads add llama.cpp fixes
👍 6 · 16 replies · #11 opened 9 days ago by danielhanchen
Commit description
👍 4 · 1 reply · #10 opened 9 days ago by Kelheor
Q4_0 and Q4_1?
👍 1 · #9 opened 9 days ago by elpirater312
How to enable thinking
❤️👍 7 · 5 replies · #6 opened 10 days ago by watchingyousleep
Tool call with dates fails
2 replies · #5 opened 10 days ago by EmilPi
Model produces `<|channel><unused49><unused49><unused49>`
👍 5 · 30 replies · #2 opened 10 days ago by kyuz0