Safetensors
GGUF
English
llama
conversational
QKing-Official commited on
Commit
91d4e30
·
verified ·
1 Parent(s): 3a92501

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +23 -3
README.md CHANGED
@@ -1,3 +1,23 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ## EndAI‑small.py (TinyLlama)
2
+
3
+ ### Purpose
4
+
5
+ To quickly finetune a model and easily run and use it with the EndAI server!
6
+ This model is specifically designed by me!
7
+
8
+ ### Usage
9
+
10
+ ```bash
11
+ python EndAI-small.py
12
+ ```
13
+
14
+ ### What it does
15
+
16
+ 1. Downloads `TinyLlama/TinyLlama-1.1B-Chat-v1.0`
17
+ 2. Loads 3% of `HuggingFaceH4/ultrachat_200k` (train_sft)
18
+ 3. Applies a system prompt and formats.
19
+ 4. Tokenises with max length 512
20
+ 5. Trains a LoRA adapter (`r=8`, targets `q_proj`, `v_proj`) for 2 epochs
21
+ 6. Saves the merged model to `./endai-small-final/`
22
+
23
+ Manually convert to gguf using the llama.cpp build from the non-small version of this.