Update README.md
Browse files
README.md
CHANGED
|
@@ -42,7 +42,7 @@ datasets:
|
|
| 42 |
|
| 43 |
An experimental fine-tune of qwen-14b using [bagel](https://github.com/jondurbin/bagel)
|
| 44 |
|
| 45 |
-
The resulting model didn't turn out quite as great as I would have liked - in fact, I'd probably use the mistral-7b version over this, because it scored higher on mt-bench, is much faster, and generally is uncensored in comparison to this model (even after toxic DPO, several epochs)
|
| 46 |
|
| 47 |
I modified the qwen tokenizer to use `<s>` instead of `<|im_start|>` and `</s>` instead of `<|endoftext|>`, and it may have caused some issues but I'm not entirely sure.
|
| 48 |
|
|
|
|
| 42 |
|
| 43 |
An experimental fine-tune of qwen-14b using [bagel](https://github.com/jondurbin/bagel)
|
| 44 |
|
| 45 |
+
The resulting model didn't turn out quite as great as I would have liked - in fact, I'd probably use the [mistral-7b](https://huggingface.co/jondurbin/bagel-dpo-7b-v0.1) version over this, because it scored higher on mt-bench, is much faster, and generally is uncensored in comparison to this model (even after toxic DPO, several epochs)
|
| 46 |
|
| 47 |
I modified the qwen tokenizer to use `<s>` instead of `<|im_start|>` and `</s>` instead of `<|endoftext|>`, and it may have caused some issues but I'm not entirely sure.
|
| 48 |
|