jondurbin
/

bagel-14b

Text Generation

Model card Files Files and versions

jondurbin commited on Dec 28, 2023

Commit

36f16fe

·

1 Parent(s): 73f1414

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -42,7 +42,7 @@ datasets:
 An experimental fine-tune of qwen-14b using [bagel](https://github.com/jondurbin/bagel)
-The resulting model didn't turn out quite as great as I would have liked - in fact, I'd probably use the mistral-7b version over this, because it scored higher on mt-bench, is much faster, and generally is uncensored in comparison to this model (even after toxic DPO, several epochs)
 I modified the qwen tokenizer to use `<s>` instead of `<|im_start|>` and `</s>` instead of `<|endoftext|>`, and it may have caused some issues but I'm not entirely sure.

 An experimental fine-tune of qwen-14b using [bagel](https://github.com/jondurbin/bagel)
+The resulting model didn't turn out quite as great as I would have liked - in fact, I'd probably use the [mistral-7b](https://huggingface.co/jondurbin/bagel-dpo-7b-v0.1) version over this, because it scored higher on mt-bench, is much faster, and generally is uncensored in comparison to this model (even after toxic DPO, several epochs)
 I modified the qwen tokenizer to use `<s>` instead of `<|im_start|>` and `</s>` instead of `<|endoftext|>`, and it may have caused some issues but I'm not entirely sure.