mmnga
/

Mixtral-Fusion-4x7B-Instruct-v0.1

Text Generation

Mixture of Experts

text-generation-inference

Model card Files Files and versions

mmnga commited on Dec 22, 2023

Commit

f88f8f9

·

1 Parent(s): 051ea46

Update README.md

Files changed (1) hide show

README.md +10 -5

README.md CHANGED Viewed

@@ -12,14 +12,20 @@ inference: false
 This model is an experimental model created by merging [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) experts.
 # How we merged experts
-We simply take the average of every two experts.weight.
-The same goes for gate.weight.
-**Unfortunately, this model has a large hallucination. Look extraction version. -> [mmnga/Mixtral-Extraction-4x7B-Instruct-v0.1](https://huggingface.co/mmnga/Mixtral-Extraction-4x7B-Instruct-v0.1)**
 # How To Convert
 use colab cpu-high-memory.
 [convert_mixtral_8x7b_to_4x7b.ipynb](https://huggingface.co/mmnga/Mixtral-Fusion-4x7B-Instruct-v0.1/blob/main/notebook/convert_mixtral_8x7b_to_4x7b.ipynb)
 # Usage
 ~~~python
 pip install git+https://github.com/huggingface/transformers --upgrade
@@ -35,11 +41,10 @@ model_name_or_path = "mmnga/Mixtral-Fusion-4x7B-Instruct-v0.1"
 tokenizer = AutoTokenizer.from_pretrained(model_name_or_path)
 model = MixtralForCausalLM.from_pretrained(model_name_or_path, load_in_8bit=True)
-text = "Tell me what's for dinner tonight. "
 inputs = tokenizer(text, return_tensors="pt")
 outputs = model.generate(**inputs, max_new_tokens=128)
 print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 ~~~

 This model is an experimental model created by merging [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) experts.
 # How we merged experts
+Changed to merge using slerp.
+[Discussion](https://huggingface.co/mmnga/Mixtral-Fusion-4x7B-Instruct-v0.1/discussions/2)
+[old merge version](https://huggingface.co/mmnga/Mixtral-Fusion-4x7B-Instruct-v0.1/tree/v0.1.0)
+~~We simply take the average of every two experts.weight.~~
+~~The same goes for gate.weight.~~
 # How To Convert
 use colab cpu-high-memory.
 [convert_mixtral_8x7b_to_4x7b.ipynb](https://huggingface.co/mmnga/Mixtral-Fusion-4x7B-Instruct-v0.1/blob/main/notebook/convert_mixtral_8x7b_to_4x7b.ipynb)
+# OtherModels
+[mmnga/Mixtral-Extraction-4x7B-Instruct-v0.1](https://huggingface.co/mmnga/Mixtral-Extraction-4x7B-Instruct-v0.1)
 # Usage
 ~~~python
 pip install git+https://github.com/huggingface/transformers --upgrade
 tokenizer = AutoTokenizer.from_pretrained(model_name_or_path)
 model = MixtralForCausalLM.from_pretrained(model_name_or_path, load_in_8bit=True)
+text = "[INST] What was John Holt's vision on education? [/INST] "
 inputs = tokenizer(text, return_tensors="pt")
 outputs = model.generate(**inputs, max_new_tokens=128)
 print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 ~~~