Microscopic-Mistral
Collection
20 items β’ Updated
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("DrNicefellow/Microscopic-Mistral-18k-steps")
model = AutoModelForCausalLM.from_pretrained("DrNicefellow/Microscopic-Mistral-18k-steps")Self trained microscopit Mistral. Around 810M parameters.
The tokenizer is the one from https://huggingface.co/mistralai/Mistral-7B-v0.1.
It is being trained on around 400B tokens and this is step 18k.
The evaluation is being conducted now.
This model is available under the Apache 2.0 License.
Join our Discord server here.
Eager to buy me a cup of 2$ coffe or iced tea?π΅β Sure, here is the link: https://ko-fi.com/drnicefellow. Please add a note on which one you want me to drink?
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="DrNicefellow/Microscopic-Mistral-18k-steps")