How to use TheBloke/MPT-7B-Instruct-GGML with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("TheBloke/MPT-7B-Instruct-GGML", dtype="auto")