How to use from the llama-cpp-python library
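# !pip install llama-cpp-python huggingface-hub

from llama_cpp import Llama

# Download the GGUF file from the Hugging Face Hub (needs huggingface-hub) and load it.
llm = Llama.from_pretrained(
    repo_id="vietgpt/dama-2-7b-chat-gguf",
    # Assumption: file name taken from the llama.cpp command below; check the repo's file list.
    filename="dama-2-7b-chat.q2_k.gguf",
)

# Generate up to 512 new tokens; echo=True includes the prompt in the returned text.
output = llm(
    "Once upon a time,",
    max_tokens=512,
    echo=True,
)
print(output)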
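
The call above returns a completion dictionary rather than a plain string. Below is a minimal sketch of pulling out the generated text, assuming the OpenAI-style layout that llama-cpp-python uses for completions (a `choices` list whose entries carry a `text` field). `completion_text` is a hypothetical helper name, and the sample dictionary is hand-written for illustration, not real model output.

```python
def completion_text(output: dict) -> str:
    """Return the text of the first choice in a completion-style dict."""
    choices = output.get("choices", [])
    if not choices:
        # No choices present: return an empty string instead of raising.
        return ""
    return choices[0].get("text", "")


# Hand-written stand-in for a real response (assumption, not model output).
sample = {
    "choices": [
        {"text": "Once upon a time, there was a fox.", "finish_reason": "stop"}
    ]
}
print(completion_text(sample))
```

With the snippet above, `completion_text(output)` would take the place of `print(output)` when only the text is needed.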
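
How to use from llama.cpp
The prompt is in Vietnamese. System message: "You are VietGPT, a large language model trained by VietGPT, based on the LLaMa architecture." User message: "On what date was Ho Chi Minh born?"
./main -m path/to/dama-2-7b-chat.q2_k.gguf -n 512 --logit-bias 2+1 --temp 0.1 --repeat_penalty 1.03 --multiline-input -p "<s>[INST] <<SYS>>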
Bạn là VietGPT, mô hình ngôn ngữ lớn được VietGPT đào tạo, dựa trên kiến trúc LLaMa.
<</SYS>>

Hồ Chí Minh sinh vào ngày nào? [/INST]"
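
The prompt in the command above follows the Llama 2 chat layout: a system message wrapped in `<<SYS>>` markers, followed by the user message, all inside one `[INST]` block. Below is a minimal sketch of assembling the same string in Python. `build_llama2_prompt` is a hypothetical helper name, and the layout is inferred only from the command above.

```python
def build_llama2_prompt(user_message: str, system_message: str) -> str:
    """Assemble a single-turn prompt in the layout used by the CLI example."""
    return (
        "<s>[INST] <<SYS>>\n"
        f"{system_message}\n"
        "<</SYS>>\n"
        "\n"
        f"{user_message} [/INST]"
    )


# Same system and user messages as in the llama.cpp command.
prompt = build_llama2_prompt(
    user_message="Hồ Chí Minh sinh vào ngày nào?",
    system_message=(
        "Bạn là VietGPT, mô hình ngôn ngữ lớn được VietGPT đào tạo, "
        "dựa trên kiến trúc LLaMa."
    ),
)
print(prompt)
```

The resulting string could be passed to `llm(...)` in place of "Once upon a time,". Whether to keep the literal `<s>` there is worth checking, since the library may add its own beginning-of-sequence token.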
Format: GGUF
Model size: 7B params
Architecture: llama
Quantization: 2-bit