Instructions to use ivelin/donut-refexp-draft-precision2decs-backup with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use ivelin/donut-refexp-draft-precision2decs-backup with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="ivelin/donut-refexp-draft-precision2decs-backup")

# Load model directly
from transformers import AutoTokenizer, AutoModelForMultimodalLM

tokenizer = AutoTokenizer.from_pretrained("ivelin/donut-refexp-draft-precision2decs-backup")
model = AutoModelForMultimodalLM.from_pretrained("ivelin/donut-refexp-draft-precision2decs-backup")

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use ivelin/donut-refexp-draft-precision2decs-backup with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "ivelin/donut-refexp-draft-precision2decs-backup"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ivelin/donut-refexp-draft-precision2decs-backup",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/ivelin/donut-refexp-draft-precision2decs-backup

SGLang

How to use ivelin/donut-refexp-draft-precision2decs-backup with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "ivelin/donut-refexp-draft-precision2decs-backup" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ivelin/donut-refexp-draft-precision2decs-backup",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "ivelin/donut-refexp-draft-precision2decs-backup" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ivelin/donut-refexp-draft-precision2decs-backup",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Docker Model Runner
How to use ivelin/donut-refexp-draft-precision2decs-backup with Docker Model Runner:
```
docker model run hf.co/ivelin/donut-refexp-draft-precision2decs-backup
```

donut-refexp-draft-precision2decs-backup

File size: 423 Bytes

077cf5d

{
  "</s_prompt>": 57528,
  "</s_target_bounding_box>": 57530,
  "</s_xmax>": 57536,
  "</s_xmin>": 57532,
  "</s_ymax>": 57538,
  "</s_ymin>": 57534,
  "<no/>": 57526,
  "<s_iitcdip>": 57523,
  "<s_prompt>": 57527,
  "<s_refexp>": 57539,
  "<s_synthdog>": 57524,
  "<s_target_bounding_box>": 57529,
  "<s_xmax>": 57535,
  "<s_xmin>": 57531,
  "<s_ymax>": 57537,
  "<s_ymin>": 57533,
  "<sep/>": 57522,
  "<yes/>": 57525
}