How to use from
llama.cpp
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf isaiahbjork/orpheus-3b-0.1-ft-Q4_K_M-GGUF:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf isaiahbjork/orpheus-3b-0.1-ft-Q4_K_M-GGUF:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf isaiahbjork/orpheus-3b-0.1-ft-Q4_K_M-GGUF:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf isaiahbjork/orpheus-3b-0.1-ft-Q4_K_M-GGUF:Q4_K_M
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf isaiahbjork/orpheus-3b-0.1-ft-Q4_K_M-GGUF:Q4_K_M
# Run inference directly in the terminal:
./llama-cli -hf isaiahbjork/orpheus-3b-0.1-ft-Q4_K_M-GGUF:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf isaiahbjork/orpheus-3b-0.1-ft-Q4_K_M-GGUF:Q4_K_M
# Run inference directly in the terminal:
./build/bin/llama-cli -hf isaiahbjork/orpheus-3b-0.1-ft-Q4_K_M-GGUF:Q4_K_M
Use Docker
docker model run hf.co/isaiahbjork/orpheus-3b-0.1-ft-Q4_K_M-GGUF:Q4_K_M
Quick Links

Orpheus-TTS-Local

A lightweight client for running Orpheus TTS locally using LM Studio API.

{Github Repo](https://github.com/isaiahbjork/orpheus-tts-local)

Features

  • ๐ŸŽง High-quality Text-to-Speech using the Orpheus TTS model
  • ๐Ÿ’ป Completely local - no cloud API keys needed
  • ๐Ÿ”Š Multiple voice options (tara, leah, jess, leo, dan, mia, zac, zoe)
  • ๐Ÿ’พ Save audio to WAV files

Quick Setup

  1. Install LM Studio
  2. Install the Orpheus TTS model (orpheus-3b-0.1-ft-q4_k_m.gguf) in LM Studio
  3. Load the Orpheus model in LM Studio
  4. Start the local server in LM Studio (default: http://127.0.0.1:1234)
  5. Install dependencies:
    python3 -m venv venv
    source venv/bin/activate
    pip install -r requirements.txt
    
  6. Run the script:
    python gguf_orpheus.py --text "Hello, this is a test" --voice tara
    

Usage

python gguf_orpheus.py --text "Your text here" --voice tara --output "output.wav"

Options

  • --text: The text to convert to speech
  • --voice: The voice to use (default: tara)
  • --output: Output WAV file path (default: auto-generated filename)
  • --list-voices: Show available voices
  • --temperature: Temperature for generation (default: 0.6)
  • --top_p: Top-p sampling parameter (default: 0.9)
  • --repetition_penalty: Repetition penalty (default: 1.1)

Available Voices

  • tara - Best overall voice for general use (default)
  • leah
  • jess
  • leo
  • dan
  • mia
  • zac
  • zoe

Emotion

You can add emotion to the speech by adding the following tags:

<giggle>
<laugh>
<chuckle>
<sigh>
<cough>
<sniffle>
<groan>
<yawn>
<gasp>

License

Apache 2.0

Downloads last month
6,629
GGUF
Model size
4B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for isaiahbjork/orpheus-3b-0.1-ft-Q4_K_M-GGUF