---
title: PARLE Light — Speech-to-Speech
emoji: 🗣️
colorFrom: green
colorTo: blue
sdk: docker
pinned: false
---

# PARLE Light — Speech-to-Speech Pipeline

A lightweight version of the PARLE speech-to-speech pipeline, intended for fast testing and low-VRAM deployments.

## Stack

| Component | Model | VRAM |
|-----------|-------|------|
| STT | Faster Whisper Small | ~460 MB |
| LLM | Qwen2.5-0.5B-Instruct | ~1 GB |
| TTS | gTTS (Google TTS API) | 0 (runs remotely, needs network access) |

**Total: ~1.5 GB VRAM**, which fits on most GPUs with 2 GB or more of memory.
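
The table implies a three-stage flow: speech to text, text to reply, reply to speech. A minimal sketch of that composition follows. The `SpeechPipeline` class and the `fake_*` stages are hypothetical stand-ins for illustration and are not taken from the PARLE code; a real deployment would wrap Faster Whisper, Qwen2.5-0.5B-Instruct, and gTTS behind the same three callables.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class SpeechPipeline:
    """Chains three stages: STT -> LLM -> TTS (hypothetical structure)."""

    stt: Callable[[bytes], str]   # audio bytes -> transcript
    llm: Callable[[str], str]     # transcript -> reply text
    tts: Callable[[str], bytes]   # reply text -> audio bytes

    def __call__(self, audio: bytes) -> bytes:
        transcript = self.stt(audio)
        reply = self.llm(transcript)
        return self.tts(reply)


# Stand-in stages so the sketch runs without a GPU or network access.
def fake_stt(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_llm(prompt: str) -> str:
    return f"You said: {prompt}"


def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = SpeechPipeline(stt=fake_stt, llm=fake_llm, tts=fake_tts)
```

Keeping each stage behind a plain callable makes it possible to swap a model, or to test the flow end to end with stubs, without touching the other two stages.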

## Deploy with SkyPilot

```bash
sky launch -c parle-light skypilot.yaml --cloud vast --gpus RTX3090:1 -i 5 --down -y
```

This assumes a `skypilot.yaml` task file in the current directory. With `-i 5 --down`, the cluster is torn down after 5 minutes of idleness, and `-y` skips the confirmation prompt.

## Endpoints

- `POST /api/stream-audio`: takes audio input and streams the response as Server-Sent Events (SSE)
- `POST /api/text`: takes text input and streams the response as SSE
- `GET /health`: health check
- `GET /capabilities`: model information
- `WS /ws/stream`: WebSocket streaming endpoint
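
A minimal client sketch for the HTTP endpoints, using only the Python standard library. Several details here are assumptions, since the request and response schemas are not documented above: the JSON body with a `text` field, the base URL `http://localhost:7860`, and the helper names `parse_sse`, `stream_text`, and `check_health` are all illustrative.

```python
import json
import urllib.request
from typing import Dict, Iterable, Iterator


def parse_sse(lines: Iterable[str]) -> Iterator[Dict[str, str]]:
    """Yield one dict per Server-Sent Event from an iterable of text lines."""
    event, data = "message", []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line terminates the current event.
            if data:
                yield {"event": event, "data": "\n".join(data)}
            event, data = "message", []
        elif line.startswith(":"):
            continue  # comment line, often used as a keep-alive
        else:
            field, _, value = line.partition(":")
            if value.startswith(" "):
                value = value[1:]  # strip the single optional leading space
            if field == "event":
                event = value
            elif field == "data":
                data.append(value)
    if data:
        # Lenient: flush a final event if the stream ended without a blank line.
        yield {"event": event, "data": "\n".join(data)}


def stream_text(base_url: str, text: str) -> Iterator[Dict[str, str]]:
    """POST text to /api/text and yield SSE events as they arrive."""
    # ASSUMPTION: the endpoint accepts a JSON body with a "text" field.
    body = json.dumps({"text": text}).encode("utf-8")
    req = urllib.request.Request(
        base_url.rstrip("/") + "/api/text",
        data=body,
        headers={"Content-Type": "application/json", "Accept": "text/event-stream"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        yield from parse_sse(line.decode("utf-8") for line in resp)


def check_health(base_url: str, timeout: float = 5.0) -> int:
    """Return the HTTP status code of GET /health."""
    with urllib.request.urlopen(base_url.rstrip("/") + "/health", timeout=timeout) as resp:
        return resp.status


# Example usage (assumed base URL; adjust to the actual deployment):
#   for ev in stream_text("http://localhost:7860", "Hello"):
#       print(ev["event"], ev["data"])
```

The payload carried in each event's `data` field is not specified here, so inspect it before parsing it as JSON or audio.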