Question 1

What is Tambourine?

Accepted Answer

Tambourine is an open-source voice dictation platform built on Pipecat. It provides a modular pipeline for speech-to-text with LLM-based formatting, supporting 10+ STT/LLM providers or fully local inference with Whisper and Ollama.

Question 2

Can I run Tambourine completely offline?

Accepted Answer

Yes. Use Whisper for local STT and Ollama for local LLM formatting. No API keys or internet connection are required. Set WHISPER_ENABLED=true and OLLAMA_BASE_URL in your .env file.
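
As a rough illustration of what those two settings express, the sketch below reads them from the environment in plain Python. The variable names come from the answer above. The LocalSettings class, the load_local_settings helper, the truthy-value parsing, and the fallback URL http://localhost:11434 (Ollama's usual local address) are assumptions for illustration, not Tambourine's actual code.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class LocalSettings:
    """Hypothetical container for the two offline-related settings."""
    whisper_enabled: bool
    ollama_base_url: str


def load_local_settings(env=None) -> LocalSettings:
    """Read WHISPER_ENABLED and OLLAMA_BASE_URL from a mapping (os.environ by default)."""
    env = os.environ if env is None else env
    # Treat "true", "1", and "yes" (any case) as enabled; this parsing rule is an assumption.
    flag = env.get("WHISPER_ENABLED", "false").strip().lower()
    whisper_enabled = flag in {"true", "1", "yes"}
    # Fall back to Ollama's usual local address when the variable is unset.
    ollama_base_url = env.get("OLLAMA_BASE_URL", "http://localhost:11434")
    return LocalSettings(whisper_enabled, ollama_base_url)
```

Passing a plain dict instead of relying on os.environ makes the helper easy to check without touching the real environment.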

Question 3

What is Tambourine's architecture?

Accepted Answer

A Tauri desktop app (Rust + React) connects over WebRTC to a Python server running Pipecat pipelines. The server runs STT, passes the transcript through LLM formatting, and returns the cleaned text. Runtime configuration goes through the RTVI protocol.
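
The server-side flow (audio in, STT, LLM formatting, cleaned text out) can be sketched in plain Python. This is not Pipecat code: run_dictation, fake_stt, and fake_formatter are hypothetical stand-ins that only show the order of the stages.

```python
from typing import Callable

# Each stage is a plain function here; in the real server these would be Pipecat services.
Transcriber = Callable[[bytes], str]
Formatter = Callable[[str], str]


def run_dictation(audio: bytes, transcribe: Transcriber, format_text: Formatter) -> str:
    """Pass audio through STT, then hand the raw transcript to the formatting step."""
    raw_transcript = transcribe(audio)
    return format_text(raw_transcript)


def fake_stt(audio: bytes) -> str:
    # Stand-in for a speech-to-text service: pretend the audio bytes are the spoken words.
    return audio.decode("utf-8")


def fake_formatter(text: str) -> str:
    # Stand-in for the LLM step: capitalize the first letter and add a full stop.
    cleaned = text.strip()
    return cleaned[:1].upper() + cleaned[1:] + "."


print(run_dictation(b"hello world", fake_stt, fake_formatter))  # prints "Hello world."
```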

Question 4

How do I add a new STT or LLM provider?

Accepted Answer

Tambourine uses Pipecat's service abstraction. Add any Pipecat-supported provider by implementing the service interface. See the server/ directory for examples with Deepgram, Cartesia, Groq, and more.
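
The general idea behind a service abstraction can be sketched as a small interface plus a registry of providers. The names below (STTService, register_stt, create_stt, EchoSTT) are hypothetical and are not Pipecat's or Tambourine's API; check the server/ directory for the real base classes and wiring.

```python
from typing import Callable, Dict, Protocol


class STTService(Protocol):
    """Hypothetical interface: anything with a transcribe() method qualifies."""

    def transcribe(self, audio: bytes) -> str: ...


# Maps a provider name to a factory that builds the service.
STT_PROVIDERS: Dict[str, Callable[[], STTService]] = {}


def register_stt(name: str, factory: Callable[[], STTService]) -> None:
    """Make a provider selectable by name."""
    STT_PROVIDERS[name] = factory


def create_stt(name: str) -> STTService:
    """Build the provider registered under `name`, or fail with a clear error."""
    try:
        return STT_PROVIDERS[name]()
    except KeyError:
        raise ValueError(f"Unknown STT provider: {name!r}") from None


class EchoSTT:
    """Toy provider used only to show the registration flow."""

    def transcribe(self, audio: bytes) -> str:
        return audio.decode("utf-8")


register_stt("echo", EchoSTT)
```

With this shape, the rest of the pipeline depends only on the interface, so adding a provider means writing one class and registering it.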

Question 5

How customizable is the formatting?

Accepted Answer

Fully customizable. The LLM formatting uses editable prompts stored in settings. Modify filler word removal, punctuation rules, backtracking behavior, or add your own custom logic. A personal dictionary covers technical terms.
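
To make the idea of editable prompts concrete, here is a minimal sketch of assembling a formatting prompt from a few toggles and a personal dictionary. The prompt wording, the build_formatting_prompt function, and its parameters are all invented for illustration; Tambourine's actual prompts live in its settings and may be structured differently.

```python
from typing import Iterable

# Hypothetical base instruction; the real prompt text is defined in Tambourine's settings.
BASE_PROMPT = "Clean up the dictated text. Return only the cleaned text."


def build_formatting_prompt(
    remove_fillers: bool = True,
    fix_punctuation: bool = True,
    dictionary: Iterable[str] = (),
) -> str:
    """Compose a system prompt from optional rules and personal dictionary terms."""
    lines = [BASE_PROMPT]
    if remove_fillers:
        lines.append("Remove filler words such as 'um' and 'uh'.")
    if fix_punctuation:
        lines.append("Add punctuation and capitalization.")
    # De-duplicate and sort so the prompt is stable across runs.
    terms = sorted(set(dictionary))
    if terms:
        lines.append("Spell these terms exactly as written: " + ", ".join(terms) + ".")
    return "\n".join(lines)
```

Calling build_formatting_prompt(dictionary=["Pipecat", "Tauri"]) yields the base instruction, both rules, and a final line listing the two terms.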

Question 6

What is Tambourine's tech stack?

Accepted Answer

Desktop: Tauri (Rust) + React + TypeScript. Server: Python + FastAPI + Pipecat. Communication: WebRTC (SmallWebRTC). State: Zustand + XState. Validation: Zod + Pydantic.
