Pipecat is an open-source framework for building voice and multimodal conversational AI. It handles the orchestration of audio, AI services, and conversation pipelines so you can focus on what makes your agent unique. Fish Audio integrates with Pipecat through FishAudioTTSService, which provides real-time text-to-speech synthesis using WebSocket streaming for low-latency conversational applications.
Prerequisites
- A Fish Audio account with an API key
- Python 3.9 or higher
Installation
Install Pipecat with Fish Audio support:
Configuration
Set your Fish Audio API key as an environment variable:
Basic usage
Add FishAudioTTSService to your Pipecat pipeline:
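A minimal sketch of this step might look like the following. It assumes the API key was exported under the name FISH_API_KEY and that the service is importable from pipecat.services.fish.tts; both are assumptions to check against your environment and installed Pipecat version. The helpers load_fish_api_key and create_fish_tts are hypothetical names used only for illustration.

```python
import os


def load_fish_api_key(env=None):
    """Return the Fish Audio API key from the environment, or raise if it is unset."""
    env = os.environ if env is None else env
    # FISH_API_KEY is an assumed variable name; use whatever name you exported.
    key = env.get("FISH_API_KEY", "").strip()
    if not key:
        raise RuntimeError("FISH_API_KEY is not set")
    return key


def create_fish_tts(reference_id, env=None):
    """Build the TTS service (hypothetical helper, not part of Pipecat)."""
    # Imported lazily so this module still loads where Pipecat is not installed.
    # The import path is an assumption; check it against your Pipecat version.
    from pipecat.services.fish.tts import FishAudioTTSService

    return FishAudioTTSService(
        api_key=load_fish_api_key(env),
        reference_id=reference_id,
    )
```

In a typical Pipecat pipeline the returned service would sit after the LLM and before the transport output, for example `Pipeline([transport.input(), stt, llm, tts, transport.output()])`, where the other processors are whatever your application already uses.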
Key parameters
| Parameter | Description |
|---|---|
| api_key | Your Fish Audio API key |
| reference_id | Voice model ID from the Fish Audio library |
| model_id | TTS model version (default: s1) |
| output_format | Audio format: pcm, mp3, wav, or opus |
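The parameters in the table can be gathered and checked before the service is constructed. The sketch below uses a hypothetical helper, fish_tts_kwargs, that rejects an unsupported output_format early. The pcm default for output_format is an assumption; only the s1 default for model_id comes from the table.

```python
ALLOWED_FORMATS = {"pcm", "mp3", "wav", "opus"}


def fish_tts_kwargs(api_key, reference_id, model_id="s1", output_format="pcm"):
    """Collect constructor arguments, failing fast on an unknown audio format."""
    if output_format not in ALLOWED_FORMATS:
        raise ValueError(
            f"output_format must be one of {sorted(ALLOWED_FORMATS)}, got {output_format!r}"
        )
    return {
        "api_key": api_key,
        "reference_id": reference_id,
        "model_id": model_id,  # default taken from the table above
        "output_format": output_format,  # default here is an assumption
    }
```

The resulting dictionary can then be unpacked into the constructor, as in `FishAudioTTSService(**fish_tts_kwargs(key, voice_id))`.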
Prosody controls
Customize speech characteristics with InputParams:
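As a rough sketch, prosody settings could be validated and then handed to InputParams as shown below. The field names prosody_speed and prosody_volume, the 0.5 to 2.0 speed range, the nested FishAudioTTSService.InputParams class, and the import path are all assumptions based on recent Pipecat releases; verify them against the version you have installed. The helper names are hypothetical.

```python
def prosody_settings(speed=1.0, volume_db=0):
    """Validate prosody values and return them under the assumed field names."""
    # The 0.5-2.0 speed range is an assumption, not confirmed by this page.
    if not 0.5 <= speed <= 2.0:
        raise ValueError(f"speed must be between 0.5 and 2.0, got {speed}")
    return {"prosody_speed": speed, "prosody_volume": volume_db}


def create_prosody_params(speed=1.0, volume_db=0):
    """Build an InputParams object (hypothetical helper, not part of Pipecat)."""
    # Imported lazily so this module still loads where Pipecat is not installed.
    from pipecat.services.fish.tts import FishAudioTTSService  # assumed path

    return FishAudioTTSService.InputParams(**prosody_settings(speed, volume_db))
```

The object returned by create_prosody_params would then be passed to the service constructor, assuming it accepts a `params` keyword argument.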




