A lightweight client for real-time transcription and translation with the WhisperLive server.
- Real-time audio transcription/translation.
- Supports Whisper models and Voice Activity Detection (VAD).
- Streams audio to the server; optionally saves
.wavoutput.
-
Install Python 3.8+ and dependencies. Make sure
portaudioor its development package e.g.portaudio19-devis installed.- via
pip:
pip install loguru numpy pyaudio websocket-client
- via
-
Start the
WhisperLiveserver.
Run the client:
python client.pyConfigure options (server, task, output) in the __main__ block.