The Audio Playground is a dedicated space for testing and experimenting with GPUStack’s Text-to-Speech (TTS) and Speech-to-Text (STT) APIs. It allows users to interactively convert text to audio and audio to text, customize parameters, and review code examples for seamless API integration.
Switch to the "Text to Speech" tab to test TTS models.
Enter the text you want to convert, then click the Submit button to generate the corresponding speech.
Select an available TTS model in GPUStack by clicking the model dropdown at the top-right corner of the playground UI.
Customize the voice and format of the audio output.
!!! tip
    Supported voices may vary between models.
After experimenting with input text and parameters, click the View Code button to see how to call the API with the same input. Code examples are provided in curl, Python, and Node.js.
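For orientation, the following is a minimal Python sketch of what such a call might look like. It assumes the GPUStack server exposes an OpenAI-style speech endpoint at `/v1-openai/audio/speech`. The base URL, API key, model name, and voice name are placeholders, and the helper functions `build_speech_request` and `synthesize` are hypothetical names introduced here. The code shown by View Code in your own deployment is the authoritative reference.

```python
import json
import urllib.request

# Assumptions (not taken from this page): the server address, the
# "/v1-openai" path prefix, and the API key are placeholders.
BASE_URL = "http://your-gpustack-server/v1-openai"
API_KEY = "your-api-key"


def build_speech_request(text, model, voice, response_format="mp3"):
    """Build (without sending) a POST request for an OpenAI-style speech endpoint."""
    payload = {
        "model": model,                      # TTS model selected in the playground
        "input": text,                       # text to convert to speech
        "voice": voice,                      # supported voices vary between models
        "response_format": response_format,  # audio output format
    }
    return urllib.request.Request(
        f"{BASE_URL}/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {API_KEY}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def synthesize(text, model, voice, out_path, response_format="mp3"):
    """Send the request and write the returned audio bytes to out_path."""
    req = build_speech_request(text, model, voice, response_format)
    with urllib.request.urlopen(req) as resp:
        audio = resp.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return out_path

# Example call (requires a reachable server and a deployed TTS model):
# synthesize("Hello from GPUStack", "my-tts-model", "my-voice", "hello.mp3")
```

Keeping request construction separate from sending makes it possible to inspect the payload without contacting the server.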
Switch to the "Speech to Text" tab to test STT models.
You can provide audio for transcription in two ways: upload an audio file, or record audio online.
!!! note
    If online recording is not available, it could be due to one of the following reasons:

    1. For HTTPS or `http://localhost` access, microphone permissions must be enabled in your browser.
    2. For access via `http://{host IP}`, the URL must be added to your browser's trusted list.

        **Example:**
        In Chrome, navigate to `chrome://flags/`, add the GPUStack URL to "Insecure origins treated as secure", and enable this option.
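The conditions in the note above follow from the browser rule that microphone access is only offered in a secure context. As a rough illustration, the Python sketch below approximates that rule for a given page URL. The function name `needs_trusted_origin_setting` is hypothetical, and the check is deliberately simplified (real browsers apply a more detailed definition of trustworthy origins).

```python
from urllib.parse import urlsplit


def needs_trusted_origin_setting(page_url):
    """Return True if the URL is plain HTTP on a non-loopback host.

    Simplified approximation: HTTPS pages and loopback HTTP pages
    (localhost, 127.0.0.1, ::1) only need the microphone permission,
    while other HTTP origins must first be marked as trusted.
    """
    parts = urlsplit(page_url)
    host = (parts.hostname or "").lower()
    if parts.scheme == "https":
        return False
    is_loopback = (
        host == "localhost"
        or host.endswith(".localhost")
        or host in ("127.0.0.1", "::1")
    )
    return parts.scheme == "http" and not is_loopback
```

For example, `needs_trusted_origin_setting("http://192.168.1.10")` returns `True`, which corresponds to the second case in the note.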
Select an available STT model in GPUStack by clicking the model dropdown at the top-right corner of the playground UI.
Copy the transcription results generated by the model.
Select the appropriate language for your audio file to optimize transcription accuracy.
After experimenting with audio files and parameters, click the View Code button to see how to call the API with the same input. Code examples are provided in curl, Python, and Node.js.
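As a point of comparison, here is a minimal Python sketch of a transcription call using only the standard library. It assumes an OpenAI-style endpoint at `/v1-openai/audio/transcriptions` that accepts a multipart form with `file`, `model`, and an optional `language` field, and that returns JSON containing a `text` field. The base URL, API key, model name, and the helper names `build_transcription_request` and `transcribe` are placeholders introduced for illustration. Prefer the code shown by View Code for your deployment.

```python
import json
import os
import uuid
import urllib.request

# Assumptions (not taken from this page): the server address, the
# "/v1-openai" path prefix, and the API key are placeholders.
BASE_URL = "http://your-gpustack-server/v1-openai"
API_KEY = "your-api-key"


def build_transcription_request(audio_bytes, filename, model, language=None):
    """Build (without sending) a multipart/form-data transcription request."""
    boundary = uuid.uuid4().hex
    fields = {"model": model}
    if language:
        fields["language"] = language  # language hint for the audio

    parts = []
    # Plain text form fields.
    for name, value in fields.items():
        parts.append(
            (
                f"--{boundary}\r\n"
                f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
                f"{value}\r\n"
            ).encode("utf-8")
        )
    # The audio file itself.
    parts.append(
        (
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
            "Content-Type: application/octet-stream\r\n\r\n"
        ).encode("utf-8")
        + audio_bytes
        + b"\r\n"
    )
    # Closing boundary.
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))

    return urllib.request.Request(
        f"{BASE_URL}/audio/transcriptions",
        data=b"".join(parts),
        headers={
            "Authorization": f"Bearer {API_KEY}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
        method="POST",
    )


def transcribe(path, model, language=None):
    """Upload the audio file at path and return the transcribed text."""
    with open(path, "rb") as f:
        audio = f.read()
    req = build_transcription_request(audio, os.path.basename(path), model, language)
    with urllib.request.urlopen(req) as resp:
        # Assumes a JSON response with a "text" field.
        return json.loads(resp.read().decode("utf-8"))["text"]

# Example call (requires a reachable server and a deployed STT model):
# print(transcribe("sample.wav", "my-stt-model", language="en"))
```

The multipart body is assembled by hand to avoid third-party dependencies. A library such as `requests` or the OpenAI Python client would shorten this considerably if it is available in your environment.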