Stable Audio Open is an open-source model optimized for generating short audio samples, sound effects, and production elements using text prompts.
Log in to HuggingFace and navigate to the model landing page:
https://huggingface.co/stabilityai/stable-audio-open-small
You may need to fill out a form with your contact information to use the model:
Download and copy the configuration file model_config.json
and the model itself, model.ckpt
, to your workspace directory, and verify they exist by running the command:
ls $WORKSPACE/model_config.json $WORKSPACE/model.ckpt
To test the model, use the Stable Audio demo site, which lets you experiment directly through a web-based interface:
https://stableaudio.com/
Use the UI to enter a prompt. A good prompt can include:
The order of prompt parameters matters. For more information, see the Prompt structure user guide .
You can explore training and inference code for audio generation models in the Stable Audio Tools repository .
Now that you’ve downloaded and tested the model, continue to the next section to convert the model to LiteRT.