Stable Audio 3
Zach Evans, Julian D. Parker, Matthew Rice, CJ Carr, Zack Zukowski, Josiah Taylor, Jordi Pons
Computer Science > Sound
Abstract: Stable Audio 3 is a family of fast latent diffusion models (small, medium, large) for variable-length audio generation and editing. Because our models can generate several minutes of audio, variable-length generation is key to avoiding the cost of producing full-length outputs for short sounds. We also support inpainting, enabling targeted audio editing and the continuation of short recordings. Our latent...
