Text-to-Speech Synthesis Using Diffusion Bridge Model

A model that outperforms autoregressive and diffusion models for high quality output that is structured, noiseless, and quick on inference.

Last updated