A fast speech-to-any translation model that supports simultaneous decoding and offers a 28× speedup.
Updated Aug 12, 2024 (Python)
Official repository for the paper "DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation".