https://arxiv.org/abs/2011.12985

FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge (Bichen Wu, Qing He, Peizhao Zhang, Thilo Koehler, Kurt Keutzer, Peter Vajda)

nonautoregressive flow와 autoregressive flow를 붙인 형태의 vocoder. quantization하고 갤럭시 S8에 올렸더니 real-time factor 0.7 정도가 나왔다고.

#non-autoregressive #vocoder #lightweight

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

201125 FBWave.md

201125 FBWave.md

Files

201125 FBWave.md

Latest commit

History

201125 FBWave.md

File metadata and controls