Official TalkNet 2 repo here
- Add masking to all QuartzNet Blocks.
- Add PostNet to Mel-Spectrogram generator.
- Clean up and modify all model implementations to follow best practices.
- Add Text and Audio processing code.
- Add dataloader and training code.
- Test the whole TalkNet 2 setup and post results.
@misc{beliaev2021talknet,
title={TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction},
author={Stanislav Beliaev and Boris Ginsburg},
year={2021},
eprint={2104.08189},
archivePrefix={arXiv},
primaryClass={eess.AS}
}