TalkNet 2 [WIP]

TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.

Official TalkNet 2 repo here

Work remains:

Add masking to all QuartzNet Blocks.
Add PostNet to Mel-Spectrogram generator.
Clean up and modify all model implementation as per best practices.
Add Text and Audio processing code.
Add dataloader and training code.
Test the whole Talknet2 setup and post result.

Citation:

@misc{beliaev2021talknet,
      title={TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model Stanislav Beliaev, Boris Ginsburgfor Speech Synthesis with Explicit Pitch and Duration Prediction}, 
      author={Stanislav Beliaev and Boris Ginsburg},
      year={2021},
      eprint={2104.08189},
      archivePrefix={arXiv},
      primaryClass={eess.AS}
}

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
embedding.py		embedding.py
model.py		model.py
module.py		module.py
quartznet.py		quartznet.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TalkNet 2 [WIP]

Work remains:

Citation:

About

Releases

Packages

Languages

License

rishikksh20/TalkNet2-pytorch

Folders and files

Latest commit

History

Repository files navigation

TalkNet 2 [WIP]

Work remains:

Citation:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages