Convert the GPU inference code to CPU inference using TorchScript: remove the amp- and cuda()-related parts and add TorchScript code in test_infer.py. It is based on: https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/SpeechSynthesis/Tacotron2
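A minimal sketch of the kind of change involved (not the repo's actual test_infer.py code; the stand-in model and input shapes below are placeholders): GPU-specific calls are dropped and the module is compiled with TorchScript for CPU execution.

```python
import torch
import torch.nn as nn

# Stand-in module so the example runs anywhere; in the repo the real models
# are Tacotron2 and WaveGlow loaded from the NGC checkpoints.
class TinyVocoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Conv1d(80, 1, kernel_size=1)

    def forward(self, mel):
        return self.net(mel)

# GPU version (before): model.cuda(), fp16/amp casts, inputs moved to CUDA.
# CPU version (after): no .cuda() and no amp; keep everything in fp32 on the
# CPU and compile the module with TorchScript so the graph can be saved/reused.
model = TinyVocoder().eval()

mel = torch.randn(1, 80, 300)                # dummy mel-spectrogram input
with torch.no_grad():
    scripted = torch.jit.trace(model, mel)   # TorchScript via tracing
    scripted.save("model_cpu.ts")            # serialized TorchScript module
    audio = scripted(mel)
print(audio.shape)                           # torch.Size([1, 1, 300])
```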
- Tacotron2 and WaveGlow checkpoints for inference can be downloaded from NGC and placed in the current directory:
https://ngc.nvidia.com/catalog/models/nvidia:tacotron2pyt_fp16/files?version=3
https://ngc.nvidia.com/catalog/models/nvidia:waveglow256pyt_fp16/files?version=2
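On a CPU-only machine the downloaded checkpoints have to be opened with map_location='cpu'. The snippet below is just a sanity check of a download; the 'state_dict' key is an assumption about the checkpoint layout, so verify it against your file.

```python
import torch

# Open a GPU-trained checkpoint on a machine without CUDA.
ckpt = torch.load("tacotron2_1032590_6000_amp", map_location="cpu")

# Inspect the contents; the weights are often stored under a 'state_dict'
# key, but the exact layout depends on how the checkpoint was saved.
if isinstance(ckpt, dict):
    print(list(ckpt.keys()))
```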
- You don't need to run the PyTorch NGC container for CPU inference; simply use the PyTorch CPU version (I am using 1.5) from: https://pytorch.org/get-started/locally/
- If you get the error ModuleNotFoundError: No module named 'dllogger', install dllogger by running: pip install 'git+https://github.com/NVIDIA/dllogger'
- Now you can run inference on CPU: $ python inference.py --tacotron2 tacotron2_1032590_6000_amp --waveglow waveglow_1076430_14000_amp --wn-channels 256 -o output/ -i phrases/phrase.txt
- You can also run the inference scripts for benchmarking, which could take quite a while:
bash test_infer.sh
bash run_latency_tests.sh (with different batch sizes)
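For a quick latency check without the full scripts, a plain timing loop over a saved TorchScript module is enough (purely illustrative; the module path, batch size, and input length below are arbitrary):

```python
import time
import torch

scripted = torch.jit.load("model_cpu.ts")   # module saved in the earlier sketch
mel = torch.randn(4, 80, 300)               # batch of 4 dummy inputs

with torch.no_grad():
    scripted(mel)                           # warm-up run
    start = time.perf_counter()
    for _ in range(10):
        scripted(mel)
    mean_s = (time.perf_counter() - start) / 10
print(f"mean latency per batch: {mean_s * 1000:.1f} ms")
```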