NVIDIA Neural Modules 1.12.0

@ericharper released this 10 Oct 22:11
dd9a30f

Container

For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo

docker pull nvcr.io/nvidia/nemo:22.08

ASR

Changelog

TTS

Changelog
  • [TTS] Use consistent spline interpolation for FastPitch and HiFi-GAN by @XuesongYang :: PR: #4679
  • TTS tokenizers moved to collections.common.tokenizers by @AlexGrinch :: PR: #4690
  • [TTS] Fix text normalizer bugs in TTS data loader by @rlangman :: PR: #4781
  • ARP to IPA mapping, g2p_encode for IPATokenizer by @ekmb :: PR: #4850
  • IPA G2P bugfixes by @redoctopus :: PR: #4869
  • [TTS] Add missing WikiHomograph data entries to CMUdict; update to match new IPA set by @ekmb :: PR: #4886
  • [TTS] Fix wrong g2p path by @XuesongYang :: PR: #4902
  • [TTS] FastPitch training: speed up align_prior_matrix calculation by @racoiaws :: PR: #4718
  • [TTS] Fix broken tutorial for MixerTTS by @XuesongYang :: PR: #4949
  • [TTS] Bugfix: 'EnglishPhonemesTokenizer' object has no attribute 'encode_from_g2p' by @XuesongYang :: PR: #4992
  • [TTS] Added missing German phoneme tokenizer by @XuesongYang :: PR: #5070
  • [TTS] Fixed wrong val loss for epoch 0 and inconsistent metric names by @XuesongYang :: PR: #5087

NLP / NMT

Changelog

Text Normalization / Inverse Text Normalization

Changelog
  • [TTS] Fix text normalizer bugs in TTS data loader by @rlangman :: PR: #4781
  • [Chinese text normalization] Chinese TN part in text_normalization by @mzxcpp :: PR: #4826
  • Fix zh tn by @yzhang123 :: PR: #5035
  • Bug fixes for parallel mp3 to wav conversion, PC notebook, update Readme for TN requirements by @ekmb :: PR: #5047
  • Added P&C lexical audio model by @jubick1337 :: PR: #4802

Export

Changelog

General Improvements

Changelog