Skip to content

NVIDIA Neural Modules 1.0.0rc1

Pre-release
Pre-release
Compare
Choose a tag to compare
@okuchaiev okuchaiev released this 07 Apr 05:55

Release 1.0.0rc1

This release contains major new models, features and docs improvements.
It is a "candidate" release for 1.0.0.

To install from Pip do:

apt-get update && apt-get install -y libsndfile1 ffmpeg
pip install Cython
pip install nemo_toolkit['all']==1.0.0rc1

It adds the following model architectures:

  • CitriNet and Conformer-CTC for ASR
  • HiFiGan, MelGan, GlowTTS, UniGlow SqueezeWave for TTS

In NLP collections, a neural machine translation task (NMT) has been added with Transformer-based models.
This release includes pre-trained NMT models for these language pairs (in both directions):

  • En<->Es
  • En<->Ru
  • En<->Zh
  • En<->De
  • En<->Fr

For ASR task, we also added QuartzNet models, trained on the following languages from Mozilla's Common Voice dataset: Zh, Ru, Es, Pl, Ca, It, Fr and De.
In total, this release adds 60 new pre-trained models.

This release also adds new NeMo tools for:

  • Text normalization
  • Dataset Creation Tool Based on CTC-Segmentation
  • Speech Data Explorer

Known Issues

This version is not compatible with PyTorch 1.8.* Please use 1.7.* with it or use our container.