NVIDIA Neural Modules 1.0.0rc1
Pre-release
Release 1.0.0rc1
This release contains major new models, features and docs improvements.
It is a "candidate" release for 1.0.0.
To install from pip, run:
apt-get update && apt-get install -y libsndfile1 ffmpeg
pip install Cython
pip install nemo_toolkit['all']==1.0.0rc1
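After running the commands above, it can be useful to confirm which version actually got installed. The following is a minimal sketch using only the Python standard library (Python 3.8+). The helper name `installed_version` is hypothetical and not part of NeMo; the distribution name `nemo_toolkit` is taken from the pip command above.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("nemo_toolkit")
    if found is None:
        print("nemo_toolkit is not installed")
    elif found == "1.0.0rc1":
        print("nemo_toolkit 1.0.0rc1 is installed")
    else:
        print(f"nemo_toolkit {found} is installed, expected 1.0.0rc1")
```

The check reads package metadata only, so it does not import NeMo or PyTorch and runs quickly even in a broken environment.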
This release adds the following model architectures:
- CitriNet and Conformer-CTC for ASR
- HiFiGan, MelGan, GlowTTS, UniGlow, and SqueezeWave for TTS
In the NLP collection, a neural machine translation (NMT) task has been added with Transformer-based models.
This release includes pre-trained NMT models for these language pairs (in both directions):
- En<->Es
- En<->Ru
- En<->Zh
- En<->De
- En<->Fr
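Because each pair is covered in both directions, the five pairs above correspond to ten translation directions. A minimal sketch that enumerates them is shown here; the helper name `translation_directions` and the lowercase language codes are illustrative assumptions, not NeMo identifiers.

```python
from typing import List, Tuple

# Language pairs listed in these release notes (English paired with each language).
PAIRS: List[Tuple[str, str]] = [
    ("en", "es"),
    ("en", "ru"),
    ("en", "zh"),
    ("en", "de"),
    ("en", "fr"),
]


def translation_directions(pairs: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Expand each pair into both (source, target) directions."""
    directions = []
    for a, b in pairs:
        directions.append((a, b))
        directions.append((b, a))
    return directions


if __name__ == "__main__":
    for src, tgt in translation_directions(PAIRS):
        print(f"{src} -> {tgt}")
```

The exact checkpoint names are not given in these notes. With NeMo installed, the model classes expose a `list_available_models()` class method that reports the names accepted by `from_pretrained`, which is safer than guessing a name.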
For the ASR task, we also added QuartzNet models trained on the following languages from Mozilla's Common Voice dataset: Zh, Ru, Es, Pl, Ca, It, Fr, and De.
In total, this release adds 60 new pre-trained models.
This release also adds new NeMo tools for:
- Text normalization
- Dataset creation tool based on CTC-Segmentation
- Speech Data Explorer
Known Issues
This version is not compatible with PyTorch 1.8.*. Please use PyTorch 1.7.* or our container.
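A quick way to catch this before training is to check the installed PyTorch version up front. The sketch below accepts only the 1.7 series, following the recommendation above; whether other versions work is not stated in these notes, so treating them as unsupported is an assumption. The helper name `is_supported_torch` is hypothetical.

```python
import re


def is_supported_torch(version: str) -> bool:
    """Return True if the version string belongs to the 1.7 series.

    Local build suffixes such as "+cu110" are ignored because only the
    leading "major.minor" part is parsed.
    """
    match = re.match(r"^(\d+)\.(\d+)", version)
    if match is None:
        return False
    major, minor = int(match.group(1)), int(match.group(2))
    return (major, minor) == (1, 7)


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed")
    else:
        status = "OK" if is_supported_torch(torch.__version__) else "not recommended for 1.0.0rc1"
        print(f"PyTorch {torch.__version__}: {status}")
```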