Skip to content

NVIDIA Neural Modules 1.11.0

Compare
Choose a tag to compare
@ericharper ericharper released this 08 Sep 17:06
· 2936 commits to main since this release

Container

For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo

docker pull nvcr.io/nvidia/nemo:22.07

ASR

Changelog
  • Add ASR CTC Decoding module by @titu1994 :: PR: #4342
  • Fixing bugs in calling method ctc_decoder_predictions_tensor. by @VahidooX :: PR: #4414
  • Fixed WER initialization in ASR_with_Nemo notebook by @anteju :: PR: #4523
  • Update signature of Hypothesis alignments by @titu1994 :: PR: #4511
  • Add support for ASR Adapter Auxiliary Losses by @titu1994 :: PR: #4480
  • Catalan ASR NGC Resource by @stevehuang52 :: PR: #4576
  • Add kw asr models, add itn ru checkpoint (tagger-based) by @bene-ges :: PR: #4595
  • Add DALI char dataset support to SSL model by @piraka9011 :: PR: #4592
  • Customize arguments for trimming the leading/trailing silence by @XuesongYang :: PR: #4582
  • Update Offline ASR with CTC Decoding by @titu1994 :: PR: #4608
  • Add Squeezeformer to ASR by @titu1994 :: PR: #4416
  • Fix ASR notebooks by @titu1994 :: PR: #4738
  • Add pretrained ASR models for Croatian by @anteju :: PR: #4682
  • Dataloader, collector, loss and metric for multiscale diarization decoder by @tango4j :: PR: #4187
  • Multilingual VAD model by @fayejf :: PR: #4734
  • Adding support for models trained with full context for cache-aware streaming. by @VahidooX :: PR: #4687
  • Fp16 support for Conformer by @bmwshop :: PR: #4571
  • Tiny VAD refactoring for postprocessing by @fayejf :: PR: #4625
  • Add silence handling for speaker diarization pipeline by @nithinraok :: PR: #4512
  • Add Bucketing support to TarredAudioToClassificationLabelDataset by @entn-at :: PR: #4465

TTS

Changelog

NLP / NMT

Changelog

Text Normalization / Inverse Text Normalization

Changelog

Export

Changelog

Bugfixes

Changelog
  • Wrong order of returned tuple for general_collate_fn. by @XuesongYang :: PR: #4388
  • Pitch, voiced_mask, prob_voiced have the same values which is not expected. by @XuesongYang :: PR: #4392
  • Fix tarred dataset len when num shards is not divisible by workers by @itzsimpl :: PR: #4553
  • Fix multiple dev/test datasets after restoring from checkpoint by @PeganovAnton :: PR: #4636
  • Fix/need different cache dirs for different datasets by @PeganovAnton :: PR: #4640
  • Improve mAES algorithm with patches by @titu1994 :: PR: #4662

General Improvements

Changelog