Skip to content

sarvamai/vpp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 

Repository files navigation

SARVAM_ASR_DATASETS

10 LANGUAGES: Hindi, Bengali, Gujarati, Kannada, Malayalam, Marathi, Odia, Tamil, Telugu, Punjabi

UNUSED:

PARTIALLY USED/ NEWER VERSION AVAILABLE:

  • commonvoice 17.0 has more languages and overall data. Only train.tsv is being used, though validated.tsv contains more data
  • spring_inx manifests from kaushal have been used which has less data
  • indictts newer version

PREPROCESSING:

  • Currently the script should be present in the folder where the dataset is downloaded
  • No scripts (maybe not needed) for indictts, shrutilipi

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published