Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation

A PyTorch Implementation of Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation

Preparing :

Prepare the training, development, and test datasets following the structure provided here. Each file should contain two columns:

  1. Path – The path to the audio file.
  2. Transcript – The corresponding transcript for the audio. Normalize it first (for example, remove all punctuation and convert to lowercase); otherwise you may need to modify the vocabulary.
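
The normalization and two-column layout described above could be sketched as follows. This is a minimal illustration, not the repository's own preprocessing: the function names `normalize_transcript` and `write_manifest`, the tab delimiter, and the absence of a header row are all assumptions, so check them against the dataset structure the project expects.

```python
import csv
import re
import string


def normalize_transcript(text: str) -> str:
    """Lowercase, drop ASCII punctuation, and collapse whitespace.

    Note: apostrophes count as punctuation here, so "it's" becomes "its".
    """
    text = text.lower()
    text = text.translate(str.maketrans("", "", string.punctuation))
    return re.sub(r"\s+", " ", text).strip()


def write_manifest(rows, out_path, delimiter="\t"):
    """Write (audio_path, transcript) pairs as a two-column file.

    The delimiter and the lack of a header row are assumptions;
    adjust them to match the format the training script reads.
    """
    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f, delimiter=delimiter)
        for audio_path, transcript in rows:
            writer.writerow([audio_path, normalize_transcript(transcript)])
```

If the vocabulary includes characters that this sketch strips (such as apostrophes), narrow the set of removed characters rather than changing the vocabulary.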

Training :

pip install -r requirements.txt
python3 train.py

Citations :

@article{SKD-CTC,
  title={Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation},
  author={Eungbeom Kim and Hantae Kim and Kyogu Lee},
  journal={INTERSPEECH 2024},
  year={2024},
}