Arabic-Phonetiser

Converts diacritised Arabic text to a sequence of phonemes and builds a pronunciation dictionary from them for alignment with HTK.

Usage

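  import importlib
  # The module file name contains a hyphen, so a plain import statement cannot load it.
  phonetise = importlib.import_module("phonetise-Arabic").phonetise
  phonemes = phonetise(Arabic_text)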

or

  phonetise-Buckwalter.py [inputfile]

[inputfile] should be a UTF-8 text file in which every line has the form:

  "[sound-filename]" "[arabic-text-in-buckwalter]"
  "[sound-filename]" "[arabic-text-in-buckwalter]"
  "[sound-filename]" "[arabic-text-in-buckwalter]"
  "[sound-filename]" "[arabic-text-in-buckwalter]"
  ...
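
As a rough illustration of the input format above, the following sketch writes such a file from Python. It assumes the double quotes shown above are literal characters in the file and that the two fields are separated by a single space; the helper names `format_input_line` and `write_input_file`, the sound file names, and the sample transliterations are all hypothetical and not part of this repository.

```python
import os
import tempfile


def format_input_line(sound_filename, buckwalter_text):
    """Return one input line: two double-quoted fields separated by a space.

    Assumption: the quotes in the documented format are literal characters.
    """
    return '"{}" "{}"'.format(sound_filename, buckwalter_text)


def write_input_file(path, utterances):
    """Write (sound_filename, buckwalter_text) pairs as a UTF-8 text file."""
    with open(path, "w", encoding="utf-8") as handle:
        for sound_filename, buckwalter_text in utterances:
            handle.write(format_input_line(sound_filename, buckwalter_text) + "\n")


# Hypothetical utterances; the file names and transliterations are placeholders.
utterances = [
    ("utt001.wav", "kataba"),
    ("utt002.wav", "baAbN"),
]
input_path = os.path.join(tempfile.gettempdir(), "phonetiser-input.txt")
write_input_file(input_path, utterances)
```

The resulting file could then be passed as [inputfile] to phonetise-Buckwalter.py.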

The output is two files:

dict: the sorted pronunciation dictionary, with a carriage return at the end, for use with tools like HTK.

utterance-pronunciations.txt: a file in which every line has the form:

  "[sound-filename]" "[phoneme-sequence]"
  "[sound-filename]" "[phoneme-sequence]"
  "[sound-filename]" "[phoneme-sequence]"
  "[sound-filename]" "[phoneme-sequence]"
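
A downstream script may need to read utterance-pronunciations.txt back in. The sketch below parses one line of the format shown above. It assumes the quotes are literal and that phonemes inside the second field are separated by whitespace; neither detail is confirmed here. The name `parse_pronunciation_line` and the sample line are hypothetical.

```python
import re

# Two double-quoted fields separated by whitespace (assumed layout).
LINE_PATTERN = re.compile(r'^"([^"]*)"\s+"([^"]*)"\s*$')


def parse_pronunciation_line(line):
    """Split one line into (sound_filename, list_of_phonemes).

    Assumption: phonemes in the second field are whitespace-separated.
    """
    match = LINE_PATTERN.match(line.rstrip("\r\n"))
    if match is None:
        raise ValueError("unexpected line format: {!r}".format(line))
    sound_filename, phoneme_sequence = match.groups()
    return sound_filename, phoneme_sequence.split()


# Hypothetical line; the phoneme symbols are placeholders, not the tool's real inventory.
sample_line = '"utt001.wav" "k a t a b a"\n'
sound_filename, phonemes = parse_pronunciation_line(sample_line)
```

A regular expression is used rather than `shlex.split` because Buckwalter-style symbols include the apostrophe, which a shell-style tokenizer would treat as a quote character.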

License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
