Santali Language (Ol Chiki script) OCR #153

Open
Prasanta-Hembram opened this issue Jun 1, 2020 · 0 comments

@Prasanta-Hembram

Hello everyone! I am new to coding, but when I learned about Tesseract I thought I would give it a try. I have the same issue as Balinese Script OCR #152, but in my case I am using jTessBoxEditor 2.2.1 with Noto Sans Ol Chiki as the main Unicode font. In fact, this language has many Unicode fonts. I have followed Indic-ocr, but I have been unable to contact them to ask how they created and trained the Santali language data, and they have not mentioned the sat.traineddata version. I searched for langdata in all the repositories but found none. I have tried to train this language myself, but I get too many errors. What is the most reliable way to train this language?

Fonts list: https://github.com/indicocr/tessdata/tree/master/sat
