Skip to content

Santali Language (Ol Chiki script) OCR #153

@Prasanta-Hembram

Description

@Prasanta-Hembram

Hello everyone!!!! I am new to coding but when i came to know about Tesseract i thought lets have a try, i have also same issue like Balinese Script OCR #152 but in my case i use jTessBoxEditor 2.2.1 and i have Noto sans Ol Chiki as main Unicode font. In fact this language has many Unicode font. I have followed Indic-ocr but unable to contact them that how they created and trained Santali language, also they have not mentioned sat.traineddata version. I tried to search langdata in all respository but found none. I have tried to train this language but getting too many error. What is the best error free way to train this language.

Fonts list :https://github.com/indicocr/tessdata/tree/master/sat

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions