Hi,
I want to develop OCR for the Balinese script (https://en.wikipedia.org/wiki/Balinese_script) using Tesseract 4.0 and the jTessBoxEditor 2.2.1 tool (does it still not support LSTM?).
There are two fonts involved (see the attachment):
- Bali Simbar Dwijendra (glyph shapes quite close to the ancient Balinese glyphs; the most popular font in Bali, but non-Unicode)
- Noto Serif Balinese (fairly modern glyphs, and already uses the Balinese Unicode block)
I want to accommodate both fonts, with priority given to Bali Simbar Dwijendra. Sorry, I am new to Tesseract; my question is, how do I get started?
Thank you very much for your kind attention.
Best regards, Indra
bali-simbar-dj-noto-serif-balinese.zip