Balinese Script OCR

Hi,
I want to develop OCR for the Balinese script (https://en.wikipedia.org/wiki/Balinese_script) using Tesseract 4.0 and the jTessBoxEditor 2.2.1 tool (does it still not support LSTM?).

There are two fonts involved (see the attachment):
1. Bali Simbar Dwijendra (glyph shapes quite close to the ancient Balinese glyphs; the most widely used font in Bali, but non-Unicode)
2. Noto Serif Balinese (more modern glyphs; already uses the Balinese Unicode block)
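
To illustrate why the encoding difference matters for preparing training text, here is a minimal Python sketch. It assumes that text typed for a non-Unicode font is stored as code points outside the Balinese Unicode block (U+1B00 to U+1B7F), while text for Noto Serif Balinese uses that block. The function names `is_balinese` and `balinese_ratio` are hypothetical helpers for illustration and are not part of Tesseract.

```python
# Balinese Unicode block range (U+1B00..U+1B7F).
BALINESE_START = 0x1B00
BALINESE_END = 0x1B7F


def is_balinese(ch: str) -> bool:
    """Return True if the single character lies in the Balinese Unicode block."""
    return BALINESE_START <= ord(ch) <= BALINESE_END


def balinese_ratio(text: str) -> float:
    """Fraction of non-whitespace characters that are in the Balinese block.

    Hypothetical helper: a low ratio suggests the text is not Unicode
    Balinese (for example, it may be encoded for a legacy font).
    """
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    return sum(is_balinese(c) for c in chars) / len(chars)


if __name__ == "__main__":
    print(balinese_ratio("\u1B13\u1B05"))  # two characters from the Balinese block
    print(balinese_ratio("abc"))           # Latin characters only
```

A check like this could help separate the two kinds of ground-truth text before training, since the same glyph would otherwise correspond to different code points depending on the font.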

I want to accommodate both fonts, with priority given to Bali Simbar Dwijendra. Sorry, I am new to Tesseract; my question is: how do I get started?

Thank you very much for your kind attention.

Best regards, Indra

[bali-simbar-dj-noto-serif-balinese.zip](https://github.com/tesseract-ocr/langdata/files/4341087/bali-simbar-dj-noto-serif-balinese.zip)


Balinese Script OCR #152