phonetics training - problem with superscript letters #4287

Open
Klemich opened this issue Jul 17, 2024 · 0 comments

Your Feature Request

Hi,

I'm currently building a training set for phonetic transcriptions. For now I'm focusing on the languages I need to OCR rather than the whole International Phonetic Alphabet, though I may extend it later.

The problems I've noticed so far concern superscript numbers and superscript letters such as ʰ or ʷ.
My training set is probably too small, but the algorithm doesn't seem to be able to distinguish h from ʰ, or w from ʷ.

An i next to a superscript number seems to blend with it, and the output is a 1 followed by a superscript dash.

The text I need to OCR looks like this: tsɔŋ²¹tsi²¹pʰui⁵³
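One thing that may be worth ruling out, purely as an assumption since nothing in this report confirms it, is that some step of the training pipeline applies Unicode compatibility normalization (NFKC) to the ground-truth text. Under NFKC, ʰ folds to h, ʷ to w, and superscript digits to plain digits, which would make the pairs indistinguishable no matter how large the training set is. The minimal sketch below uses only Python's standard `unicodedata` module. The helper names `describe` and `folded_by_nfkc` are hypothetical, introduced for illustration.

```python
import unicodedata

# Sample string taken from the report above.
sample = "tsɔŋ²¹tsi²¹pʰui⁵³"


def describe(text):
    """Return (character, code point, Unicode name) for each character."""
    return [
        (ch, f"U+{ord(ch):04X}", unicodedata.name(ch, "UNKNOWN"))
        for ch in text
    ]


def folded_by_nfkc(text):
    """Map each character that NFKC would change to its NFKC form."""
    folded = {}
    for ch in text:
        normalized = unicodedata.normalize("NFKC", ch)
        if normalized != ch:
            folded[ch] = normalized
    return folded


if __name__ == "__main__":
    for row in describe(sample):
        print(*row)
    # Characters that would lose their superscript status under NFKC.
    print(folded_by_nfkc(sample))
```

If the mapping printed at the end is not empty and the pipeline does normalize with NFKC (or NFKD), the superscript characters never reach the model as distinct classes. Canonical normalization (NFC) leaves this sample unchanged, so it would not cause the same problem.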

Can you help me?

Thanks
