Using Tesseract for Handwriting Recognition

Using Tesseract for handwriting recognition

In short, you would have to train the Tesseract engine to recognize the handwriting. Take a look at this link:

Tesseract handwriting with dictionary training

This is what the linked post says:

It's possible to train tesseract to recognize handwriting. Here are
the instructions:
https://tesseract-ocr.github.io/tessdoc/Training-Tesseract

But don't expect very good results. Academics have typically gotten
accuracy results topping out about 90%. Here are a couple references
for words and numbers. So if your use case can deal with at least 1/10
errors, this might work for you.

Also here is a good academic article written on this subject:

Recognition of Handwritten Textual Annotations using Tesseract
Open Source OCR Engine for information Just In Time (iJIT)

Tesseract handwriting with dictionary training

It's possible to train tesseract to recognize handwriting. Here are the instructions: https://tesseract-ocr.github.io/tessdoc/Training-Tesseract

But don't expect very good results. Academics have typically gotten accuracy results topping out about 90%. Here are a couple references for words and numbers. So if your use case can deal with at least 1/10 errors, this might work for you.

Tesseract OCR - Handwritten font

Like Andrew Cash mentioned, it'll be very hard to perform OCR for that T letter because of its intersection with a number of next characters.

For results improvement you may want to try a more accurate SDK. Have a look at ABBYY Cloud OCR SDK, it's a cloud-based OCR SDK recently launched by ABBYY. It's in beta, so for now it's totally free to use. I work @ ABBYY and can provide you additional info on our products if necessary. I've sent the image you've attached to our SDK and got this response:

Maximal size: lall (35)


Related Topics



Leave a reply



Submit