Question 1

How accurate is the OCR?

Accepted Answer

For clear, high-resolution printed text, accuracy typically exceeds 95%. Accuracy drops for low-resolution scans, unusual fonts, or handwritten text. Using 2× or 3× scale images from the PDF to JPG tool improves results on scanned documents.

Question 2

Does it work on handwriting?

Accepted Answer

Tesseract is optimised for printed text. Neat, large handwriting may partially recognise, but accuracy on handwriting is generally much lower than on printed documents.

Question 3

Why does it take a moment to start?

Accepted Answer

The Tesseract language model is about 4 MB and is downloaded once on first use. After that it is cached in your browser and recognition starts immediately.

Question 4

Which languages are supported?

Accepted Answer

English, French, German, Spanish, Portuguese, Italian, Simplified Chinese, Japanese, and Arabic. Select your language before clicking Extract Text.

Extract Text (OCR)

Related Tools

How to Use

Frequently Asked Questions

How accurate is the recognition?

Does it handle handwriting?

Why does it take a moment before it starts?

Which languages are supported?