I am trying to read out the text contained in the following image:
I want to use tesseract (https://code.google.com/p/tesseract-ocr/) for this but it only gives the following result:
nn nx as
nn nx as
nn nx as
nn nx sx
nn nx sx
nn nz nn
nn nz nz
nn nz ns
nn nz wn
nn nz ws
nn nz zn
nn nz as
nn nz 57
nn ns ns
nn ns u
Does someone with some experience in tesseract / OCR have any advice?
Thank you ezacaria, your solution worked like a charm. (downloading tesseract-ocr-3.02.eng.tar.gz manually and moving its files to tessdata)From my point tesseract-data(-eng) is incomplete. (Interestingly, this problem did not consistently appear with every type of input while using tesseract...)
lxz
https://bbs.archlinux.org/profile.php?id=39875
2013-01-08T10:54:08Z
I trying to train tesseract for a new font which can be used in my android app. i need to train for digits only. so i had created one training image, box file and unicharset file.
i have followed this http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract2.
when I tried to run tesseract it says, bad read of inttemp!. How can i create one?
I have some code in C++ which used Tesseract (desktop application). I want to use this code in my android app. So I have put these C++ source code to JNI folder and configured Android.mk file based on the NDK example but I have trouble with build this project.
Tesseract is the best program for converting image to text, on Ubuntu/Linux. I’ve tried several OCR (Optical Character Recognition) applications but its accuracy is certainly higher than any other applications.
Tesseract is a simple and easy to use command line utility. It’s cross-platform application, and of course – it’s a free and open source software!
I tried setting the TF700's screen to maximum brightness (IPS+ mode) and I still couldn't really watch movies when I was sitting on a bus around 2-3pm. I could even see my own reflection in the screen (the TF700 screen can be used as a mirror!!!). I just want to have a sanity check, am I asking too much from the screen?
Im using tesseract-ocr package on Ubuntu Linux, I have been using it for a while and I think that in order to improve the accuracy of the OCR I only need a subset of letters from the alphabet.
Whether tesseract-ocr 2.03 can be installed in Fedora-11 and if so how to do?
I installed - but failed to run tesseract phototest.tif test.
I am still new to Linux and have never built, compiled or made a package and have little idea what these things mean.
However, the distro I am on (Lubuntu) does not have tesseract 3.