tesseract: An OCR Engine that was developed at HP Labs between 1985 and 19951

Package available in: [trunk] [8.0] [7.0]

The Tesseract OCR engine was one of the top 3 engines in the 1995 UNLV Accuracy test. Since then it has had little work done on it, but it is probably one of the most accurate open source OCR engines available.

... part of T2, get it here

URL: http://code.google.com/p/tesseract-ocr/

Author: Ray Smith <theraysmith [at] users [dot] sourceforge [dot] net>
Maintainer: Rene Rebe <rene [at] t2-project [dot] org>

License: APL
Status: Beta
Version: 2.04

Remark: Does cross compile (as setup and patched in T2).

Download: http://tesseract-ocr.googlecode.com/files/ tesseract-2.04.tar.gz
Download: http://tesseract-ocr.googlecode.com/files/ tesseract-2.00.nld.tar.gz
Download: http://tesseract-ocr.googlecode.com/files/ tesseract-2.00.spa.tar.gz
Download: http://tesseract-ocr.googlecode.com/files/ tesseract-2.00.deu.tar.gz
Download: http://tesseract-ocr.googlecode.com/files/ tesseract-2.00.ita.tar.gz
Download: http://tesseract-ocr.googlecode.com/files/ tesseract-2.00.fra.tar.gz
Download: http://tesseract-ocr.googlecode.com/files/ tesseract-2.00.eng.tar.gz

T2 source: gcc41.patch
T2 source: gcc44.patch
T2 source: tesseract.cache
T2 source: tesseract.conf
T2 source: tesseract.desc

Build time (on reference hardware): 40% (relative to binutils)2

Installed size (on reference hardware): 20.59 MB, 341 files

Dependencies (build time detected): 00-dirtree bash binutils bzip2 coreutils diffutils findutils gawk gcc glibc grep leptonlib libjpeg libpng libtiff linux-header make mktemp net-tools patch sed sysfiles tar zlib

Installed files (on reference hardware): [show]

1) This page was automatically generated from the T2 package source. Corrections, such as dead links, URL changes or typos need to be performed directly on that source.

2) Compatible with Linux From Scratch's "Standard Build Unit" (SBU).