Übersicht Übersicht Ebene Ebene
Software für OS/2 und eCS
Download details

Tesseract Tesseract

Tesseract ist ein freies Texterkennungsprogramm. Es ist ein reines Zeichenerkennungsprogramm ohne Analyse von Seitengestaltung oder die Verwendung statistischer Sprachmodelle sowie ohne grafische Benutzeroberfläche, das jedoch auf Zeichenebene sehr gute Ergebnisse bietet. Es dient als Texterkennungsmodul in OCRopus, das zusätzlich noch Analyse der Dokumentgliederung und statistische Sprachmodelle bietet. The archive contains four packages: - Tesseract executable (contains English language data). - Version 2.04 - Additional executables for OCR-Training. Not all work under OS/2. - Language Data German 2.00 - Language Data German (Fraktur) 2.01 The Archive searches for the following Language-Data-WPIs in the same directory: German, Old German ("Fraktur"), French, Italian, Spanish, Dutch and Portuguese (Brazilian).
Version  2.04     
Dateigröße  4 MB     
Sprache  Englisch  Erstelldatum  14.05.2015 11:28:51 
Versionsdatum  21.07.2009 00:00:00     
Installertyp  WarpIN  Pakettyp  ZIP 
Lizenz  Apache external     
    MD5 Prüfsumme  546eb4303f20a59cc78d6d0fcfe89472 
    SHA1 Prüfsumme  d7aca106bc25316711250f68f1db05ea67e1c9a0 
Dateiname: tesseract.zip
Dateigröße: 4 MB
Download Download melden
This application requires: gcc-3.3.5-csd3-runtime (libc063.dll)

June 2006 - V1.0 of open source Tesseract checked-in.
Sep 7 2006 - V1.01.
Added mfcpch.cpp and getopt.cpp for VC++.
Fixed problem with greyscale images and no libtiff.
Stopped debug window from being used for the usage output.
Fixed load of inttemp for big-endian architectures.
Fixed some Mac compilation issues.
Oct 4 2006 - V1.02
Removed dependency on Aspirin.
Fixed a few missing Apache license headers.
Removed .
Feb 2 2007 - V1.03
Added mftraining and cntraining.
Added baseapi with adaptive thresholding for grey and color.
Fixed many memory leaks.
Fixed several bugs including lack of use of adaptive classifier.
Added ifdefs to eliminate graphics code and add embedded platform support.
Incorporated several patches, including 64-bit builds, Mac builds.
Minor accuracy improvements.
May 15 2007 - V1.04
Added dll exports for Windows.
Fixed name collisions with stl etc.
Made some preliminary changes ready for unicodeization.
Several bug fixes discovered during unicodeization.
July 02 2007 - V2.00
Converted internal character handling to UTF8.
Trained with 6 languages.
Added unicharset_extractor, wordlist2dawg.
Added boxfile creation mode.
Added UNLV regression test capability.
Fixed problems with copyright and registered symbols.
Fixed extern "C" declarations problem.
August 27 2007 - V2.01
Fixed UTF8 input problems with box file reader.
Fixed various infinite loops and crashes in dawg code.
Removed include of config_auto.h from host.h.
Added automatic wctype encoding to unicharset_extractor.
Fixed dawg table too full error.
Removed svn files from tarball.
Added new functions to tessdll.
Increased maximum utf8 string in a classification result to 8.

January 23 2008 - V2.02
Improvements to clustering, training and classifier.
Major internationalization improvements for large-character-set
languages, eg Kannada.
Removed some compiler warnings.
Added multipage tiff support for training and running.
Updated graphics output to talk to new java-based viewer.
Added ability to save n-best lists.
Added leptonica support for more file types.
Improved Init/End to make them safe.
Reduced memory use of dictionaries.
Added some new APIs to TessBaseAPI.
April 21 2008 - V2.02 (again)
Fixed namespace collisions with jpeg library (INT32).
Portability fixes for Windows for new code.
Updates to autoconf system for new code.
April 22 2008 - V2.03
Fixed crash introduced in 2.02.
Fixed lack of tessembedded.cpp in distribution.
Added test for leptonica header files and conditional test for lib.
June 30 2009 - V2.04
Integrated bug fixes and patches and misc changes for portability.
Integrated a patch to remove some of the "access" macros.
Removed dependence on lua from the viewer, speeding it up
dramatically.
Fixed the viewer so it compiles and runs properly!
Specifically fixing issues: 1, 63, 67, 71, 76, 81, 82, 106, 111,
112, 128, 129, 130, 133, 135, 142, 143, 145, 147, 153, 154, 160,
165, 170, 175, 177, 187, 192, 195, 199, 201, 205, 209, 108, 169

Zum Seitenanfang