tesseract

Here are 499 public repositories matching this topic...

wincentbalin commented Jul 16, 2018

Short description

I am trying to train Tesseract on the Akkadian language. I modified the language-specific.sh script accordingly. When converting the training text to TIFF images, the text2image program crashes.

Environment

  • Tesseract Version: 3.04.01
  • Commit Number: the standard package in Ubuntu, package version 3.04.01-4, commit unknown
  • Platform: Linux ubuntu

Java OCR recognition component (based on the Tesseract OCR engine). It cleans up images and recognizes the content of CAPTCHA verification-code images in one integrated step.

  • Updated May 5, 2020
  • Java
dcjm commented Apr 6, 2020

I am raising this as an issue but I do have a fix for the problem so it could be a pull request. However, this is a full explanation of the problem in case you want to try a different approach.

When running ccextractor on some ts files I found two cases where malloc was reporting memory corruption. This was tested first with the 0.88 release and then with git master. I ran valgrind and got t
