Skip to content

Tesseract 4.0 segfault when using dictionary ces and psm 7 #1154

@nektor211

Description

@nektor211

Environment

  • Tesseract Version:
    tesseract 4.00.00alpha
    leptonica-1.74.4
    libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : libtiff 4.0.6 : zlib 1.2.8

  • Commit Number:
    2a77d5a - found after bisecting from current master

  • Platform:
    Linux mc01 4.4.0-89-generic Warning in pixReadMemJpeg #112-Ubuntu SMP Mon Jul 31 19:38:41 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux

Current Behavior:

Segfault when running:

$tesseract bad.png - -l ces -psm 7
Warning. Invalid resolution 0 dpi. Using 70 instead.
contains_unichar_id(unichar_id):Error:Assert failed:in file ../ccutil/unicharset.h, line 513
Segmentation fault (core dumped)

Ces data were downloaded from https://github.com/tesseract-ocr/tessdata/raw/4.00/ces.traineddata .

bad

Expected Behavior:

$ tesseract bad.png - -l ces -psm 7
Warning. Invalid resolution 0 dpi. Using 70 instead.
nguzge gbopf/IUMNIC CZ spol. s r.o.

Suggested Fix:

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions