From: Antonio Diaz Diaz
Subject: Re: [Bug-ocrad] The function ignore_wide_blobs() doth ignore too much, methinks
Date: Sat, 28 Aug 2010 17:17:31 +0200
User-agent: Mozilla/5.0 (X11; U; Linux i586; en-US; rv:1.7.11) Gecko/20050905

Hello Tilman,

Tilman Hausherr wrote:
> I researched the issue why, for some images with tables and grey (noisy) areas, OCRAD returns no text at all, although some of the texts are in clean white areas. I was able to focus on a part in ignore_wide_blobs(), which apparently decides about whether a wide blob is an "image" (I assume you mean a photograph) or a frame. In my case, the function makes a "wrong" decision and then completely deletes blobp_vector.

Did you try the "--layout" or "--cut" options?

> Commenting out the "if" line does solve the problem with the test image, obviously - but what are the risks? Getting a lot of useless output? Or losing on speed?

Getting a lot of useless output. A photograph can produce thousands of noise blobs.

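To make the trade-off concrete, here is a minimal sketch of an image-versus-frame decision of the kind discussed above. It is not OCRAD's ignore_wide_blobs(): the Box type, the function names and both thresholds (half the page width, one quarter ink) are hypothetical and chosen only for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical blob record; this is NOT OCRAD's Blob class.
struct Box {
    int left, top, right, bottom;  // inclusive bounding box
    long ink;                      // black pixels belonging to the blob
    int width() const { return right - left + 1; }
    int height() const { return bottom - top + 1; }
    long size() const { return static_cast<long>(width()) * height(); }
    bool contains(const Box& o) const {
        return o.left >= left && o.right <= right &&
               o.top >= top && o.bottom <= bottom;
    }
};

// Assumed rule: a blob spanning at least half the page width is "wide".
bool is_wide(const Box& b, int page_width) {
    return 2 * b.width() >= page_width;
}

// Assumed rule: a wide blob that is at least one quarter ink is an image;
// a sparser one is treated as a frame (for example a table border).
bool looks_like_image(const Box& b) {
    return 4 * b.ink >= b.size();
}

// Wide blobs are never returned. Blobs lying inside a wide blob judged to
// be an image are dropped as noise; blobs inside a frame are kept.
std::vector<Box> filter_wide_blobs(const std::vector<Box>& blobs,
                                   int page_width) {
    std::vector<Box> images;
    for (const Box& b : blobs)
        if (is_wide(b, page_width) && looks_like_image(b))
            images.push_back(b);

    std::vector<Box> kept;
    for (const Box& b : blobs) {
        if (is_wide(b, page_width)) continue;
        const bool inside_image = std::any_of(
            images.begin(), images.end(),
            [&b](const Box& im) { return im.contains(b); });
        if (!inside_image) kept.push_back(b);
    }
    return kept;
}
```

With a rule like this, a noisy grey table whose outline is dense enough to cross the ink threshold is classified as an image, and the text inside it is discarded along with the noise. Removing the test keeps that text, at the price of also keeping every noise blob from real photographs.
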
Best regards, Antonio.