Add barcode detection to OCR #651

josegomezr · 2024-08-31T08:03:31Z

LocalOCRService can now detect 1D barcodes and QR Codes via pyzbar.

During our investigation we found that QR detection can be impaired if the QR code contains artifacts/noise within the code. We've included a parametrizable processing pipeline to minimize artifacts and improve scanner performance.

Before (not detected by zbar):

After (detected):

The results of pyzbar are appended at the end of the OCR output in the form of:

--- QRCODE CODE ---
QUALITY: 1
ORIENTATION: UP
POSITION: [278, 18549, 1352, 1386]
DATA: 190401021.01.1.0001!54,3,0,0,2,0,0,0,3,0,0,1,4,0,0,0,0,1,0,0,0,1,0,0,0,0,0,0,1,0,0,0,0,2,2,68,1,0!0!0

Note: result extracted from the sample images above
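A minimal sketch of how one result could be rendered into the block above. It assumes each result exposes `type`, `data` (bytes), `rect`, `quality` and `orientation` attributes, and that POSITION is `[left, top, width, height]`; the `Rect` and `Decoded` namedtuples and the `format_barcode` helper are hypothetical stand-ins, not the code in this PR.

```python
from collections import namedtuple

# Hypothetical stand-ins that mirror the fields shown in the sample output.
Rect = namedtuple("Rect", "left top width height")
Decoded = namedtuple("Decoded", "data type rect quality orientation")


def format_barcode(result):
    """Render one decoded barcode as a text block appended to the OCR output."""
    rect = result.rect
    # Assumption: POSITION is the bounding box as [left, top, width, height].
    position = [rect.left, rect.top, rect.width, rect.height]
    # The payload arrives as bytes; replace undecodable bytes instead of failing.
    data = result.data.decode("utf-8", errors="replace")
    return "\n".join(
        [
            f"--- {result.type} CODE ---",
            f"QUALITY: {result.quality}",
            f"ORIENTATION: {result.orientation}",
            f"POSITION: {position}",
            f"DATA: {data}",
        ]
    )
```

Several results could then be joined with blank lines and appended after the OCR text.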

We also have some preliminary timings for this change. In the best case (pyzbar finds codes on the first try) it takes ~450ms more than current master; if it needs to go through the processing pipeline, every attempt takes ~1.5s.

Using python-opencv and numpy improves the speed of the image filters, but we wanted to keep the changes to a minimum at first.

Open Questions:

Are log messages ok?
Is the logic aligned with your style? I tried my best to match it

- The barcode detection routine tries multiple times (seven, to be precise) to find a code, preprocessing the image with the following filters on each attempt:
  1. Preserve luminance channel
  2. Gaussian blur (pre)
  3. Parametrizable binary filter (this is the filter adjusted on every iteration)
  4. [Dilatation & Erosion](https://docs.opencv.org/4.x/db/df6/tutorial_erosion_dilatation.html)
  5. 2x resize
  6. 1/2 downsize with linear interpolation
  7. Gaussian blur (post)

  It then appends the detected code to the end of the OCR scan for the image.
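The retry logic described above can be sketched roughly as follows. This is an illustration under assumptions, not the PR's code: `detect_with_retries` is a hypothetical name, and the `decode` and `preprocess` callables are passed in so the loop stays independent of pyzbar and Pillow.

```python
# Same expression as the THRESHOLDS constant in the PR: seven values.
THRESHOLDS = list(range(32, 230, 32))


def detect_with_retries(image, decode, preprocess, thresholds=THRESHOLDS):
    """Try the raw image first, then one preprocessed copy per threshold.

    decode(image) returns a list of results (empty if nothing was found).
    preprocess(image, threshold) returns a filtered copy of the image.
    """
    results = decode(image)
    if results:
        # Best case: found on the first try, no filtering needed.
        return results
    for threshold in thresholds:
        results = decode(preprocess(image, threshold))
        if results:
            return results
    # Nothing found on the raw image or on any filtered copy.
    return []
```

With pyzbar, `decode` would be `pyzbar.decode` and `preprocess` a function that runs the filter chain with the given binary threshold.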

Rewritten opencv & numpy based image processing filters with Pillow instead. It's a bit slower but it reduces the dependencies to only `libzbar0`.

stchris · 2024-10-22T14:12:01Z

Hi @josegomezr and thanks for your PR. First of all I want to apologize for the late reply and thank you for a very interesting addition. I'm fine with the changes overall. I think the main problem is that ingest-file is not (yet) configurable with feature flags. Everyone's data is different, and I'd be hesitant to make ingest times longer.

Would you be up for adding a setting along the lines of

ENABLE_ZBAR=0 # uses pyzbar to detect bar codes and QR codes

and then only doing the detection if that setting is enabled?
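One possible shape for such a flag, as a sketch only: it reads `ENABLE_ZBAR` straight from the environment and defaults to off. The `zbar_enabled` helper and the set of accepted truthy strings are assumptions; the project may prefer its own settings helper.

```python
import os


def zbar_enabled(environ=os.environ):
    """Return True only if ENABLE_ZBAR is set to a truthy value (default: off)."""
    value = environ.get("ENABLE_ZBAR", "0")
    # Assumption: these strings count as "enabled"; anything else is off.
    return value.strip().lower() in {"1", "true", "yes", "on"}
```

The OCR path would then call the barcode detection only when `zbar_enabled()` returns True, so ingest times stay unchanged by default.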

I will leave other comments inline.

stchris · 2024-10-22T14:17:19Z

ingestors/support/ocr.py

+        # no results found then
+        return []
+
+    def extract_barcodes(self, image):


I'd appreciate adding type hints here. Assuming image is a PIL.Image and the return type is a str?

(minor) Since this is public, it would be great to add a docstring saying what it does; at first glance I wouldn't necessarily expect it to return text. (Perhaps extract_text_from_barcodes is more appropriate?)

stchris · 2024-10-22T14:18:54Z

ingestors/support/ocr.py

@@ -45,6 +48,81 @@ def extract_ocr_text(self, data, languages=None):
        return stringify(text)


+class ZBarDetectorService(object):
+    THRESHOLDS = list(range(32, 230, 32))


Would you be able to document where these values come from and what they represent?
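For reference, the expression expands to seven evenly spaced values. The comment wording below is only a guess at their meaning (cutoffs for the binary filter, one per attempt, based on the PR description); the author would need to confirm it.

```python
# Assumption: one binary-filter cutoff per attempt, on a 0-255 luminance scale.
THRESHOLDS = list(range(32, 230, 32))

print(THRESHOLDS)  # [32, 64, 96, 128, 160, 192, 224]
```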

stchris · 2024-10-22T14:20:36Z

ingestors/support/ocr.py

+        results = pyzbar.decode(image)
+        # Found it at first try
+        if len(results) > 0:
+            log.info("OCR: zbar found (%d) results at first shot", len(results))


(minor thing but) I'd lower all of the logging calls to log.debug.

stchris · 2024-10-22T14:21:42Z

ingestors/support/ocr.py

@@ -45,6 +48,81 @@ def extract_ocr_text(self, data, languages=None):
        return stringify(text)


+class ZBarDetectorService(object):


It would be great to have a few test cases for this.

josegomezr added 2 commits August 30, 2024 23:45

PILlowing OpenCV image filters

f80e427


stchris requested review from stchris and catileptic September 24, 2024 09:45

catileptic removed their request for review October 22, 2024 10:31

catileptic assigned stchris Oct 22, 2024

stchris reviewed Oct 22, 2024
