Stitch ocr detections workflow block #765

reiffd7 · 2024-10-31T21:54:43Z

Description

I created a transformation workflow block called "Stitch OCR Detections". At its core, this block combines detection class names into a text string based on where the detections are located spatially. It will be useful for object detection OCR models where class names are characters. It allows for user input to dictate how the results will look based on language of the text and size of the image.

This transformation takes OCR detection results and reconstructs the original text by:

Grouping text detections into rows based on their vertical (y) positions
Sorting detections within each row by horizontal (x) position
Concatenating the detected text in reading order

The block supports two configurable parameters:

Reading Direction (dropdown)

"left_to_right": Standard left-to-right reading (e.g., English)
"right_to_left": Right-to-left reading (e.g., Arabic)
"vertical_top_to_bottom": Vertical reading from top to bottom
"vertical_bottom_to_top": Vertical reading from bottom to top

Tolerance (integer)

Controls how close detections need to be vertically (in pixels) to be considered part of the same line of text. A higher tolerance will group detections that are further apart vertically.

This block is particularly useful for:

Converting individual character/word detections into readable text
Reconstructing multi-line text from OCR results
Maintaining proper reading order of detected text elements
Supporting different writing systems and text orientations

Type of change

New feature (non-breaking change which adds functionality)

Testing

The changes have been tested through:

Unit Tests
- Tests for all reading directions
- Edge cases (empty detections, single characters)
- Tolerance grouping behavior
- Multi-line text handling
Integration Testing
- Tested on local inference server
- Created workflows with various text orientations:
  - Horizontal text
  - Vertical text
  - Multi-line text
  - Different languages/writing systems

CLAassistant · 2024-10-31T21:54:49Z

All committers have signed the CLA.

refactored block code and created unit tests fixed some bugs in the unit tests with tolerance and vertical top to bottom Bump version Make linters happpy Adding fixes for the block discovered a bug with reading vertically. fixed it by switching initial grouping to x dimension. adjusted unit tests appropriately Make linters happpy

reiffd7 requested review from PawelPeczek-Roboflow, grzegorz-roboflow, yeldarby, probicheaux and hansent as code owners October 31, 2024 21:54

PawelPeczek-Roboflow requested a review from capjamesg as a code owner November 1, 2024 09:08

PawelPeczek-Roboflow previously approved these changes Nov 1, 2024

View reviewed changes

grzegorz-roboflow previously approved these changes Nov 1, 2024

View reviewed changes

reiffd7 dismissed stale reviews from grzegorz-roboflow and PawelPeczek-Roboflow via 7dc0888 November 1, 2024 15:44

PawelPeczek-Roboflow force-pushed the stitch_ocr_detections_workflow_block branch from 7b1cf12 to 8017160 Compare November 1, 2024 17:14

PawelPeczek-Roboflow added 2 commits November 1, 2024 18:18

Resolve conflicts with main

7997a98

Make linters happpy

1e5774e

PawelPeczek-Roboflow requested review from PawelPeczek-Roboflow and grzegorz-roboflow November 1, 2024 17:19

grzegorz-roboflow approved these changes Nov 1, 2024

View reviewed changes

PawelPeczek-Roboflow merged commit 727ebd0 into main Nov 1, 2024
58 checks passed

PawelPeczek-Roboflow deleted the stitch_ocr_detections_workflow_block branch November 1, 2024 17:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stitch ocr detections workflow block #765

Stitch ocr detections workflow block #765

reiffd7 commented Oct 31, 2024

CLAassistant commented Oct 31, 2024 •

edited

Loading

Stitch ocr detections workflow block #765

Stitch ocr detections workflow block #765

Conversation

reiffd7 commented Oct 31, 2024

Description

Reading Direction (dropdown)

Tolerance (integer)

Type of change

Testing

CLAassistant commented Oct 31, 2024 • edited Loading

CLAassistant commented Oct 31, 2024 •

edited

Loading