add Docker and GHA CD via Dockerhub #70

bertsky · 2024-09-30T19:27:30Z

No description provided.

kba

LGTM.

I am aware that it has been deprecated for a while but until we have the binarization functionality in eynollah available via OCR-D interface, this is still one of the best binarizers out there and we should support it as a slim container in ocrd_all.

bertsky · 2024-10-11T12:20:41Z

Py3.9 and 3.10 failures are extremely strange:

>>> import sbb_binarize.cli
>>> type(sbb_binarize.cli)
>>> sbb_binarize.cli is None
>>> import inspect
>>> inspect.ismodule(sbb_binarize.cli)
>>>

So it seems that Python has some kind of null type not made explicit. No idea what's wrong with the packaging. The RECORD seems fine, it lists all module files.

(now unsupported anyway)

bertsky · 2024-10-14T11:32:24Z

Py3.9 and 3.10 failures are extremely strange:
>>> import sbb_binarize.cli
>>> type(sbb_binarize.cli)
>>> sbb_binarize.cli is None
>>> import inspect
>>> inspect.ismodule(sbb_binarize.cli)
>>>
So it seems that Python has some kind of null type not made explicit. No idea what's wrong with the packaging. The RECORD seems fine, it lists all module files.

Migrating from setup.py to pyproject.toml did not help.

bertsky · 2024-10-14T11:33:25Z

I'll also relax TF requirement slightly (to be in line with eynollah) – tested: no different between 2.11 and 2.12.

bertsky · 2024-10-14T15:30:21Z

oh, wow – relaxing the TF requirements also magically fixed the problem with Py3.9 and 3.10! (I wonder why we did not receive a build-time error saying that no such TF version is available, though...)

joschrew

The dockerimage does not work for me without these changes. I testet ocrd-sbb-binarize --help and a simple binarization, which took quite long (72 secs for a single image).
Maybe it would also be good to include one ore more models into the dockerimage, because I had to download one by myself with resmgr, but I am not sure if this should be included.

Dockerfile

pyproject.toml

Co-authored-by: joschrew <91774427+joschrew@users.noreply.github.com>

bertsky · 2024-10-15T10:51:57Z

which took quite long (72 secs for a single image).

yes, without a GPU this is slow, and even with it is not efficient. But that's well known and not the issue here.

Maybe it would also be good to include one ore more models into the dockerimage, because I had to download one by myself with resmgr, but I am not sure if this should be included.

No, that would just bloat the image. Downloading models in advance of using is the normal procedure for OCR-D. We don't want to reign in the admin's decision where to store persistent data (like models, which should not change as often as code).

Also I am wondering if the image should me based on the special ocr-d-tensorflow image as tensorflow is a requirement. But I might be totally wrong regarding the latter.

In principle, yes. But since here, TF is pinned to <=2.12, while our current core-cuda-tf2 is unrestricted on Py38 (and thus pulls 2.13) … https://github.com/OCR-D/core/blob/85bde1574293ea8b7ba29255fbb8e07312c28eb1/Makefile#L153-L158 … it would not help (only increase the image size even more).

But we should at least switch to core-cuda then...

bertsky · 2024-10-15T17:51:12Z

So this is ready IMO. Fixes #67 and also brings support for Python 3.9 and 3.10.

cneud · 2024-10-16T11:21:19Z

Dear all, thx - I can merge this today but would have 2 more small requests:

can we also include Python 3.11 in the CI please
Tensorflow can be relaxed to 2.12.x iiuc

bertsky · 2024-10-16T11:42:18Z

@cneud done!

can we also include Python 3.11 in the CI please

let's see if it works

Tensorflow can be relaxed to 2.12.x iiuc

IMO it would make sense now to relax Eynollah's current pin (2.12.1) as well (2.12.x) – so we don't reintroduce the conflict if some TF 2.12.2 should be published

cneud · 2024-10-16T12:03:11Z

relax Eynollah's current pin (2.12.1) as well (2.12.x)

Will do and thanks again! Ready to merge in 3, 2, 1...

bertsky added 4 commits September 30, 2024 21:27

add GHA CD via Dockerhub

23e282c

make install: update setuptools, too

1162a1c

CI: increase memory on VM

e0ba83e

remove shebang from setup.py (somehow breaking py39)

ccfc821

kba approved these changes Oct 10, 2024

View reviewed changes

bertsky added 5 commits October 14, 2024 13:12

make docker: fix docker tag

05e3088

add pyproject.toml

676b6f1

remove setup.py

0f611f8

CI: remove py37 from matrix

1b8f54c

(now unsupported anyway)

CI: remove py37 from matrix

eb9a9fe

relax TF requirement

4eabd12

bertsky mentioned this pull request Oct 14, 2024

Update 2024 10 11 OCR-D/ocrd_all#452

Merged

joschrew suggested changes Oct 15, 2024

View reviewed changes

Dockerfile Outdated Show resolved Hide resolved

Dockerfile Show resolved Hide resolved

pyproject.toml Outdated Show resolved Hide resolved

bertsky and others added 2 commits October 15, 2024 12:41

dockerfile: switch to pyproject.toml

b581568

Co-authored-by: joschrew <91774427+joschrew@users.noreply.github.com>

forgot to include package data

547229c

Co-authored-by: joschrew <91774427+joschrew@users.noreply.github.com>

docker: rebase on core-cuda stage

d259795

kba approved these changes Oct 16, 2024

View reviewed changes

joschrew approved these changes Oct 16, 2024

View reviewed changes

bertsky added 3 commits October 16, 2024 13:36

relax TF requirement (subminor)

00f70d1

CI: try adding py3.11

7ee111d

relax TF requirement (fix syntax)

ddcec5b

cneud approved these changes Oct 16, 2024

View reviewed changes

cneud merged commit 5385162 into qurator-spk:master Oct 16, 2024
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add Docker and GHA CD via Dockerhub #70

add Docker and GHA CD via Dockerhub #70

bertsky commented Sep 30, 2024

kba left a comment

bertsky commented Oct 11, 2024

bertsky commented Oct 14, 2024

bertsky commented Oct 14, 2024

bertsky commented Oct 14, 2024

joschrew left a comment •

edited

Loading

bertsky commented Oct 15, 2024

bertsky commented Oct 15, 2024

cneud commented Oct 16, 2024

bertsky commented Oct 16, 2024

cneud commented Oct 16, 2024

add Docker and GHA CD via Dockerhub #70

add Docker and GHA CD via Dockerhub #70

Conversation

bertsky commented Sep 30, 2024

kba left a comment

Choose a reason for hiding this comment

bertsky commented Oct 11, 2024

bertsky commented Oct 14, 2024

bertsky commented Oct 14, 2024

bertsky commented Oct 14, 2024

joschrew left a comment • edited Loading

Choose a reason for hiding this comment

bertsky commented Oct 15, 2024

bertsky commented Oct 15, 2024

cneud commented Oct 16, 2024

bertsky commented Oct 16, 2024

cneud commented Oct 16, 2024

joschrew left a comment •

edited

Loading