Skip to content

v2024-10-15

Latest
Compare
Choose a tag to compare
@bertsky bertsky released this 15 Oct 18:33
3e33522

v2024-10-15

Changes:

  • added new module ocrd_page2alto (also in ocrd_fileformat, now with standalone processor)
  • new fixup recipes for shared venv without dependency conflicts
  • protect venv creation by semaphore as well
  • docker: update OCRD_MODULES (default selection for custom make docker)
  • docker: fix minimum and medium module lists
  • docker: do not rm venv created by previous stage
  • CI/CD: rewrite CircleCI config to split up mini/medi/maxi into interdependent incremental jobs
  • CI/CD: fix storing test results

core 92b217e..85bde15

Release: v2.70.0

  • PyPI: do not upload deprecated distribution aliases anymore
  • deps-cuda: retry micro.mamba.pm even more
  • 📦 v2.70.0
  • 📝 changelog
  • create PyPI CD
  • 📝 changelog
  • Merge remote-tracking branch 'github/cli-decorator-import-network'
  • deps-cuda: retry if micromamba is unresponsive
  • Merge branch 'master' of https://github.com/OCR-D/core
  • 📝 changelog
  • Merge remote-tracking branch 'github/fix_mets_server_zombies'
  • 📝 changelog
  • Merge remote-tracking branch 'github/deps-torch-torchvision'
  • 📝 changelog
  • Merge branch 'network_client_block_prints'
  • Merge pull request #1280 from OCR-D/fix-docker-cuda-torch
  • 📦 v2.69.0
  • 📝 changelog
  • Merge branch 'mexthecat-master'
  • 📝 update changelog again
  • 📝 changelog: remove spurious entries
  • 📝 changelog
  • disableLogging: clearer comment
  • ocrd.cli.workspace: use physical_pages if possible, fix default output_field
  • OcrdMets.get_physical_pages: cover return_divs w/o for_fileIds for_pageIds
  • update OcrdPage from generateds
  • OcrdPage: add PageType.get_ReadingOrderGroups()
  • ocrd.cli.workspace: assert non-server in cmds mutating METS
  • tests: make sure ocrd_utils.config gets reset whenever changing it globally
  • lib.bash: fix errexit
  • run_processor: be robust if ocrd_tool is missing steps
  • ocrd.cli.validate tasks: pass on --mets-server-url, too
  • ocrd.cli.workspace server: add 'reload' and 'save'
  • ocrd.cli.workspace: consistently pass on --mets-server-url and --backup (also, simplify)
  • METS Server: also export+delegate physical_pages
  • PcGts.Page.id / make_xml_id: replace '/' with '_'
  • test_mets_server: add test for force (overwrite)
  • OcrdMetsServer.add_file: pass on 'force' kwarg, too
  • Workspace.reload_mets: fix for METS server case
  • add test for OcrdEnvConfig.reset_defaults()
  • ocrd_utils.config: add reset_defaults()
  • bashlib: re-add --log-filename, implement as stderr redirect
  • test-logging: also remove ocrd.log from tempdir
  • disableLogging: re-instate root logger, to
  • test_mets_server: use tmpdir to avoid side effects between suites
  • ClientSideOcrdMets: use same logger name prefix as server
  • pylint: try ignoring generateds (again)
  • update pylintrc
  • OcrdMets.add_agent: does not have positional args
  • cli.workspace: pass fileGrp as well, improve description
  • adapt to PIL.Image moved constants
  • fix exception
  • fix --log-filename (6fc606027a): apply in ocrd_cli_wrap_processor
  • tests report.is_valid: improve output on failure
  • Processor.zip_input_files: more verbose log msg
  • Processor.zip_input_files: warning instead of exception for missing input files
  • fix imports
  • ocrd_utils: forgot to export scale_coordinates at toplvl
  • allow "from ocrd_models import OcrdPage
  • improve output in case of assertion failures
  • hide/test expected deprecation warnings
  • use up-to-date kwargs (avoiding old deprecations)
  • mets_server: ClientSideOcrdMets needs OcrdMets-like kwargs (without deprecation)
  • test_mets_server: fix arg vs kwarg
  • processor CLI: delegate --resolve-resource, too
  • 📦 v2.68.0
  • 📝 changelog
  • refactor client cli: process -> run
  • Merge branch 'master' into extend-network-client
  • 📝 changelog
  • Merge pull request #1270 from OCR-D/fix-parsing
  • fix: exception handling
  • add: check processing job log file
  • add: discovery cli, processors and processor
  • add sort to network agents
  • add: parameter_override
  • fix: the annoying string dict
  • fix: check report validation outside try block
  • fix: set ps address if None in constructor
  • Fix: server_utils.py > 404 to 400
  • Fix: rename to block
  • add docstring to cli commands
  • fix: required job id
  • add: help section to the cli
  • add cli job status check
  • add help for new env
  • refine status check methods
  • Update src/ocrd_network/client_utils.py
  • add timeout and wait to configs
  • add: client workflow run
  • fix: client processing request
  • fix test
  • refactor status checks
  • remove the client server
  • try docker host ip
  • Fix flag typo
  • integration test for client
  • update network client
  • fix the test dir path in docker
  • add integration test for client
  • Merge branch 'resolve-1257'
  • 📝 changelog
  • revert, and just use < v43.0.0
  • set paramiko logging to INFO
  • fix: supress paramiko warnings
  • set: propagate 0, logging config
  • set: paramiko logging to ERROR
  • remove downloading tool json
  • add: default ocrd-all-tool.json
  • download tool json if missing
  • Merge branch 'master' into resolve-1257
  • load tool json locally

dinglehopper 129e6eb..071e6a8

Release: v0.9.7

  • Merge pull request #120 from joschrew/dockerfile
  • Merge pull request #113 from qurator-spk/python-3.13
  • ✔ pre-commit: Add license check
  • 🐛 Fix --version option in OCR-D CLI
  • ✨ Support --version option in CLI
  • ⚙ pyproject.toml: Add license.file
  • ⚙ pre-commit: Update hooks

docstruct a7ffdda..004e6ec

  • add GHA CD via Dockerhub
    Submodule eynollah 032a99e...51f6ef6:
  • Merge pull request #137 from qurator-spk/dockerfile
  • Merge pull request #132 from qurator-spk/extracting_images_only
  • Merge pull request #133 from qurator-spk/src-layout
  • 📦 v0.3.1
  • 📝 changelog
  • Merge pull request #129 from qurator-spk/resolving_issue_106
  • update Makefile model location
  • update pyproject.toml for v0.3.1
  • update pyproject.toml
  • Update README.md
  • rename GH action
  • create draft pyproject.toml
  • format options table
  • Update README.md
  • improve huggingface url
  • remove CircleCI
  • Update model download url
  • Merge pull request #127 from bertsky/new-namespace-pkg
  • update GitHub actions
  • Update README.md
  • update supported Python+Tensorflow version combinations
  • pin tf2 version to 2.12.1
  • use tf1 compatibility for keras backend
    < adapt to OcrdFile.local_filename now :Path
    < adapt to ocrd>=2.54 url vs local_filename
  • comment unnecessary print commands
  • add supported OS to readme
  • filtering separators in a correct way without missing them
  • Merge pull request #117 from qurator-spk/tf-2.12-or-greater
  • apply missed commit #a56988a back
  • Merge pull request #116 from qurator-spk/fix-typos
  • Merge pull request #113 from qurator-spk/tf_<2.12.0
  • Update citation
  • Update bibtex entry
  • format citation info as bibtex
  • add HIP'23 paper reference
  • Merge pull request #109 from bertsky/patch-3
  • Merge pull request #105 from bertsky/fix-model-archive-path
    < Revert "Merge pull request #97 from qurator-spk/420-namespace-package"
  • Merge pull request #104 from bertsky/reinstate-namespace-pkg
  • Merge pull request #102 from qurator-spk/right2left_reading_order
  • delete printing resized image shape
  • issue #67 solved
  • improve links to GT guidelines
  • Update README.md
  • Update CHANGELOG.md
  • Update ocrd-tool.json
  • Merge pull request #86 from qurator-spk/eynollah_light

nmalign 7832c90..1426dbc

Release: v0.0.3

  • fix dockerfile
  • add GHA CD via Dockerhub

ocrd_calamari caac953..d9cde1f

Release: v1.0.6

  • Merge pull request #116 from bertsky/limit-batch-size
  • Merge pull request #115 from OCR-D/pyproject_toml
  • ✔ CircleCI: Don't test on Python 3.7 anymore
  • ⚙ pre-commit: Update hooks
  • 🧹 ruff: Update config
  • 🎨 Reformat (Black)
  • 🧹 Update OCR-D API usage w.r.t. workspace.add_file()
  • ⚙ pre-commit: Update hooks

ocrd_cis 38ce45b..db65d7f

Release: v0.1.5

  • Merge pull request #6 from MehmedGIT/fix-align-import-levenshtein
  • segment: adapt to numpy deprecation
  • Updated config.yml
  • CI: try testing in parallel
  • test_lib.bash: update GT Github URL
  • fix+update dockerfile
  • recognize: replace python-levenshtein with rapidfuzz
  • fix 53ae7d69 (already str not bytes)
  • use importlib instead of pkg_resources via ocrd_utils
  • docker: adapt to core using /build already
  • ocrd-tool.json: add Ocropy default model resources
  • resegment: fix 2 edge cases

ocrd_detectron2 1f56273..218e0b5

Release: v0.1.8

  • CI: out py3.7, in py3.10
  • CD: publish to ocrd/detectron2 instead of bertsky/ocrd_detectron2
  • Docker: rebase to ocrd/core-cuda-torch
  • update from pillow deprecations
  • make deps: defer to detectron2 master (and always build from src)

ocrd_doxa a95f8e7..15e8423

Release: v0.0.2

  • add DH badge
  • fix GHA username
  • fix GHA syntax
  • add Dockerhub CD
  • Merge pull request #5 from joschrew/dockerfile

ocrd_fileformat ff46bd7..8ab078d

Release: v0.11.1

  • update ocr-fileformat to include UB-Mannheim/ocr-fileformat#188
  • test_convert: fix choice of assets (w/ wf. METS)
  • Merge pull request #60 from bertsky/patch-4
  • Merge pull request #57 from OCR-D/license

ocrd_im6convert be794f6..82bd491

Release: v0.1.1

  • CD: fix GHCR image tag
  • dockerfile: oops, confused im6convert with fileformat
  • Merge pull request #14 from OCR-D/docker-build-args

ocrd_keraslm ea79b2a..2c466bd

Release: v0.4.3

  • update assets
  • Revert "test: replace buggy workspace clone with cp -r"
  • test: replace buggy workspace clone with cp -r
  • docker: base on ocrd/core-cuda:2.69 (via env var)

ocrd_olahd_client 6bcbb4b..56c9272

Release: v0.0.2

  • Merge branch 'dockerfile'

ocrd_segment de824e9..064b7a8

Release: v0.1.24

  • Merge pull request #68 from OCR-D/update-docker-2.67

ocrd_tesserocr d23992b..dcbd522

Release: v0.19.1

  • Merge pull request #219 from OCR-D/fix-docker
  • GHA CD: forgot ghcr.io prefix
  • GHA CD: lowercase repo name
  • Merge pull request #215 from bertsky/circleci-cd-add-pypi
  • dockerfile: build Tesseract in parallel
  • dockerfile: adapt to weird dockerfile COPY syntax
  • docker: unblock+copy .gitmodules
  • dockerignore: unblock .git
  • dockerfile: copy .git repo as well (so submodule update works) during build
  • makefile: ensure submodules are checked out
  • makefile: fix clean-tesseract
  • Merge pull request #218 from OCR-D/update-docker-2.67

ocrd_wrap 2cd800d..fd4a2bc

Release: v0.1.8

  • forgot Dockerfile
  • add GHA CD via Dockerhub

opencv-python cce7c99..6a181ce

Release: 84

  • Merge pull request #1015 from Gornoka:patch-1

sbb_binarization 978f425..d259795

Release: v0.1.0

  • docker: rebase on core-cuda stage
  • forgot to include package data
  • dockerfile: switch to pyproject.toml
  • relax TF requirement
  • CI: remove py37 from matrix
  • remove setup.py
  • add pyproject.toml
  • make docker: fix docker tag
  • remove shebang from setup.py (somehow breaking py39)
  • CI: increase memory on VM
  • make install: update setuptools, too
  • add GHA CD via Dockerhub

workflow-configuration eeea260..63e9969

Release: 0.1.3

  • Adapt dockerimage to ocrd-d-core changes
  • 📝 update readme
  • new pair of XSLTs: un/flatten text regions in arbitrary regions
  • use mkdir -p when creating SHAREDIR