Activate strict mode and fix type hints #220

benoit74 · 2024-11-18T16:39:54Z

Fix #212

Changes:

activate pyright strict mode (obviously)
add all missing type hints and more pyright: ignore[...] hints (always qualified)
remove all headers which are not maintained (shebang + vim config)
remove all from __future__ import annotations since we target only Python 3.12, those are useless now
breaking: remove with_process argument of zimscraperlib.video.encoding.reencode and always return the process ; those who don't mind about this second argument can just drop it, this simplify the codebase and the signature and the type hints
expose zimscraperlib.download.get_retry_adapter since it is used in zimscraperlib.zim.provider
properly expose __all__ in zimscraperlib/video/__init__.py (import where there but not used hence mostly useless)
fix type hint of mappings attribute of zimscraperlib.video.config.Config: mappings values are always a string since they are in fact keys)
simply type hint of options and defaults attributes of zimscraperlib.video.config.Config: options and defaults values are always a string or None since they are intended to be passed to the CLI as argument ; authorizing int and bool is not properly handled in the code, not needed, not used.
I had to rewrite a bit the metadata stuff to please the type checker:
- add a generic and abstract MetadataBase class which holds most property and can accomodate any value type, but obviously is abstract because it has no idea how to transform any value into a libzim value (this is the purpose of real implementation)
- add a convenient AnyMetadata type alias
- use a bounded TypeVar U to properly type all class decorators so that they properly accommodate to the class they are used on while still knowing it has access to all attributes of the MetadataBase class (it took me ages to figure this out, so worth mentioning and have in mind)
- I had to modify (again) get_reserved_names method which broke for some reason ; I feel like new implementation is indeed simpler (it does not need to rely on the weird globals())
- I decided to modify _log_metadata and drop the is None test ; type checker was complaining that this is not possible to happen (which is True if scraper use type hints properly and/or use only public methods which are protected by beartype ... only case where it can happen is if you do not use type checker and you manipulate directly the _metadata attribute ...) and message logged is not awful in such a situation ... is improper metadata type: NoneType: None instead of ... is None

Nota:

there is an important typing issue in zimscraperlib.zim.creator.Creator with methods add_metadata and add_item. I had to set the pragma # pyright: ignore[reportIncompatibleMethodOverride] because these methods override the base class methods (from pylibzim) with incompatible signature. This is not recommended. I feel like it is still acceptable since very convenient, and not a big deal (we never use the base class directly in scrapers in fact, and we do not have multiple child implementation of the base class with different signatures, so this is not really confusing).

codecov · 2024-11-18T20:58:02Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.00%. Comparing base (bbe94e8) to head (39095ba).
Report is 5 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff            @@
##              main      #220   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           39        39           
  Lines         2417      2455   +38     
  Branches       322       331    +9     
=========================================
+ Hits          2417      2455   +38

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

rgaudin

Thank you! That must have been long and painful.

I really like the MetadataBase and the get_reserved_names ; very nice!
I dont mind the add_metata(), it was done on purpose.

My only concern regards accepting str for paths. We should not go this way IMO.

A way to resolve it might be to create a custom context manager like

from pathlib import Path
from tempfile import TemporaryDirectory

@contextmanager
def path_from(path: Path | TemporaryDirectory | str):
    if isinstance(path, Path):
        yield path
    elif callable(hasattr(path, "__enter__")):
        with path as pathname:
            yield Path(pathname)
    else:
        yield Path(pathname)

You would use it like:

with path_from(existing_tmp_path or tempfile.TemporaryDirectory()) as tmp_dir:
    ...

src/zimscraperlib/image/__init__.py

src/zimscraperlib/image/optimization.py

src/zimscraperlib/video/probing.py

src/zimscraperlib/zim/metadata.py

tests/filesystem/test_filesystem.py

benoit74 · 2025-01-06T10:16:33Z

@rgaudin please review the last 3 commits, I've checked, these are the changes since your last review

rgaudin

LGTM

benoit74 self-assigned this Nov 18, 2024

benoit74 force-pushed the strict_mode branch from 880b6aa to 29dd2e7 Compare November 18, 2024 20:55

benoit74 mentioned this pull request Dec 19, 2024

Be more precise than generic Callable in type hints #199

Open

benoit74 changed the base branch from main to i18n_class December 19, 2024 16:52

benoit74 force-pushed the strict_mode branch 5 times, most recently from b137179 to 76e5157 Compare December 20, 2024 08:42

Base automatically changed from i18n_class to main December 20, 2024 08:54

benoit74 force-pushed the strict_mode branch from 76e5157 to e84189c Compare December 20, 2024 09:05

benoit74 marked this pull request as ready for review December 20, 2024 09:12

benoit74 requested a review from rgaudin December 20, 2024 09:12

rgaudin requested changes Dec 20, 2024

View reviewed changes

benoit74 force-pushed the strict_mode branch 2 times, most recently from 0d1a8e2 to 78fa538 Compare December 20, 2024 20:32

benoit74 requested a review from rgaudin January 6, 2025 10:16

rgaudin approved these changes Jan 6, 2025

View reviewed changes

benoit74 added 4 commits January 6, 2025 12:51

Fix typing issues and activate pyright strict mode

1a9990a

Add path_from helper context manager

e0f94d0

Small enhancements following review

c314a77

Make image optimization methods stricter with options types

39095ba

benoit74 force-pushed the strict_mode branch from 78fa538 to 39095ba Compare January 6, 2025 12:51

benoit74 merged commit 5f71a29 into main Jan 6, 2025
9 checks passed

benoit74 deleted the strict_mode branch January 6, 2025 13:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Activate strict mode and fix type hints #220

Activate strict mode and fix type hints #220

Uh oh!

benoit74 commented Nov 18, 2024 •

edited

Loading

Uh oh!

codecov bot commented Nov 18, 2024 •

edited

Loading

Uh oh!

rgaudin left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

benoit74 commented Jan 6, 2025

Uh oh!

rgaudin left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Activate strict mode and fix type hints #220

Activate strict mode and fix type hints #220

Uh oh!

Conversation

benoit74 commented Nov 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Nov 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

rgaudin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

benoit74 commented Jan 6, 2025

Uh oh!

rgaudin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

benoit74 commented Nov 18, 2024 •

edited

Loading

codecov bot commented Nov 18, 2024 •

edited

Loading