Skip to content

Conversation

@Mantisus
Copy link
Collaborator

@Mantisus Mantisus commented Nov 26, 2025

Description

  • Fix non-ASCII character corruption in FileSystemStorageClient on systems without UTF-8 default encoding

Issues

Testing

  • Add new tests for storage

@Mantisus Mantisus self-assigned this Nov 26, 2025
await Dataset.open(name=name, storage_client=storage_client)


async def test_record_with_noascii_chars(dataset: Dataset) -> None:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The fix is in file system storage only. We can either move this check to file system storage client tests or test with all possible storage clients - your call.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would keep the tests for all possible storage clients. 🙂

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah right, the dataset fixture uses the parametrized storage client fixture. Makes sense.

await Dataset.open(name=name, storage_client=storage_client)


async def test_record_with_noascii_chars(dataset: Dataset) -> None:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah right, the dataset fixture uses the parametrized storage client fixture. Makes sense.

@Pijukatel Pijukatel merged commit f179f86 into apify:master Nov 27, 2025
24 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

user_data deforms non-latin letters when passed in

3 participants