Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WARC writer + incremental indexing fixes #679

Merged
merged 4 commits into from
Sep 5, 2024
Merged

Conversation

ikreymer
Copy link
Member

@ikreymer ikreymer commented Sep 5, 2024

  • ensure WARC rollover happens only after response/request + cdx or single record + cdx have been written
  • ensure request payload is buffered for POST request indexing
  • update to warcio 2.3.1 for POST request case-insensitive 'content-type' check
  • recorder: remove unused 'tempdir', no longer used as warcio chooses a temp file on it's own

- ensure WARC rollover happens only after response/request + cdx or single record + cdx have been written
- ensure request payload is buffered for POST request indexing
- update to warcio 2.3.1 for POST request case-insensitive 'content-type' check
- recorder: remove unused 'tempdir', no longer used as warcio chooses a temp file on it's own
@ikreymer
Copy link
Member Author

ikreymer commented Sep 5, 2024

To be merged after warcio.js 2.3.1 is released after webrecorder/warcio.js#78

@ikreymer ikreymer requested a review from tw4l September 5, 2024 06:19
Copy link
Member

@tw4l tw4l left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, will just need to update version number after other PR is merged

@ikreymer ikreymer marked this pull request as ready for review September 5, 2024 17:46
@ikreymer ikreymer merged commit 9d0e342 into main Sep 5, 2024
4 checks passed
@ikreymer ikreymer deleted the warc-indexing-fixes branch September 5, 2024 18:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants