Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow JOB_TASKS_PLAN to be configurable per publication #304

Open
1 of 4 tasks
jpmckinney opened this issue Sep 18, 2023 · 0 comments
Open
1 of 4 tasks

Allow JOB_TASKS_PLAN to be configurable per publication #304

jpmckinney opened this issue Sep 18, 2023 · 0 comments

Comments

@jpmckinney
Copy link
Member

jpmckinney commented Sep 18, 2023

For example, extremely large datasets (like ProZorro Ukraine, and previously a few Digiwhist publications) will probably not be flattened in a reasonable amount of time.

If large publications only have one release per contracting process, we can replace the process step by either:

  • Running an alternative worker to Kingfisher Process, which reformats the packages from Kingfisher Collect as compiled releases.
  • Adding an option in Kingfisher Collect, which causes AddPackageMiddleware and KingfisherProcessAPI2 to be skipped, and causes a new RemovePackageMiddleware to be run. (This is probably faster, if not overly complicated.)

We would need a new exporter that indexes files on disk, and then writes the monthly and yearly files.

Both options will create a lot of files, which requires open-contracting/kingfisher-collect#740 to be resolved, otherwise the performance will be awful.

Before adding ProZorro, we should also complete #291.

So:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant