[obsolete] DF/091: safe dataset metadata update #245

Open
wants to merge 9 commits into
base: pyDKB-storages

@mgolosova (Collaborator) commented Apr 26, 2019

Status: obsolete

The PR is obsolete but not closed yet: we need to check whether #253 (merged) does what this PR tried to achieve.
Update: #253 is about data4es-start itself, while for Stage 091, #320 should do the trick.

Original description

Running stage 091 with --es-config path/to/config allows the stage to check ES@DKB for metadata that was already removed from Rucio together with the datasets.


There is still room for improvement, so [WIP].
And some changes clearly belong to #244, so...


(Waits for #244.) (obsolete)
Waits for #320.

If the `--es-config FILE` parameter is specified, use DKB ES storage
as a backup metadata source in case the information was removed from
the primary source (Rucio).

Information is removed from Rucio together with the dataset, so if Rucio
says that a dataset is deleted (or not found), the stage can query the
internal DKB storage (ES) to get the missing information. This functionality
is mostly required for reprocessing "archived" data (when we change the
ETL process or recover missing records).
This branch requires the new functionality added to the pyDKB library.
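
A minimal sketch of the fallback logic described above, not the actual stage 091 code. All names here (`DatasetNotFound`, `get_metadata`, the stub lookups, and the `deleted` flag) are hypothetical and only illustrate the idea: ask the primary source first, and query the backup storage only when the primary source reports the dataset as missing.

```python
class DatasetNotFound(Exception):
    """Hypothetical error: the primary source has no record of the dataset."""


def get_metadata(name, primary, backup=None):
    """Return dataset metadata, falling back to `backup` if `primary` lost it.

    `primary` and `backup` are callables that take a dataset name and
    return a dict (or None). They stand in for Rucio and DKB ES lookups
    and are not the real clients.
    """
    try:
        return primary(name)
    except DatasetNotFound:
        if backup is None:
            # No backup storage configured: only the deletion is known.
            return {'deleted': True}
        # Reuse whatever was stored earlier and mark the dataset as deleted.
        result = dict(backup(name) or {})
        result['deleted'] = True
        return result


def rucio_stub(name):
    """Stand-in for a primary source that no longer knows any dataset."""
    raise DatasetNotFound(name)


def es_stub(name):
    """Stand-in for a backup storage holding one previously saved record."""
    stored = {'ds1': {'bytes': 1024, 'events': 10}}
    return stored.get(name)


print(get_metadata('ds1', rucio_stub, es_stub))
print(get_metadata('ds2', rucio_stub, es_stub))
```

Keeping the backup lookup optional mirrors the description: without a configured backup the function behaves as if only the primary source existed.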
@mgolosova mgolosova self-assigned this Apr 26, 2019
Configuration parameters starting with '__' won't be logged.
Without it, the log won't show those parameters that have default values
if they are missing from the user-supplied config.
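
One way to read the two notes above, as a rough sketch under assumptions: defaults are merged into the user config before logging, and keys starting with '__' are skipped. The helper names `loggable_items` and `log_config` are hypothetical and are not part of pyDKB.

```python
import logging


def loggable_items(config, defaults=None):
    """Merge defaults into `config` and drop keys that start with '__'.

    Assumes all keys are strings. Returns (key, value) pairs sorted by key.
    """
    merged = dict(defaults or {})
    merged.update(config)  # user-supplied values override the defaults
    return sorted((k, v) for k, v in merged.items()
                  if not k.startswith('__'))


def log_config(config, defaults=None, logger=None):
    """Log every visible parameter, including those taken from defaults."""
    logger = logger or logging.getLogger(__name__)
    for key, value in loggable_items(config, defaults):
        logger.info("Config: %s = %r", key, value)


user_config = {'host': 'es.example', '__path': '/tmp/config'}
print(loggable_items(user_config, defaults={'port': 9200}))
```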
Move metadata "update" with info from ES to a separate function.
@mgolosova mgolosova changed the base branch from master to pyDKB-storages April 28, 2019 13:39
@mgolosova mgolosova changed the title [WIP] DF/091: safe dataset metadata update [obsolete] DF/091: safe dataset metadata update Jun 18, 2019