Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ML-DataFrame] Checkpoint information is wrong for stopped dataframes #42978

Closed
2 tasks
hendrikmuhs opened this issue Jun 7, 2019 · 4 comments
Closed
2 tasks
Assignees
Labels

Comments

@hendrikmuhs
Copy link

hendrikmuhs commented Jun 7, 2019

For stopped data frames the checkpoint information is taken from the stored stats document in the internal index rather than calculated. The stored information is always empty, therefore it always reports 0:

     "checkpointing" : {
        "operations_behind" : 0
      }

Solution:

  • calculate checkpoint information the same ways as it's done for started data frames
  • do not persist checkpoint information in the state-and-stats document in the internal index
@elasticmachine
Copy link
Collaborator

Pinging @elastic/ml-core

@droberts195
Copy link
Contributor

I think this is a prerequisite for #43767, and hence elastic/kibana#40378.

@sophiec20
Copy link
Contributor

A stopped continuous data frame now returns useful info. Can this be closed?

{
  "count" : 1,
  "transforms" : [
    {
      "id" : "simple-01",
      "state" : "stopped",
      "stats" : {
        "pages_processed" : 27003,
        "documents_processed" : 437684532,
        "documents_indexed" : 4291370,
        "trigger_count" : 4289,
        "index_time_in_ms" : 2274847,
        "index_total" : 8805,
        "index_failures" : 0,
        "search_time_in_ms" : 40555084,
        "search_total" : 27004,
        "search_failures" : 0,
        "exponential_avg_checkpoint_duration_ms" : 65092.07653134306,
        "exponential_avg_documents_indexed" : 7297.119039406083,
        "exponential_avg_documents_processed" : 746541.5988733966
      },
      "checkpointing" : {
        "last" : {
          "checkpoint" : 633,
          "timestamp" : "2019-08-22T12:05:36.604Z",
          "timestamp_millis" : 1566475536604,
          "time_upper_bound" : "2019-08-22T12:04:36.604Z",
          "time_upper_bound_millis" : 1566475476604
        },
        "next" : {
          "checkpoint" : 634,
          "position" : {
            "bucket_position" : {
              "clientip" : "10.11.122.11"
            }
          },
          "checkpoint_progress" : {
            "docs_indexed" : 2000,
            "docs_processed" : 210542
          },
          "timestamp" : "2019-08-22T12:06:54.717Z",
          "timestamp_millis" : 1566475614717,
          "time_upper_bound" : "2019-08-22T12:05:54.717Z",
          "time_upper_bound_millis" : 1566475554717
        },
        "operations_behind" : 15996
      }
    }
  ]
}

@benwtrent
Copy link
Member

@sophiec20 definitely, looking at @droberts195 PR, there was logic that gathered checkpointing information for stopped transforms that did not have actively running tasks. Should be good to go now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

5 participants