Task Management #15117

imotov · 2015-11-30T15:28:45Z

nik9000 · 2015-12-02T15:52:06Z

I wonder if we need a way to store the results of a task until they are fetched? I'm thinking of something like update-by-query which would be a task because it is long running, cancelable, etc. But it wants to return counts of how many documents it updated and things like that. Maybe just write them to an index? Maybe with a ttl?

raf64flo · 2015-12-02T16:08:43Z

Good point from @nik9000 about keeping a long-running task's results available after it finishes, as is already done for snapshots.
But I'd prefer a TTL and/or a dedicated query to drop the result rather than only dropping it on fetch, which could be problematic in my opinion.

nik9000 · 2015-12-02T16:10:52Z

> But I'd prefer a TTL or/and a dedicated query to drop the result instead of only drop on fetch, which could be problematic in my opinion.

Yeah - drop on fetch would be rough.

Not all tasks will want to do this but I think some would like it.

imotov · 2015-12-02T17:29:19Z

@nik9000 is the goal to make results available after the task finished?

nik9000 · 2015-12-02T17:53:41Z

> @nik9000 is the goal to make results available after the task finished?

Yeah. In the case of update-by-query it'd be just to make the status available. The most "convenient" way to do it seems to be writing it to an index with a TTL, but I think I'm just stuck on that idea because it came to me. The point is that after the task is done you'll want to see what its results were for some period of time. You'd want some place where you could fetch the results by task id, some way to clear out results when you've finished with them, and some way for them to clear themselves out if you don't read them back soon enough.
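A minimal sketch of that lifecycle: fetch by task id, explicit clearing, and expiry after a TTL. Everything here is an assumption for illustration. `TaskResultStore` and its methods are hypothetical names, not part of Elasticsearch, and the store is a plain in-memory dictionary rather than an index.

```python
import time


class TaskResultStore:
    """Hypothetical in-memory store for finished task results (not an Elasticsearch API)."""

    def __init__(self, ttl_seconds, clock=time.monotonic):
        self._ttl = ttl_seconds
        self._clock = clock  # injectable so expiry can be exercised without sleeping
        self._results = {}   # task_id -> (expires_at, result)

    def put(self, task_id, result):
        """Record the result of a finished task."""
        self._results[task_id] = (self._clock() + self._ttl, result)

    def get(self, task_id):
        """Fetch a result by task id; reading does not remove it."""
        entry = self._results.get(task_id)
        if entry is None:
            return None
        expires_at, result = entry
        if self._clock() >= expires_at:
            del self._results[task_id]  # expired: the result clears itself out
            return None
        return result

    def delete(self, task_id):
        """Explicitly clear a result; returns True if something was removed."""
        return self._results.pop(task_id, None) is not None
```

Because `get` does not delete, a client whose connection drops mid-read can simply ask again, which is the problem with drop-on-fetch raised earlier in the thread.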

I don't think it needs to come at iteration 1, but at some point it'd be nice.

Look at delete-by-query, it makes some effort to build a nice results object. Once it becomes a "task" it'll have nothing to do with the fancy result object.

Another thing that might be useful is to make an API that'd block until the task was finished and return the result of it. Or just fetch the result if it was already finished. This'd be super useful in general but kind of required for the REST tests because they don't have loops and things.
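The "block until finished, or return immediately if already finished" behaviour could look roughly like this. It is only a sketch: `TaskHandle`, `finish` and `wait_for_result` are hypothetical names, and a real implementation would sit behind a REST endpoint rather than a local object.

```python
import threading


class TaskHandle:
    """Hypothetical handle for one running task."""

    def __init__(self):
        self._done = threading.Event()
        self._result = None

    def finish(self, result):
        """Called by the task when it completes."""
        self._result = result
        self._done.set()

    def wait_for_result(self, timeout=None):
        """Block until the task finishes, or return at once if it already has.

        Raises TimeoutError if the task is still running after `timeout` seconds.
        """
        if not self._done.wait(timeout):
            raise TimeoutError("task still running")
        return self._result
```

A test that cannot loop only needs a single `wait_for_result(timeout=...)` call, which is what makes this shape attractive for the REST tests.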

imotov · 2015-12-02T19:54:23Z

I think traditionally we do that in two places: 1) log files at the per-operation level and 2) stats as a combined metric. I can see how we might want to have a third way, but I think the biggest question here is the lifecycle of this result. Persistence (even temporary persistence) of results is very unclear to me unless the result is associated with some persistent object (such as a snapshot). So, I would rather make it an option to block and get the result if you are interested in the result.

nik9000 · 2015-12-02T20:22:04Z

> So, I would rather make it an option to block and get result if you are interested in the result

I don't know if that'll be enough in the end though. Imagine a delete-by-query operation that takes 30 minutes to complete. It's too long for any blocking to be reliable: all kinds of HTTP equipment will time you out after 5 minutes and something is bound to sneak in and get you a connection reset by peer.

So you'd have to build in a retry to the blocking. But if results aren't persisted, at least for a little while, then there is always the possibility that the job will finish between one request timing out and the next one starting. A low possibility but an icky one.
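A rough sketch of that client-side retry, assuming a hypothetical `fetch_result(timeout)` callable that blocks for at most `timeout` seconds and returns the result, or `None` while the task is still running. The names and the retry budget are illustrative only.

```python
import time


def wait_with_retries(fetch_result, attempts, per_request_timeout, pause=0.0):
    """Wait for a task result using several short blocking requests.

    This is only reliable if the server keeps the result for a while after the
    task finishes; otherwise a task that completes between two requests is lost.
    """
    for _ in range(attempts):
        result = fetch_result(per_request_timeout)
        if result is not None:
            return result
        time.sleep(pause)  # the gap in which the task may finish unobserved
    raise TimeoutError("task did not finish within the retry budget")
```

The `time.sleep(pause)` line marks the window in question: without retained results, anything that finishes there has nowhere to be read from on the next attempt.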

Something like a TTL on the result with explicit commands to read the result and delete it would work. These results wouldn't be huge documents so we could probably keep them in memory, certainly if they were serialized xContent or implemented Accountable or something.

It's complicated but I can't think of how else to report on tasks that are "do a thing" rather than "make a thing".

niemyjski · 2015-12-03T15:24:22Z

+1

clintongormley · 2015-12-05T19:42:16Z

Another user of task mgmt: the forced merge API

nik9000 · 2015-12-07T19:28:56Z

> Another user of task mgmt: the forced merge API

I wonder if we should add a list of users to the top, like, below the requirements. I'm happy to work on retrofitting some of our long running requests to make their status more fetch-able and to make them more cancel-able but we should make a list/tag the old issues.

jprante · 2015-12-21T23:44:47Z

Will it be possible to suspend/resume tasks by API? For perpetual tasks? Or to schedule tasks by a cron-like specification? It seems not, since the task design discusses TransportAction only, which means the lifetime of a task is "one-shot", corresponding to a request/response roundtrip of an action executed by a user?

imotov · 2015-12-22T16:39:31Z

@jprante at the moment we are targeting use cases where tasks have a clear start and stop and a finite running time. We might extend it to perpetual tasks in the future, but this is not on the immediate road map.

nik9000 · 2015-12-22T16:53:56Z

> Will it be possible to suspend/resume tasks by API?

I suspect that'll be opt-in in the same way that cancel will be opt-in. Reindex will probably opt in because it'll want to have API controlled throttling. So you could set the throttle to 0 and it'd just stop. The bulk request powering it would time out pretty soon, making the whole thing fail though.
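To make the "throttle of 0 means stop" idea concrete, here is a small sketch. `Throttle`, `set_rate` and `acquire` are hypothetical names chosen for illustration, not the reindex implementation; the timeout on `acquire` stands in for the bulk request timing out while the task is suspended.

```python
import threading
import time


class Throttle:
    """Hypothetical API-controlled throttle; a rate of 0 suspends the task."""

    def __init__(self, docs_per_second):
        self._rate = docs_per_second
        self._changed = threading.Condition()

    def set_rate(self, docs_per_second):
        """Change the rate at runtime, e.g. from a management API call."""
        with self._changed:
            self._rate = docs_per_second
            self._changed.notify_all()  # wake a worker that is paused at rate 0

    def acquire(self, docs, timeout=None):
        """Wait until `docs` documents may be processed.

        Returns False if the rate stayed at 0 for `timeout` seconds.
        """
        with self._changed:
            if not self._changed.wait_for(lambda: self._rate > 0, timeout):
                return False
            delay = docs / self._rate
        time.sleep(delay)  # spread the batch out according to the current rate
        return True
```

A worker would call `acquire(batch_size, timeout=...)` before each batch and treat a `False` return as the failure case mentioned above.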

… tasks Adds task manager class and enables all activities to register with the task manager. Currently, the immutable Transport*Activity class represents activity itself shared across all requests. This PR adds an additional structure Task that keeps track of currently running requests and can be used to communicate with these requests using TransportTaskAction. Related to elastic#15117
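The registration idea in this commit message can be pictured roughly as follows. This is an illustrative sketch only: the `TaskManager` class below and its `register`, `unregister` and `list_tasks` methods are hypothetical and do not mirror the actual Java classes.

```python
import itertools
import threading


class TaskManager:
    """Hypothetical registry of currently running requests."""

    def __init__(self):
        self._ids = itertools.count(1)
        self._tasks = {}  # task_id -> {"action": ..., "description": ...}
        self._lock = threading.Lock()

    def register(self, action, description):
        """Record a newly started request and return its task id."""
        with self._lock:
            task_id = next(self._ids)
            self._tasks[task_id] = {"action": action, "description": description}
            return task_id

    def unregister(self, task_id):
        """Forget a request once it has finished; returns its record or None."""
        with self._lock:
            return self._tasks.pop(task_id, None)

    def list_tasks(self, action=None):
        """Return running tasks, optionally filtered by action name."""
        with self._lock:
            return {tid: t for tid, t in self._tasks.items()
                    if action is None or t["action"] == action}
```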

wuranbo · 2016-01-13T03:19:06Z

Will intermediate results of a long-running task be available? For example:
A query has to touch 100 shards in total across 10 Elasticsearch nodes, 10 shards per node. We can return a result as soon as the first shard is done. When the second shard is done, we reduce its result into the previous one and send a notification to the API so the user can update their view. When all the shards are done, we send a notification that the task is done.
This way the long-running task can run in the background on fewer threads, one shard after another, freeing CPU resources for high-priority tasks. And the user can refresh the view of the long-running task frequently, which gives a better user experience.
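The shard-by-shard flow proposed here might be sketched like this. All four parameters are hypothetical stand-ins: `query_shard(shard)` returns a partial result, `reduce(acc, partial)` merges it into the running result, and `on_progress(done, total, acc)` notifies the caller.

```python
def run_incrementally(shards, query_shard, reduce, on_progress):
    """Query shards one after another, reducing and reporting after each one."""
    acc = None
    total = len(shards)
    for done, shard in enumerate(shards, start=1):
        partial = query_shard(shard)
        acc = partial if acc is None else reduce(acc, partial)
        on_progress(done, total, acc)  # the user can refresh their view here
    return acc
```

Running the shards sequentially in one loop is what keeps the thread usage low; the last `on_progress` call, where `done == total`, doubles as the "task is done" notification.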

imotov · 2016-02-11T14:22:24Z

@clintongormley good idea, I have added it to the description


nik9000 · 2017-02-14T14:35:42Z

@imotov, is this done now? I think we've decided not to do the "task can survive restart" thing, right?

oleg-andreyev · 2018-06-14T15:24:06Z

@clintongormley @imotov #15975 was closed in favor of this ticket, so can we retrieve information about _forcemerge?

imotov · 2018-06-15T21:02:05Z

@oleg-andreyev thanks! I have reopened #15975.

imotov added >feature Meta v5.0.0-alpha1 :Distributed Coordination/Task Management Issues for anything around the Tasks API - both persistent and node level. labels Nov 30, 2015

imotov mentioned this issue Nov 30, 2015

Task management #6914

Closed

nik9000 mentioned this issue Dec 2, 2015

Reindex API #15201

Closed

5 tasks

clintongormley mentioned this issue Dec 5, 2015

Notification system for completion of action (feature request) #9645

Closed

clintongormley mentioned this issue Dec 5, 2015

Ability to track status of optimize api #10008

Closed

imotov mentioned this issue Dec 9, 2015

Task Management: Add framework for registering and communicating with tasks #15347

Merged

imotov mentioned this issue Jan 12, 2016

Restoring specific shard(s) from snapshots #15653

Closed

ppf2 mentioned this issue Jan 12, 2016

Snapshot failover/retry on failed shard if a good copy is available #15940

Closed

karmi added a commit to elastic/elasticsearch-ruby that referenced this issue Mar 30, 2016

[API] Added the "Tasks" API

ccf59fe

Related: elastic/elasticsearch#15117

clintongormley added v5.0.0-alpha2 and removed v5.0.0-alpha1 labels Apr 4, 2016

clintongormley added v5.0.0-alpha3 and removed v5.0.0-alpha2 labels Apr 26, 2016

martijnbastiaan mentioned this issue May 11, 2016

Option to cancel queries when the process takes too long amcat/amcat#274

Closed

clintongormley added v5.0.0-alpha4 and removed v5.0.0-alpha3 labels May 24, 2016

clintongormley added v5.0.0-alpha5 and removed v5.0.0-alpha4 labels Jun 22, 2016

evanvolgas mentioned this issue Jul 22, 2016

Log queries before execution #9172

Closed

clintongormley added v5.0.0-beta1 and removed v5.0.0-alpha5 labels Jul 29, 2016

clintongormley added v5.0.0 and removed v5.0.0-beta1 labels Sep 14, 2016

clintongormley added v6.0.0-alpha1 and removed v5.0.0 labels Oct 11, 2016

clintongormley added v6.0.0 and removed v6.0.0-alpha1 labels May 3, 2017

imotov closed this as completed Jul 19, 2017

colings86 added v6.0.0-beta1 and removed v6.0.0 v6.0.0-beta1 labels Jul 31, 2017
