cache get_platform #5182

oliver-sanders · 2022-10-07T14:24:35Z

Spotted that the get_platform_from_name function was using a lot of CPU.

Turns out that it was called 7662 times for this sample workflow:

[task parameters]
    a = 1..5
    b = 1..5
    c = 1..5

[scheduling]
    initial cycle point = 2000
    final cycle point = 20000110T00Z
    runahead limit = P5
    [[queues]]
        [[[default]]]
            limit = 25
    [[graph]]
        P1D = """
            <a> => <b> => <c>
            <b>[-P1D] => <b>
        """

[runtime]
    [[<a>, <b>, <c>]]
        script = true

Embarrassingly this workflow only uses the localhost platform.

Added lru_cache to the function to reduce the impact:

Before:

After:

Caching does have an overhead, but inspecting the call tree it would appear this saved ~3 seconds of a ~35s run, which is ~10% of CPU!

Check List

I have read CONTRIBUTING.md and added my name as a Code Contributor.
Contains logically grouped changes (else tidy your branch by rebase).
Does not contain off-topic changes (use other PRs for other changes).
Applied any dependency changes to both setup.cfg and conda-environment.yml.
Tests are included (or explain why tests are not needed).
CHANGES.md entry included if this is a change that can affect users
Cylc-Doc pull request opened if required at cylc/cylc-doc/pull/XXXX.
If this is a bug fix, PRs raised to both master and the relevant maintenance branch.

oliver-sanders · 2022-10-07T15:12:11Z

Dammit the bad_hosts make a mess of this. Will need to split the function into the cacheable and non-cacheable parts.

oliver-sanders · 2023-03-08T11:04:32Z

Closing for now pending two changes:

A reduction in the number of get_platform calls (done).
A refactor of the get_platform logic to separate the cachable and non-cachable parts.

oliver-sanders added small efficiency For notable efficiency improvements labels Oct 7, 2022

oliver-sanders added this to the cylc-8.1.0 milestone Oct 7, 2022

oliver-sanders requested a review from wxtim October 7, 2022 14:24

oliver-sanders self-assigned this Oct 7, 2022

oliver-sanders requested a review from datamel October 7, 2022 14:24

oliver-sanders mentioned this pull request Oct 7, 2022

reload the global configuration during a workflow run #3762

Open

oliver-sanders removed request for wxtim and datamel October 7, 2022 15:02

oliver-sanders marked this pull request as draft October 7, 2022 15:02

oliver-sanders modified the milestones: cylc-8.1.0, cylc-8.2.0 Oct 18, 2022

oliver-sanders linked an issue Nov 28, 2022 that may be closed by this pull request

platforms: cache results or reduce overheads #5242

Open

oliver-sanders added 2 commits January 20, 2023 09:12

cache get_platform

1be9ae3

fix unit tests

0ec8ab2

oliver-sanders force-pushed the cache-platforms branch from 37f8130 to 0ec8ab2 Compare January 20, 2023 10:31

oliver-sanders mentioned this pull request Jan 20, 2023

efficiency: increment_graph_window #5315

Closed

oliver-sanders closed this Mar 8, 2023

oliver-sanders removed this from the cylc-8.2.0 milestone Mar 8, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cache get_platform #5182

cache get_platform #5182

oliver-sanders commented Oct 7, 2022

oliver-sanders commented Oct 7, 2022 •

edited

Loading

oliver-sanders commented Mar 8, 2023

cache get_platform #5182

cache get_platform #5182

Conversation

oliver-sanders commented Oct 7, 2022

oliver-sanders commented Oct 7, 2022 • edited Loading

oliver-sanders commented Mar 8, 2023

oliver-sanders commented Oct 7, 2022 •

edited

Loading