proposal: integration testing battery #3616

oliver-sanders · 2020-05-19T15:24:39Z

tldr

Here is a POC Python test framework which could be used to re-implement our flaky unit tests which run workflows (an alternative option to #3611 which opens more doors).

def test_someting(make_flow, run_flow, run_dir):
    # bare minimum boilerplate
    scheduler = make_flow(
        {
            'scheduling': {
                'dependencies': {
                    'graph': 'foo'
                }
            }
        }
    )
    with run_flow(scheduler):
        # write your tests here
        assert client('ping_suite')
        assert Path(run_dir, reg, '.service', 'contact').exists()

This writeup was probably more work than the code so I'm happy to drop it if not desired. Might cherry pick a few commits onto master if dropped.

The Testing Situation

At present we have two test batteries:

Functional tests (tests/) written in bash and run with prove.
Unit tests (cylc/flow/tests) written in python and run with pytest.

We are missing a layer.

Unit tests test the minimal unit of functionality
Integration tests test the interaction between modules.
Functional tests test the interaction between systems.

(note our functional tests are muddled in with some unit tests and quite a lot of integration tests).

CylcWorkflowTestCase

In the unit tests we have a framework called CylcWorkflowTestCase, a number of tests are already implemented using it:

cylc/flow/tests/network/test_client.py
cylc/flow/tests/network/test_publisher.py
cylc/flow/tests/network/test_resolvers.py
cylc/flow/tests/network/test_server.py
cylc/flow/tests/network/test_subscriber.py
cylc/flow/tests/test_data_store_mgr.py
cylc/flow/tests/test_job_pool.py

These aren't unit tests, they are really more like integration tests. They have been implemented as unittests because it is more convenient to test Python from Python.

Unfortunately running Cylc from Python is not really possible at the moment, consequently CylcWorkflowTestCase works by mocking something that looks like a scheduler and mounting the necessary parts onto it. This makes the tests very complex to write and fragile to changes to the scheduler.

Here's an excerpt from cylc/flow/tests/network/test_client.py so you can see what I'm going on about:

# the test has to re-implment scheduler functionality:
self.task_pool.release_runahead_tasks() 
self.scheduler.data_store_mgr.initiate_data_model() 
self.workflow_id = self.scheduler.data_store_mgr.workflow_id 
# and put in-place system-level functionality
create_auth_files(self.suite_name)  # auth keys are required for comms 
# and handle fine implementation detail which could change at any time
barrier = Barrier(2, timeout=20) 
self.server = SuiteRuntimeServer( 
    self.scheduler, 
    context=SERVER_CONTEXT, 
    threaded=True, 
    barrier=barrier, 
    daemon=True)

There are 100 lines of boilerplate but all the test actually wants to do is to call a ZMQ endpoint.

These tests would be a lot simpler if written in bash (see #3611) but there is more to it than that...

Testing And Python

I think there is a strong argument for running cylc tests in Python, a few quick examples:

Powerful testing framework.
Much faster.
Mocking.
Ability to inspect the internals of Cylc.

Unfortunately CylcWorkflowTestCase does not yet have the benefit of the same evolution as the functional test harness, for example it uses the user/site configuration rather than the test configuration and doesn't tidy up afterwards.

A New Framework

By making some small changes to the Scheduler, namely moving command-line logic back into the command line we can make it possible to run suites by invoking the Scheduler directly.

This makes it possible to write a Python test harness for workflows without re-inventing the wheel.

This PR shows a proof of concept of how that can be done.

A New Battery?

We could just bung this code into the existing unit tests, however, I'm keen to keep integration tests (ones that run suites) out of the unit tests:

Keep the unit tests, simple, fast and clean.
- Often in development we break cylc run, but we might still want to run the unit tests.
Encourage unit tests over integration tests.
- Resist the urge to run workflows where not necessary.
Create a hierarchy of tests
- It's a lot easier to fix simple tests than complex ones, the mix of testing in tests/ makes it hard to flush out the simple errors.
- Devs should run the simplest tests first, fix the issues then move onto the more complex tests.
- unit tests < integration tests < functional tests.

This PR puts the integration test framework in its own directory, this forces us to maintain a distinction between unit and functional tests. (And yes the integration framework is unit tested but "A Foolish Consistency is the Hobgoblin of Little Minds".)

This POC

Registers all tests under a common hierarchy (like the functional tests).
Deletes suites if the tests pass.
Leaves behind no files if all the tests pass.
Handles startup and shutdown.
Is implemented in about 100 lines of code (see the fixtures at the bottom of itests/__init__)

Going Forward

This PR isn't ready to roll, there are a few major issues:

The suite log is blank (because the tests share a logger with the flows).
The tests aren't parallel safe YET because the Scheduler cannot be invoked twice in the same session YET.
Etc.

And some minor features yet to be implemented:

Use the test configuration.
Potentially support no_detach=False
etc.

Oh, and by the way, the tests in this PR take ~2s each, which includes suite installation, execution and shutdown so this could be quicker to run than the bash tests :)

Questions

@cylc/core

Different / better ideas?
Are my arguments for a third test battery compelling enough?
Should we co-locate tests e.g:
- cylc
  - tests
    - unit
    - integration
    - functional
    - u -> unit
    - i -> integration
    - f -> functoinal
      Makes it easy to strip them from distribution.
      Makes more sense if we manage to get the functional tests running under pytest too.
Are these tests close enough to integration tests for this approach to make sense.
Do the guidelines I've dropped in (see README.md files) make sense.

Requirements check-list

I have read CONTRIBUTING.md and added my name as a Code Contributor.
Contains logically grouped changes (else tidy your branch by rebase).
Does not contain off-topic changes (use other PRs for other changes).

Appropriate tests are included (unit and/or functional).
Already covered by existing tests.
Does not need tests (why?).

Appropriate change log entry included.
No change log entry required (why? e.g. invisible to users).

(master branch) I have opened a documentation PR at cylc/cylc-doc/pull/XXXX.
(7.8.x branch) I have updated the documentation in this PR branch.
No documentation update required.

hjoliver · 2020-05-20T02:57:51Z

After a quick skim through (all I have time for today 😬 ) - sounds brilliant 👍

wxtim · 2020-05-20T08:14:48Z

It seems sane as a concept and the examples look reasonable. I'm not sure I could dive in and use it to write a test of the scheduler straight out, although it'd be fun to try. Would you like me to give that a go?

oliver-sanders · 2020-05-20T08:58:38Z

Not quite ready for general use yet, the interface may well change, need to get logging working, etc.

oliver-sanders · 2020-05-22T12:40:07Z

I've managed to get to the point where cylc.flow.scheduler.Scheduler can be imported and run in the same process (meaning you can run multiple schedulers in a single event loop). Syntax simplified and direct access to the scheduler object (in the same process) granted.

It seems robust and I'm confident it can handle the existing use cases. Working my way through the existing unit tests and translating them across.

For me a simple test takes less than 2 seconds, using pytest scoping for sharing objects between test functions that's approx 2 seconds for a small battery of tests.

oliver-sanders

Ok, this is now a working system, not quite ready for review but good enough for discussion. I've converted two of the unittests over as a POC.

Here are some examples of the sort of thing that is now possible.

import logging
from pathlib import Path
  
import pytest
  
  
# no more gripping the log file, log tuples can be obtained returned by run_flow()
  
@pytest.mark.asyncio
async def test_cylc_version(flow, run_flow, simple_conf):
    """Ensure the flow logs the cylc version 8.0a1."""
    scheduler = flow(simple_conf)
    async with run_flow(scheduler) as log:
        assert (
            ('cylc', logging.INFO, 'Cylc version: 8.0a1')
            in log.record_tuples
        )   
  
  
# command line options can be provided to flow() using their "dest" names
  
@pytest.mark.asyncio
async def test_hold_start(flow, run_flow, simple_conf):
    """Ensure the flow starts in held mode when run with hold_start=True."""
    scheduler = flow(simple_conf, hold_start=True)
    async with run_flow(scheduler):
        assert scheduler.paused()
  
  
# when the flow stops the scheduler object is still there for us to poke
  
@pytest.mark.asyncio
async def test_shutdown(flow, run_flow, simple_conf):
    """Ensure the server shutsdown with the flow."""
    scheduler = flow(simple_conf)
    async with run_flow(scheduler):
        await scheduler.shutdown('because i said so')
        assert scheduler.server.socket.closed
  
  
# you don't have to run suites, infact we should avoid it when possible
  
@pytest.mark.asyncio
async def test_install(flow, run_flow, simple_conf, run_dir):
    """Ensure the flow starts in held mode when run with hold_start=True."""
    scheduler = flow(simple_conf)
    jobscript = Path(run_dir, scheduler.suite, '.service', 'etc', 'job.sh')
    assert jobscript.exists()

And the best bit:

[gw0] [ 25%] PASSED itests/test_foo.py::test_cylc_version 
itests/test_foo.py::test_hold_start 
[gw0] [ 50%] PASSED itests/test_foo.py::test_hold_start 
itests/test_foo.py::test_shutdown 
[gw0] [ 75%] PASSED itests/test_foo.py::test_shutdown 
itests/test_foo.py::test_install 
[gw0] [100%] PASSED itests/test_foo.py::test_install

oliver-sanders · 2020-05-22T14:57:24Z

itests/__init__.py

+    success = True
+    contact = (run_dir / scheduler.suite / '.service' / 'contact')
+    try:
+        asyncio.get_event_loop().create_task(scheduler.start())


We can now import cylc.flow.scheduler.Scheduler, initiate and run it from Python, all you need to do is to call Scheduler.start from an event loop. You can start multiple schedulers in the same event loop.

oliver-sanders · 2020-05-22T15:01:23Z

itests/__init__.py

+        success = False
+        raise exc from None  # raise the exception so the test fails
+    finally:
+        await scheduler.shutdown(SchedulerStop(StopMode.AUTO.value))


All schedulers start in a single process which makes tidying up afterwards a lot easier, just call the shutdown method.

oliver-sanders · 2020-05-22T15:06:44Z

itests/conftest.py

+
+
+@pytest.fixture
+def run_flow(run_dir):


The basic fixtures are:

make_flow - writes the suite.rc to disk.

make_scheduler - initiates the Scheduler.

flow - shorthand for make_scheduler(make_flow) because I'm lazy.

run_flow - calls Scheduler.start.

There are mod_* and ses_* variants of each of these to allow you to create Pytest fixtures with module or session level scoping. This allows us to run a workflow once and use it in multiple tests for efficiency reasons.

This is parallel safe!

Pytest has been configured to run tests from the same module together, so all the tests in a module can share the same scheduler, no problem.

caveat the exception is you can't run workflows in the session scope, but that would be crazy anyway.

oliver-sanders · 2020-05-22T15:07:41Z

itests/test_client.py

+@pytest.mark.asyncio
+async def test_ping(flow_a_w_client):
+    """It should return True if running."""
+    scheduler, client = flow_a_w_client
+    assert await client.async_request('ping_suite')
+    assert not client.socket.closed


This is a cylc ping example, the bash equivalent would be:

set_tests 2 mkdir "$HOME/cylc-run/$REG" -p cat > "$HOME/cylc-run/$REG" <<<__SUITERC__ [scheduling [[dependencies]] graph = foo __SUITERC__ run_ok cylc run "$REG" _poll_suite_started run_ok cylc ping "$REG" cylc stop "$REG" _poll_suite_stopped purge_suite exit

oliver-sanders · 2020-05-22T15:52:38Z

Other thoughts:

Raising nasty exceptions whilst the suite is running.
Testing the main loop plugins independently.
Compatible with pytest.monkeypatch for hack-the-codebase tests (e.g. simulate FS lag).
Triggering tests run in simulation mode (would be good for testing alternate branding in SOD).
Other ideas?

hjoliver · 2020-05-26T22:09:38Z

Other ideas?

I'm already overwhelmed by the existing great ideas here 😁

* move cli stuff into the cli * move functional stuff out of the cli * add an interface for creating scheduler options objects * tidy the --format argument * move daemonise logic to scheudler_cli * move event loop logic to scheduler_cli * move logging into scheduler_cli * move start message to scheduler_cli * store id as top-level attr

* use the new integration battery instead

oliver-sanders · 2020-06-12T14:24:11Z

No one screamed loud enough so I'm going to close this proposal and re-raise as a PR.

oliver-sanders added POC Proof of Concept question Flag this as a question for the next Cylc project meeting. labels May 19, 2020

oliver-sanders added this to the cylc-8.0.0 milestone May 19, 2020

oliver-sanders self-assigned this May 19, 2020

oliver-sanders modified the milestones: cylc-8.0.0, some-day May 19, 2020

hjoliver mentioned this pull request May 20, 2020

Better queue config warning. #3618

Merged

6 tasks

oliver-sanders force-pushed the scheduler-options branch from 4051e56 to 1617ac2 Compare May 22, 2020 14:53

oliver-sanders commented May 22, 2020

View reviewed changes

oliver-sanders modified the milestones: some-day, cylc-8.0a3 May 22, 2020

oliver-sanders mentioned this pull request May 26, 2020

tests: re-write flaky test in bash #3611

Closed

6 tasks

oliver-sanders force-pushed the scheduler-options branch from 3d2b2f2 to 671930d Compare June 4, 2020 15:02

oliver-sanders added 11 commits June 8, 2020 13:00

pytest: add pytest-xdist to developer dependencies

3da88ca

tests: scheduler_cli

75b4817

itests: new integration test framework for cylc in python

fb827ad

itests: meta-testing

f476e29

tests: add readme files to explain differences

758a45b

itests: conftest setup

a6e6e35

itests: add cylc.flow.network.client.SuiteRuntimeClient tests

d02b5b4

temp: remove optparer2nametuple dead end

7b4a339

itests: mutiprocessing -> asycio

0d8d624

itests: test the tests

d2ab7a4

itests: niceify test interface - again

234f8ac

oliver-sanders force-pushed the scheduler-options branch from aaba326 to 85397a6 Compare June 11, 2020 12:33

oliver-sanders added 3 commits June 11, 2020 13:40

itests: niceify test_publisher and replace old unittest

28fce55

itests: convert cylc/flow/tests/data_store_mgr

40f8b10

itests: niceify test_client

b803f46

oliver-sanders force-pushed the scheduler-options branch 3 times, most recently from 955724a to 6a0be64 Compare June 12, 2020 13:45

oliver-sanders added 13 commits June 12, 2020 14:52

data store: don't crash if tcp server not started

80c10aa

scheduler: open python api to release tasks

13bd5e5

itests: advanced tidying

10a9ac5

itests: convert cylc.flow.tests.network.test_resolvers

0d02209

itests: nuclear option for scheduler shutdown

55b1136

itests: convert cylc.flow.tests.test_job_pool

2d30f65

itests: convert cylc.flow.tests.network.test_server

6799120

tests: remove CylcWorkflowTestCase

c84ea10

* use the new integration battery instead

itests: remove scheduler unit test

911b3a7

itests: functional documentation

11d37da

scheduler: fix daemonisation and exit codes

51ef58d

itests: migrate some tests from cylc.flow.tests.network.test_zmq

a511248

scheduler: run startup handler during startup

6f34ed3

oliver-sanders force-pushed the scheduler-options branch from 6a0be64 to 58e08ec Compare June 12, 2020 14:08

oliver-sanders added 2 commits June 12, 2020 15:22

tests: co-locate functional, integration and unit tests

1a930cb

actions: run integration tests

fc04f2d

oliver-sanders force-pushed the scheduler-options branch from 58e08ec to fc04f2d Compare June 12, 2020 14:22

oliver-sanders removed this from the cylc-8.0a3 milestone Jun 12, 2020

oliver-sanders removed the question Flag this as a question for the next Cylc project meeting. label Jun 12, 2020

oliver-sanders closed this Jun 12, 2020

oliver-sanders mentioned this pull request Jun 13, 2020

tests: integration test battery #3654

Merged

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

proposal: integration testing battery #3616

proposal: integration testing battery #3616

oliver-sanders commented May 19, 2020 •

edited

Loading

hjoliver commented May 20, 2020

wxtim commented May 20, 2020

oliver-sanders commented May 20, 2020

oliver-sanders commented May 22, 2020 •

edited

Loading

oliver-sanders left a comment

oliver-sanders May 22, 2020

oliver-sanders May 22, 2020

oliver-sanders May 22, 2020

oliver-sanders May 22, 2020

oliver-sanders commented May 22, 2020

hjoliver commented May 26, 2020

oliver-sanders commented Jun 12, 2020

proposal: integration testing battery #3616

proposal: integration testing battery #3616

Conversation

oliver-sanders commented May 19, 2020 • edited Loading

tldr

The Testing Situation

CylcWorkflowTestCase

Testing And Python

A New Framework

A New Battery?

This POC

Going Forward

Questions

hjoliver commented May 20, 2020

wxtim commented May 20, 2020

oliver-sanders commented May 20, 2020

oliver-sanders commented May 22, 2020 • edited Loading

oliver-sanders left a comment

Choose a reason for hiding this comment

oliver-sanders May 22, 2020

Choose a reason for hiding this comment

oliver-sanders May 22, 2020

Choose a reason for hiding this comment

oliver-sanders May 22, 2020

Choose a reason for hiding this comment

oliver-sanders May 22, 2020

Choose a reason for hiding this comment

oliver-sanders commented May 22, 2020

hjoliver commented May 26, 2020

oliver-sanders commented Jun 12, 2020

oliver-sanders commented May 19, 2020 •

edited

Loading

oliver-sanders commented May 22, 2020 •

edited

Loading