EPIC: Post feature freeze testing #1256

evan-forbes · 2023-01-18T01:15:18Z

MSevey · 2023-01-18T18:58:29Z

Something that might be good to add to the stretch goals is thinking about a framework for testing version upgrades and compatibility.

This could start by just ensuring we have the ability to launch nodes on different versions, and upgrade or downgrade a node without restarting the test.

evan-forbes · 2023-01-18T19:01:18Z

yes! good idea @MSevey, I totally forgot about #313, added to stretch goals

MSevey · 2023-01-18T19:08:08Z

nice yea, honestly i think that test #313 might make sense to bump up to required and tie into the release. Basically launch the tests with half the nodes on the current patch version and half on the new patch version as a sanity check that nothing breaks.

Then a future test could be more involved in terms of more patch versions, minor versions, and major versions to really stress test the network and understand compatibility issues. This one would of course be more of a debugging test. There might be some specific upgrade scenarios that we would want to build into the pipeline, but this and those would definitely be stretchy stretch goals.
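The half-and-half sanity check described above could be sketched roughly as follows. This is only an illustration: `assignVersions` and the version tags are hypothetical names, not part of any existing test framework in the repo, and a real test would hand the resulting versions to whatever launches the nodes.

```go
package main

import "fmt"

// assignVersions returns one version string per node: the first half of the
// nodes stay on the current patch version and the rest run the new one.
// With an odd node count, the extra node runs the new version.
func assignVersions(nodeCount int, current, next string) []string {
	versions := make([]string, nodeCount)
	for i := range versions {
		if i < nodeCount/2 {
			versions[i] = current
		} else {
			versions[i] = next
		}
	}
	return versions
}

func main() {
	// Hypothetical version tags, for illustration only.
	for i, v := range assignVersions(4, "v1.0.0", "v1.0.1") {
		fmt.Printf("node-%d -> %s\n", i, v)
	}
}
```

The later, more involved test would likely replace the fixed 50/50 split with a list of versions to mix.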

Bidon15 · 2023-01-19T09:59:57Z

Thanks for the detailed overview of the epic. It's Epic 😄
I would appreciate it if we took the time to discuss how we approach this epic from both the app and devops/testing teams.

We can break down, for example, #1258 or compact blocks (celestiaorg/celestia-core#883) to decide who does what, so that in the end the tests bring the most value and insight to the core/app team.
We can also learn how to collaborate better on future joint issues.

cmwaters · 2023-01-20T10:12:21Z

I think an overarching goal around our testing should be to put together some written artefact across the teams which can really act as a handbook, making it clear for any team member or for anyone interested what we test, how we test it, when we test things and so forth. There is no monolithic, one-size-fits-all approach to testing. It really requires a myriad of different solutions that juggle the tradeoffs between different forms of coverage, "introspectability", time and resource costs. Thinking about testing in the same systematic way we think about protocols and writing "specifications" will produce more reliable software. Having a place where people can read about everything will make it easier for us to cover our blind spots.

cmwaters · 2023-01-20T10:13:17Z

I also think @MSevey is totally spot on with his points. We should already be getting into the rhythm with upgrade and compatibility testing so it's like clockwork for us engineers by the time we hit our first mainnet upgrade. Previously at Tendermint, I had started working on extending the e2e framework for upgrade testing and perhaps we should endeavour to do the same here. We need to make sure this works alongside our versioning strategy. If we have long-lived branches for our minor versions, we need to ensure that rolling upgrades happen smoothly for node operators. This is generally lightweight and thus could be tested on a per-PR basis (in the same way you might detect protobuf breaking changes).
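A rolling upgrade check along these lines might look like the sketch below. Everything here is an assumption made for illustration: `Node`, `rollingUpgrade` and the `healthy` callback are hypothetical and do not mirror the Tendermint e2e framework. A real health check would verify something like block height continuing to advance.

```go
package main

import (
	"errors"
	"fmt"
)

// errUnhealthy is returned when the network fails the health check mid-upgrade.
var errUnhealthy = errors.New("network unhealthy after upgrade")

// Node is a hypothetical handle on a running testnet node.
type Node struct {
	Name    string
	Version string
}

// rollingUpgrade switches nodes to target one at a time. After each switch it
// calls healthy; on failure it restores that node's previous version and stops,
// leaving already-upgraded nodes on the target version.
func rollingUpgrade(nodes []*Node, target string, healthy func() bool) error {
	for _, n := range nodes {
		previous := n.Version
		n.Version = target
		if !healthy() {
			n.Version = previous
			return fmt.Errorf("upgrading %s to %s: %w", n.Name, target, errUnhealthy)
		}
	}
	return nil
}

func main() {
	// Hypothetical node names and version tags.
	nodes := []*Node{
		{Name: "val-0", Version: "v1.0.0"},
		{Name: "val-1", Version: "v1.0.0"},
	}
	// Stub health check that always passes.
	err := rollingUpgrade(nodes, "v1.0.1", func() bool { return true })
	fmt.Println(err, nodes[0].Version, nodes[1].Version)
}
```

Upgrading one node at a time keeps the rest of the network on a known version, which is what makes a failure attributable to a single step.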

MSevey · 2023-01-20T17:56:45Z

> I think an overarching goal around our testing should be to put together some written artefact across the teams which can really act as a handbook, making it clear for any team member or for anyone interested what we test, how we test it, when we test things and so forth. There is no monolithic, one-size-fits-all approach to testing. It really requires a myriad of different solutions that juggle the tradeoffs between different forms of coverage, "introspectability", time and resource costs. Thinking about testing in the same systematic way we think about protocols and writing "specifications" will produce more reliable software. Having a place where people can read about everything will make it easier for us to cover our blind spots.

One option to kick start this type of documentation could be a README in the test package that explains the different tooling and types of tests.

Ref: #1256, #1535 This commit introduces a new package `txsim` for controlled fuzz testing at a transaction level. Its purpose is to simulate a wide range of possible user interactions while also being able to apply a considerable load to the network.
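To illustrate what "controlled" fuzzing at the transaction level can mean, here is a minimal sketch of a seeded, weighted choice of interactions. It is not the actual `txsim` API: `action`, `pickActions` and the action names are hypothetical, and the sketch only selects what to send rather than building or broadcasting transactions.

```go
package main

import (
	"fmt"
	"math/rand"
)

// action is a hypothetical kind of user interaction to simulate.
type action struct {
	name   string
	weight int // relative frequency; zero means never picked
}

// pickActions draws count actions at random according to their weights.
// The same seed always yields the same sequence, so a failing run can be replayed.
// It assumes at least one action has a positive weight.
func pickActions(seed int64, count int, actions []action) []string {
	total := 0
	for _, a := range actions {
		total += a.weight
	}
	rng := rand.New(rand.NewSource(seed))
	picked := make([]string, 0, count)
	for i := 0; i < count; i++ {
		r := rng.Intn(total)
		// Walk the weights until r falls inside an action's bucket.
		for _, a := range actions {
			if r < a.weight {
				picked = append(picked, a.name)
				break
			}
			r -= a.weight
		}
	}
	return picked
}

func main() {
	// Hypothetical interaction mix.
	actions := []action{
		{name: "send", weight: 5},
		{name: "submit-blob", weight: 3},
		{name: "delegate", weight: 1},
	}
	fmt.Println(pickActions(42, 10, actions))
}
```

Raising `count` or running several such sequences concurrently is one way the same mechanism could double as load generation.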

Ref: #1256 This commit puts together the base pieces for an e2e testing suite. It includes a CLI that can:

- Set up the file directories for multiple nodes, including generating a genesis with several accounts
- Start the testnet (with different versions and at different start heights)
- Stop the network and clean up the used resources

evan-forbes · 2024-02-21T10:53:46Z

we still have some dangling issues, but most have been ice-boxed

evan-forbes added the `epic` label (item groups other items for easier tracking) Jan 18, 2023

evan-forbes added this to the Mainnet milestone Jan 18, 2023

evan-forbes added this to Engineering Team Epics Jan 18, 2023

evan-forbes moved this to Todo in Engineering Team Epics Jan 18, 2023

This was referenced Feb 1, 2023

Create testing handbook #1324

Open

EPIC: Investigate and fix existing testnet network issues #1340

Closed

evan-forbes added the `testing` label (items that are strictly related to adding or extending test coverage) Feb 7, 2023

cmwaters mentioned this issue Mar 31, 2023

feat: minimal e2e framework #1586

Merged

cmwaters mentioned this issue Apr 11, 2023

feat: tx simulator #1613

Merged

evan-forbes removed this from the Mainnet milestone Aug 7, 2023

evan-forbes closed this as completed Feb 21, 2024

github-project-automation bot moved this from Todo to Done in Engineering Team Epics Feb 21, 2024
