Document for general test format #39

djrtwo · 2018-10-04T02:44:33Z

Added test-format.md to describe the high level specification that all YAML test documents need to conform to. Once this is approved, I will add our current proposed test suites (ssz, chain tests, and shuffle) as separate documents under specs/test-suites

Any and all feedback welcome.

CC: @paulhauner, @mratsim

djrtwo · 2018-10-04T02:50:51Z

Note: I stole this sample test suite and test vectors from @paulhauner's https://notes.ethereum.org/n7fyPi4cR-Gg9Ypq7ylTrQ?view due to it being the simplest test suite (in terms of number of fields) that we've proposed so far

paulhauner · 2018-10-04T09:07:22Z

This looks good to me. The only thing I can think of is giving some guidance on how to deal with forks and versioning. (Note: I don't know how this works in present Ethereum, so happy to ditch this if there's an already established process).

Firstly, I'm assuming that whenever there's a fork all the specs need to change their fork to match the name of the new fork. This means that a single branch of the eth2.0-specs repo only deals with one single fork.

I propose we use the Semantic Versioning 2.0 MAJOR.MINOR.PATCH format like so:

PATCH: fixed some small bug in the tests. E.g. typo in a description.
MINOR: add a new test of the same structure. E.g. some edge case wasn't previously tested and it was added without changing the test_cases structure.
MAJOR: a new test requires the test_cases structure to change. E.g. input variable wasn't considered and the test structure needs to change.

Whenever there's a fork, the versioning resets but still communicates the stability of the test vectors. Consider the case where there's a fork that doesn't actually affect the test vectors, we can use the following guide:

Alpha 3.2.5 -> Beta 1.0.0
Alpha 0.3.2 -> Beta 0.1.0
Alpha 0.0.3 -> Beta 0.0.1

If the fork destroys the test cases and we need to start at zero again, it's fine to go from 1.0.0 to 0.0.1.

Thoughts?

mratsim · 2018-10-04T10:54:01Z

Assuming you are talking about the hard forks once Shasper is released, we shouldn't nuke the specs and tests once a new one is released.

Either we organize in a functional manner like https://github.com/ethereum/tests:

.
├── ABITests
│   └── basic_abi_tests.json
├── BasicTests
│   ├── README.md
│   ├── blockgenesistest.json
│   ├── crypto.json
│   ├── difficulty.json
│   ├── difficultyByzantium.json
│   ├── difficultyCustomHomestead.json
│   ├── difficultyCustomMainNetwork.json
│   ├── difficultyFrontier.json
│   ├── difficultyHomestead.json
│   ├── difficultyMainNetwork.json
│   ├── difficultyMorden.json
│   ├── difficultyOlimpic.json
│   ├── difficultyRopsten.json
│   ├── genesishashestest.json
│   ├── hexencodetest.json
│   ├── keyaddrtest.json
│   └── txtest.json
├── BlockchainTests
│   ├── GeneralStateTests
│   ├── TransitionTests
│   ├── bcBlockGasLimitTest
│   ├── bcExploitTest
│   ├── bcForgedTest
│   ├── bcForkStressTest
│   ├── bcGasPricerTest
│   ├── bcInvalidHeaderTest
│   ├── bcMultiChainTest
│   ├── bcRandomBlockhashTest
│   ├── bcStateTests
│   ├── bcTotalDifficultyTest
│   ├── bcUncleHeaderValidity
│   ├── bcUncleTest
│   ├── bcValidBlockTest
│   └── bcWalletTest
├── GeneralStateTests
│   ├── stArgsZeroOneBalance
│   ├── stAttackTest
│   ├── stBadOpcode
│   ├── stBugs
│   ├── stCallCodes
│   ├── stCallCreateCallCodeTest
│   ├── stCallDelegateCodesCallCodeHomestead
│   ├── stCallDelegateCodesHomestead
│   ├── stChangedEIP150
│   ├── stCodeCopyTest
│   ├── stCodeSizeLimit
│   ├── stCreate2
│   ├── stCreateTest
│   ├── stDelegatecallTestHomestead
│   ├── stEIP150Specific
│   ├── stEIP150singleCodeGasPrices
│   ├── stEIP158Specific
│   ├── stEWASMTests
│   ├── stExample
│   ├── stHomesteadSpecific
│   ├── stInitCodeTest
│   ├── stLogTests
│   ├── stMemExpandingEIP150Calls
...

Or we use a per-fork structure (names courtesy of the Ubuntu Name Generator)

.
├── specs
│   ├── 20190601_IceAge
│   ├── 20200101_Frontier
│   ├── 20200601_ErgonomicEmu
│   └── 20210101_OrthodoxOriole
└── tests
    ├── 20190601_IceAge
    ├── 20200101_Frontier
    ├── 20200601_ErgonomicEmu
    └── 20210101_OrthodoxOriole

Now versioning is also important because it's much easier to check generically in for a SSZ deserializer for example.

One thing regarding specs, when implementing Ethereum 1.0 I find it very hard to exhume the old gas costs of the earlier forks because I couldn't find an old Yellow Paper and EIPs are tracking history of changes but there was no way to get a snapshot of the state of the specs at a certain point of time. So client implementers that are catching up have to check Geth and Parity codebase.

djrtwo · 2018-10-04T12:06:44Z

All client codebases have to handle all forks and all the tests remain! yay legacy code.

To sync the chain after a fork, your codebase must remember how to process blocks before the fork and how to process blocks after the fork.

py-evm handles this pretty cleanly here
And then they instantiate different chains (like main net or test nets) like this defining fork block constants.

paulhauner · 2018-10-04T12:15:55Z

Good call @mratsim. I take back what I said about the repo only handling one fork.

…

On Thu, 4 Oct 2018 at 10:06 pm, Danny Ryan ***@***.***> wrote: All client codebases have to handle all forks and all the tests reamin! yay legacy code. To sync the chain after a fork, your codebase must remember how to process blocks before the fork *and* how to process blocks after the fork. py-evm handles this pretty cleanly here <https://github.com/ethereum/py-evm/tree/master/eth/vm/forks> And then they instantiate different chains (like main net or test nets) like this <https://github.com/ethereum/py-evm/blob/master/eth/chains/mainnet/constants.py> defining fork block constants. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#39 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AGWiNJklRVmq5KJEZ88aM7Kr48snzDrTks5uhfnWgaJpZM4XHRTX> .

mkalinin · 2018-10-05T14:41:21Z

Either we organize in a functional manner ... Or we use a per-fork structure

Recalling experience gained from eth1.0 test suite.
Previously, there was a per-fork structure which has been changed to the structure we have now (functinal manner):

.
├── tests 
    ├── GeneralStateTests
        ├── stCreate2
        ├── stShift
        ├── stZeroKnowledge
            ├── ecmul_1-2_2_28000_128.json
            ├── ecmul_1-2_2_28000_96.json
            ├── ecmul_1-2_5617_28000_128.json
                ├── IN
                ├── OUT_Homestead
                ├── OUT_Byzantium
                ├── OUT_Constantinople

With good implementation design, this structure makes it trivial to enable new forks on client side. It's just adding a new line to the list of forks. While per-fork manner would require to handle new folders each time the fork is added.

As I understand this structure is suitable for test writers as well. @winsvega might want to comment on that as he works on tests from the very beginning of eth1.0.

A bit specific thing, but @holiman has a proposal on improving #ethereum/tests repository: https://gist.github.com/holiman/fdec3547f2b104803abbd2c6e751a8e7#proposed-solution.
It's a solution for getting rid of PRs like that ethereum/tests#511. It might be early to think about test generators but it worth keeping this problem in mind for the future.

winsvega · 2018-10-06T20:55:46Z

The problem now is that if you want to generate new fork you have to refresh a file with all previous fork tests. So a solution would be to use 1 source file -> generates many test files.

Although it wont really work if we keep filler hash checking.

mratsim · 2018-10-09T16:33:57Z

Seems like we are rediscovering the Expression Problem which this blog post summarizes quite well for sum types vs interfaces:

With sum types, it’s easy to add uses but hard to add cases.
With interfaces, it’s easy to add cases but hard to add uses.

In our case:

With functional organization, it's easy to add new tests but it's add to hard forks in the test suite (need to update all tests with the new case)
With per-fork, it's easy to add forks but hard to add tests (need to add one in all forks)

mkalinin · 2018-10-10T14:38:07Z

With per-fork, it's easy to add forks but hard to add tests (need to add one in all forks)

My hope is that in this case hardness of adding a test could be mitigated somehow. I am not sure what filler hash checking does exactly mean but solution with one source file that produces several tests could do the trick.

IMHO, it's worth betting on some difficulties during test creation if this is a prize of making implementation updates trivial. Cause, there gonna be only one test repository that affects several implementations.

winsvega · 2018-10-11T09:29:28Z

filler hash checks that test been generated with the latest filler version

paulhauner · 2018-10-11T10:28:24Z

Sorry @winsvega, I'm not sure what a "filler" is.

winsvega · 2018-10-11T11:27:02Z

filler aka testFiller is a test source file.
each test source file generate a complete test. right now all tests for all forks are generated from testFiller into one file. and that file has a hash of testFiller that generated it. so in case changes are done and tests are not updated there will be an error.

if we put generated tests into separate files (in order not to touch it with each test updated on each new fork) the fillerHash field will become meaningless. I guess I will have to change it into expectSection hash or remove at all.

djrtwo · 2018-10-16T22:49:17Z

As discussed in the call, we are merging this document in as tentative so that we can use it for simple tests across clients -- SSZ, shuffle, etc.

We will iterate from there with the hope that we lock down some more specifications on overall structure and strategy of testing around devcon.

Also update binary output due to metadata change.

[test-format] Add base document for general test format

f1dda7f

djrtwo mentioned this pull request Oct 4, 2018

[SSZ] test vectors ethereum/beacon_chain#115

Open

hwwhww added the general:RFC Request for Comments label Oct 4, 2018

a note about wip

8aeb55d

djrtwo merged commit 0e86ab8 into master Oct 17, 2018

djrtwo deleted the test-format branch January 1, 2019 22:08

hwwhww pushed a commit that referenced this pull request Aug 17, 2020

Upgrade to Solidity 0.6.11 (#39)

ebba752

Also update binary output due to metadata change.

hartonosugi792 mentioned this pull request Jun 8, 2022

Acceptable Certification #2910

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Document for general test format #39

Document for general test format #39

djrtwo commented Oct 4, 2018

djrtwo commented Oct 4, 2018

paulhauner commented Oct 4, 2018 •

edited

Loading

mratsim commented Oct 4, 2018

djrtwo commented Oct 4, 2018 •

edited

Loading

paulhauner commented Oct 4, 2018 via email

mkalinin commented Oct 5, 2018

winsvega commented Oct 6, 2018 •

edited

Loading

mratsim commented Oct 9, 2018

mkalinin commented Oct 10, 2018

winsvega commented Oct 11, 2018

paulhauner commented Oct 11, 2018

winsvega commented Oct 11, 2018

djrtwo commented Oct 16, 2018

Document for general test format #39

Document for general test format #39

Conversation

djrtwo commented Oct 4, 2018

djrtwo commented Oct 4, 2018

paulhauner commented Oct 4, 2018 • edited Loading

mratsim commented Oct 4, 2018

djrtwo commented Oct 4, 2018 • edited Loading

paulhauner commented Oct 4, 2018 via email

mkalinin commented Oct 5, 2018

winsvega commented Oct 6, 2018 • edited Loading

mratsim commented Oct 9, 2018

mkalinin commented Oct 10, 2018

winsvega commented Oct 11, 2018

paulhauner commented Oct 11, 2018

winsvega commented Oct 11, 2018

djrtwo commented Oct 16, 2018

paulhauner commented Oct 4, 2018 •

edited

Loading

djrtwo commented Oct 4, 2018 •

edited

Loading

winsvega commented Oct 6, 2018 •

edited

Loading