
Add tests for ONNX models #6

Open
ScottTodd opened this issue Aug 9, 2024 · 2 comments
Assignees
Labels
enhancement New feature or request

Comments


ScottTodd commented Aug 9, 2024

See nod-ai/SHARK-TestSuite#275

Some work has gone into importing models already at https://github.com/nod-ai/SHARK-TestSuite/tree/main/e2eshark/onnx/models

@ScottTodd ScottTodd self-assigned this Sep 6, 2024
ScottTodd added a commit that referenced this issue Sep 19, 2024
Progress on #6.

A sample test report HTML file is available here:
https://scotttodd.github.io/iree-test-suites/onnx_models/report_2024_09_17.html

These new tests:

* Download models from https://github.com/onnx/models
* Extract metadata from the models to determine which functions to call
with random data
* Run the models through [ONNX Runtime](https://onnxruntime.ai/) as a
reference implementation
* Import the models using `iree-import-onnx` (until we have a better
API: iree-org/iree#18289)
* Compile the models using `iree-compile` (currently just for `llvm-cpu`
but this could be parameterized later)
* Run the models using `iree-run-module`, checking outputs using
`--expected_output` and the reference data
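The metadata-to-random-data step above might look roughly like this (a minimal sketch; the dtype table and the `random_input` helper are illustrative, not the suite's actual code):

```python
import numpy as np

# Map ONNX Runtime tensor type strings to numpy dtypes.
# (Partial table, for illustration only.)
ONNX_TO_NUMPY = {
    "tensor(float)": np.float32,
    "tensor(double)": np.float64,
    "tensor(int32)": np.int32,
    "tensor(int64)": np.int64,
    "tensor(uint8)": np.uint8,
}

def random_input(type_str, shape, seed=0):
    """Build one random input tensor from model metadata.

    Symbolic/dynamic dimensions (None, or strings like "batch") are
    pinned to 1 so every test sees a concrete shape.
    """
    rng = np.random.default_rng(seed)
    concrete = tuple(d if isinstance(d, int) else 1 for d in shape)
    dtype = ONNX_TO_NUMPY[type_str]
    if np.issubdtype(dtype, np.floating):
        return rng.random(concrete, dtype=dtype)
    return rng.integers(0, 128, size=concrete, dtype=dtype)

# Example: an input declared as float32 with a dynamic batch dimension.
x = random_input("tensor(float)", ["batch", 3, 224, 224])
print(x.shape, x.dtype)  # (1, 3, 224, 224) float32
```

The same arrays can then be fed both to ONNX Runtime (as the reference) and to `iree-run-module` (via `--input=` flags), so both implementations see identical data.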

Tests are written in Python using a set of pytest helper functions. As
the tests run, they can log details about what commands they are
running. When run locally, the `artifacts/` directory will contain all
the relevant files. More can be done in follow-up PRs to improve the
ergonomics there (like generating flagfiles).
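A flagfile generator for the artifacts directory could be as simple as the following sketch (the helper and file name are hypothetical; the flag spellings mirror `iree-run-module`'s CLI, where `@file` loads tensor data from a file):

```python
from pathlib import Path

def write_flagfile(artifact_dir, module, function, inputs, expected_outputs):
    """Write the flags for one iree-run-module invocation to a file so a
    failing test can be replayed by hand. Hypothetical helper; flag names
    mirror iree-run-module's CLI.
    """
    flags = [f"--module={module}", f"--function={function}"]
    flags += [f"--input=@{p}" for p in inputs]
    flags += [f"--expected_output=@{p}" for p in expected_outputs]
    path = Path(artifact_dir) / "run_module_flags.txt"
    path.write_text("\n".join(flags) + "\n")
    return path
```

If flagfile support is available in the tool build, the resulting file could then be passed back with `iree-run-module --flagfile=artifacts/.../run_module_flags.txt`.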

Each test case can use XFAIL like
`@pytest.mark.xfail(raises=IreeRunException)`. As we test across
multiple backends or want to configure the test suite from another repo
(e.g. [iree-org/iree](https://github.com/iree-org/iree)), we can explore
more expressive marks.
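The XFAIL pattern can be sketched as follows; `IreeRunException` and the test body here are stand-ins for the suite's actual exception type and compile-and-run logic:

```python
import pytest

class IreeRunException(RuntimeError):
    """Stand-in for the test suite's runtime-failure exception."""

def run_model_under_iree():
    # Placeholder for the compile-and-run step; always fails here so the
    # xfail mark is exercised.
    raise IreeRunException("iree-run-module returned a nonzero exit code")

# With raises=..., the test only counts as XFAIL if that specific
# exception type is raised; with strict=True, an unexpected pass (XPASS)
# fails the suite, flagging tests whose expectations have gone stale.
@pytest.mark.xfail(raises=IreeRunException, strict=True)
def test_known_failing_model():
    run_model_under_iree()
```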

Note that unlike the ONNX _operator_ tests, these tests use
`onnxruntime` and `iree-import-onnx` at test time. The operator tests
handle that as an infrequently run offline step. We could do something
similar here, but the test inputs and outputs can be rather large for
real models and that gets into Git LFS or cloud storage territory.

If this test authoring model works well enough, we can do something
similar for other ML frameworks like TFLite
(#5).
ScottTodd added a commit to iree-org/iree that referenced this issue Oct 17, 2024
Progress on iree-org/iree-test-suites#6.

Current tests included and their statuses:
```
PASSED onnx_models/tests/vision/classification_models_test.py::test_alexnet
PASSED onnx_models/tests/vision/classification_models_test.py::test_caffenet
PASSED onnx_models/tests/vision/classification_models_test.py::test_densenet_121
PASSED onnx_models/tests/vision/classification_models_test.py::test_googlenet
PASSED onnx_models/tests/vision/classification_models_test.py::test_inception_v2
PASSED onnx_models/tests/vision/classification_models_test.py::test_mnist
PASSED onnx_models/tests/vision/classification_models_test.py::test_resnet50_v1
PASSED onnx_models/tests/vision/classification_models_test.py::test_resnet50_v2
PASSED onnx_models/tests/vision/classification_models_test.py::test_shufflenet
PASSED onnx_models/tests/vision/classification_models_test.py::test_shufflenet_v2
PASSED onnx_models/tests/vision/classification_models_test.py::test_squeezenet
PASSED onnx_models/tests/vision/classification_models_test.py::test_vgg19
XFAIL onnx_models/tests/vision/classification_models_test.py::test_efficientnet_lite4
XFAIL onnx_models/tests/vision/classification_models_test.py::test_inception_v1
XFAIL onnx_models/tests/vision/classification_models_test.py::test_mobilenet
XFAIL onnx_models/tests/vision/classification_models_test.py::test_rcnn_ilsvrc13
XFAIL onnx_models/tests/vision/classification_models_test.py::test_zfnet_512
```

* CPU only for now. We haven't yet parameterized those tests to allow
for other backends or flags.
* Starting with `--override-ini=xfail_strict=false` so newly _passing_
tests won't fail the job; newly _failing_ tests still will. If we want
to track pass/fail status in this repository instead of in the test
suite repo, we can add an external config file that customizes which
tests are expected to fail, as the ONNX op tests do.

Sample logs:
https://github.com/iree-org/iree/actions/runs/11371239238/job/31633406729?pr=18795

ci-exactly: build_packages, test_onnx
ScottTodd commented:

An initial set of ONNX model tests has landed in https://github.com/iree-org/iree-test-suites/tree/main/onnx_models and is now running on every IREE PR / commit. Sample logs from a test run in IREE: https://github.com/iree-org/iree/actions/runs/11390747229/job/31693710855#step:8:19.

Test summary:

```
=========================== short test summary info ============================
PASSED iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_alexnet
PASSED iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_caffenet
PASSED iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_densenet_121
PASSED iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_googlenet
PASSED iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_inception_v2
PASSED iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_mnist
PASSED iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_resnet50_v1
PASSED iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_resnet50_v2
PASSED iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_shufflenet
PASSED iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_shufflenet_v2
PASSED iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_squeezenet
PASSED iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_vgg19
XFAIL iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_efficientnet_lite4
XFAIL iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_inception_v1
XFAIL iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_mobilenet
XFAIL iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_rcnn_ilsvrc13
XFAIL iree-test-suites/onnx_models/tests/vision/classification_models_test.py::test_zfnet_512
================== 12 passed, 5 xfailed in 177.35s (0:02:57) ===================
```

Open tasks now:

ScottTodd commented:

I'll try to import more tests and parameterize across backends today.
