
Add backend tests from tarball using Tar.jl #89

Merged
merged 2 commits into FluxML:master on May 20, 2023

Conversation

ordicker
Contributor

Hi,

ONNX has a huge test suite; I wrote one test for review.

I think we should aim to pass all the tests.

This is the same PR as before, but with an (uncompressed) tarball.

PR Checklist

  • Tests are added (only tests)
  • Documentation, if applicable

@Pangoraw
Contributor

Just my random opinion, but this may be a good use case for an Artifact? Putting the tar in the repo means that every user of ONNX.jl will need to download it even when not running tests.
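
For reference, a minimal sketch of what consuming such an Artifact from the test suite could look like, assuming a hypothetical Artifacts.toml entry named onnx_backend_tests that points at a hosted tarball (the artifact name, hosting location, and directory layout are placeholders, not part of this PR):

using Pkg.Artifacts

# Resolves the artifact: downloads and unpacks the hosted tarball on first use,
# then reuses the locally cached copy on later runs.
data_dir = artifact"onnx_backend_tests"
prefix = joinpath(data_dir, "data", "node")  # assumed to mirror the layout of the tarball in this PR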

@ordicker
Contributor Author

Not random at all. That's much better, and I'll try to implement it.

@Pangoraw
Contributor

One problem that might arise is where to host the Tar archive. A solution which has been done before is to create a new repo where we create GitHub releases and upload the tar archives as artifacts there (see NodeJSBuilder for example). However, I don't know what would be most convenient for the FluxML org.

@ToucheSir
Member

If this tarball is being derived from the ONNX repo, would it make sense to use it directly via a submodule? Not sure how good Pkg support for those is.

@ordicker
Contributor Author

So for now we continue without the artifact?

I think FluxML should have an artifact repo for models, datasets, etc.

@ToucheSir
Member

There's nothing fundamentally wrong with using artifacts, but they do require versioning and ongoing maintenance. Artifacts are also generally preferred for handling binary data or data that would be inappropriate to version control with git. My perhaps incorrect understanding of the ONNX tests is that they're not explicitly versioned, and everything is a VCS-compatible text file already. Thus it may make more sense to use a submodule for them instead of going through the longer process of creating and registering artifacts.

@ordicker
Contributor Author

Got it.
Could you help me set up the submodule?
I've never worked with one.

@ordicker
Contributor Author

ordicker commented May 2, 2023

Do I need to make a new repo and add it using git submodule add <new-repo-url>?
@ToucheSir

@dfdx
Collaborator

dfdx commented May 2, 2023

Sorry for the silence! I don't know much about git submodules either, but the following seems to work:

dfdx@dfdx:~/work/ONNX.jl$ git submodule add git@github.com:onnx/onnx.git test/onnx
Cloning into '/home/azbs/work/ONNX.jl/test/onnx'...
remote: Enumerating objects: 42908, done.
remote: Counting objects: 100% (834/834), done.
remote: Compressing objects: 100% (417/417), done.
remote: Total 42908 (delta 462), reused 692 (delta 403), pack-reused 42074
Receiving objects: 100% (42908/42908), 29.19 MiB | 309.00 KiB/s, done.
Resolving deltas: 100% (25262/25262), done.

dfdx@dfdx:~/work/ONNX.jl$ git status
On branch test-suites-as-submodule
Changes to be committed:
  (use "git restore --staged <file>..." to unstage)
	new file:   .gitmodules
	new file:   test/onnx

@ToucheSir
Member

Yes, sorry. I saw your questions too, but I haven't yet looked for an answer on how to handle submodules in the usual build/CI workflow. Looking through JuliaLang/Pkg.jl#708, perhaps a subtree would be a better option than a submodule?
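
For comparison, vendoring the test data with a subtree would look roughly like this (an illustration only, not something done in this PR; --squash avoids pulling the full onnx history into ONNX.jl):

git subtree add --prefix test/onnx git@github.com:onnx/onnx.git main --squash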

@ordicker
Contributor Author

ordicker commented May 3, 2023

That looks bad.

I think we are overengineering the solution. It's 22MB of binary files (they could be compressed, but that would add a package dependency).
I don't think we should track the file itself, but its location should be public and owned by FluxML.

For any git-based solution (submodule/subtree/data repo), I need help opening a repo under FluxML (just an empty repo with a meaningful name).
Another option: if FluxML has a server, we could upload the file there and use it as an artifact.
I could upload it to my own repo (ordicker/onnx_backend_testing), but I think it should be under FluxML.

@ToucheSir
Member

My main question with self hosting would be how to keep these files up to date and who is responsible for doing so. Also you said these were binary files? Are those created from the text files in the ONNX repo, or do they come from somewhere else?

@ordicker
Contributor Author

ordicker commented May 3, 2023

I don't think we need to update them often (or even at all).
They are generated by the onnx repo for testing (link).
I generate the test files (protobuf .pb) and bundle them into a tarball.

@ToucheSir
Member

Oh I see, they're already binary files. What I meant by updating is: what happens when the ONNX maintainers decide to change the contents of e.g. https://github.com/onnx/onnx/tree/main/onnx/backend/test/data/node/test_adagrad_multiple on their end? I presume we'd want to have those updates on our end too, but because Julia artifacts are generally considered a snapshot of some data at a point in time, I think it'd require manual effort to do so.

@ordicker
Contributor Author

ordicker commented May 3, 2023

I'm proposing to do it manually, mostly because it doesn't change often.

@ordicker
Contributor Author

ordicker commented May 3, 2023

BTW, what about onnx.proto3?
We don't keep track of it either…

@ToucheSir
Member

I'm afraid I don't know what onnx.proto3 is. Does the official repo provide both protobuf v2 and v3 versions of the artifacts we're interested in, while we're only handling and/or able to handle the v2 versions right now?

@ordicker
Contributor Author

ordicker commented May 5, 2023

onnx.proto3 is the protobuf interface definition.
We use it to generate onnx_pb.jl automatically with ProtoBuf.jl.
I think onnx.proto3 supports both v3 and v2, but I'm not sure.
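
As a rough illustration, regenerating the bindings with ProtoBuf.jl could look something like this (a sketch assuming the ProtoBuf.jl v1 code generator and placeholder paths; the exact invocation ONNX.jl uses may differ):

using ProtoBuf

# Generate Julia bindings from the protobuf definition.
# Assumes onnx.proto3 sits in the current directory and output goes to src/.
ProtoBuf.protojl("onnx.proto3", ".", "src")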

I think we should focus on adding more operations, and the test suite is just scaffolding.
When we implement most of the interface, we can remove it.

test/backend.jl Outdated
end

@testset "Nodes" begin
prefix = extract("backend_data.tar")*"/data/node/"
Collaborator

Are these files deleted after the test? If not, we should probably add the whole directory to the .gitignore.

Contributor Author

Good point. I thought that since it goes to /tmp/ I wouldn't have to worry about it, but you are right.
I will fix that.

@dfdx
Collaborator

dfdx commented May 6, 2023

I don't like the overengineering discussion we are falling into, but adding a 22MB file to the repo makes me nervous too. The problem is that once this file gets into the git history, there's no way to delete it from there. Even worse, whenever we change the file (and I assume sometimes we will), the size is multiplied. This way we may end up with a >100MB repo of which only 136KB (the current size of src) is actually useful for end users.

How about downloading the tar file directly from the GitHub release? I believe we can pin to the current release, download it at the beginning of the test suite, and add the file to .gitignore. I can foresee some edge cases in this approach too, but it looks like a small and totally reversible change that will unblock the real value - adding and testing more operations.

@ordicker
Contributor Author

ordicker commented May 7, 2023

100% agreed with you.
A GitHub release sounds good to me.
Could you help me set it up?

@dfdx
Collaborator

dfdx commented May 7, 2023

I mean using a release of the onnx repo. That is, instead of putting the tar file into ONNX.jl, use something like this:

const ONNX_RELEASE_URL = "https://github.com/onnx/onnx/archive/refs/tags/v1.14.0.tar.gz"

onnx_release_tar_path = dirname(@__FILE__) * "/tests/backend_data.tar.gz"
onnx_release_path = dirname(@__FILE__) * "/tests/data/"
if !isdir(onnx_release_path)  # the target is a directory, so check with isdir; only download and extract on the first run
    download(ONNX_RELEASE_URL, onnx_release_tar_path)
    extract(onnx_release_tar_path, onnx_release_path)
end
...

@ordicker
Contributor Author

ordicker commented May 8, 2023

Well, we could do that, but it isn't as if the files are just sitting in the repo for us to access.
They are generated using backend-test-tools generate-data. Do we want to generate the files every time? (It takes a couple of minutes.)

@dfdx
Collaborator

dfdx commented May 8, 2023

Oh, I didn't realize that. Still, I believe we can generate the files during the first run of the tests and keep the result in that directory.

@ordicker
Contributor Author

ordicker commented May 8, 2023

Can I assume that everyone has Python?
And I think installing protobuf was a bit painful.

@dfdx
Collaborator

dfdx commented May 8, 2023

Ouch. Now I see your point. I agree then that putting a single tar into the repo is fine; let's do it and come back to the question when we want to update it.

@ToucheSir if you are ok with this approach, I will merge this PR.

@ToucheSir
Member

I can see why there was an argument made for using artifacts now, thanks for your patience in walking through this @ordicker. I'm fine with the current approach as long as we put a pin in finding something better when it comes time to update the tarball.

@dfdx
Collaborator

dfdx commented May 8, 2023

Sounds good. Let's add the unpacked files to .gitignore and we are good to go.

@ordicker
Contributor Author

ordicker commented May 9, 2023

Sure thing @ToucheSir, but in the process you convinced me that adding a tarball to a repo is bad practice.
As a temporary solution, I suggest using another repo's release (for example).

If you agree, I'll do:

  1. tmp cleanup (of the extracted files).
  2. Download the tarball (if it doesn't already exist) from another repo (mine or another); see the sketch after this list.
  3. Add the tarball to .gitignore.
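
A minimal sketch of steps 1-2 (the URL, file names, and use of Downloads.jl/Tar.jl here are assumptions for illustration, not necessarily what the final commit does):

using Downloads, Tar

# Hypothetical release asset holding the uncompressed test-data tarball.
const TARBALL_URL = "https://example.com/backend_data.tar"

tar_path = joinpath(@__DIR__, "backend_data.tar")
isfile(tar_path) || Downloads.download(TARBALL_URL, tar_path)  # download only if missing

# Tar.extract with no target dir unpacks into a fresh temporary directory and returns its path,
# so the extracted files never pollute the repo and can be removed after the tests.
data_dir = Tar.extract(tar_path)
try
    prefix = joinpath(data_dir, "data", "node")
    # ... run the backend test sets against `prefix` ...
finally
    rm(data_dir; recursive=true, force=true)
end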

@ordicker
Contributor Author

Bump @ToucheSir @dfdx

@dfdx
Collaborator

dfdx commented May 17, 2023

Oh, sorry, I didn't realize you were awaiting confirmation 🤦 Yes, the plan sounds great to me.

@ordicker
Contributor Author

I have done it...
If it looks good to you, then I think we can merge it.

@ordicker
Contributor Author

So, are we done? @ToucheSir @dfdx
I can’t merge it myself

dfdx merged commit 88046fe into FluxML:master on May 20, 2023
@dfdx
Collaborator

dfdx commented May 20, 2023

Damn! I missed it again! Sorry for that - I've been working in an extreme multitasking mode for the past few months, so if I don't reply quickly, don't hesitate to ping me again.

The code looks good to me. I've also checked the artifact itself and verified that the tests don't expose any obvious remote code execution capabilities. Thank you for all your patience with us!

@ordicker
Contributor Author

No need to apologize, the day job is the worst 🤪

Now my plan is to add one operator at a time, and hopefully the community will join in.
